1906.09510
Learning Belief Representations for Imitation Learning in POMDPs
22 June 2019
Tanmay Gangwani
Joel Lehman
Qiang Liu
Jian Peng
Papers citing
"Learning Belief Representations for Imitation Learning in POMDPs"
31 / 31 papers shown
Title
Neural Predictive Belief Representations
Z. Guo
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Rémi Munos
SSL
40
80
0
15 Nov 2018
Recurrent World Models Facilitate Policy Evolution
David R Ha
Jürgen Schmidhuber
SyDa
TPM
109
930
0
04 Sep 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
233
10,152
0
10 Jul 2018
Temporal Difference Variational Auto-Encoder
Karol Gregor
George Papamakarios
F. Besse
Lars Buesing
Theophane Weber
DRL
40
127
0
08 Jun 2018
Deep Variational Reinforcement Learning for POMDPs
Maximilian Igl
L. Zintgraf
T. Le
Frank Wood
Shimon Whiteson
BDL
OffRL
56
260
0
06 Jun 2018
Playing Atari with Six Neurons
Giuseppe Cuccu
Julian Togelius
Philippe Cudré-Mauroux
124
43
0
04 Jun 2018
Recurrent Predictive State Policy Networks
Ahmed S. Hefny
Zita Marinho
Wen Sun
S. Srinivasa
Geoffrey J. Gordon
51
19
0
05 Mar 2018
Learning and Querying Fast Generative Models for Reinforcement Learning
Lars Buesing
T. Weber
S. Racanière
S. M. Ali Eslami
Danilo Jimenez Rezende
...
Fabio Viola
F. Besse
Karol Gregor
Demis Hassabis
Daan Wierstra
OffRL
57
134
0
08 Feb 2018
Z-Forcing: Training Stochastic Recurrent Networks
Anirudh Goyal
Alessandro Sordoni
Marc-Alexandre Côté
Nan Rosemary Ke
Yoshua Bengio
BDL
56
183
0
15 Nov 2017
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Arun Venkatraman
Nicholas Rhinehart
Wen Sun
Lerrel Pinto
M. Hebert
Byron Boots
Kris Kitani
J. Andrew Bagnell
AI4CE
60
42
0
25 Sep 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
243
18,685
0
20 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
490
129,831
0
12 Jun 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
122
3,279
0
08 May 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
45
1,225
0
16 Nov 2016
Learning to Navigate in Complex Environments
Piotr Wojciech Mirowski
Razvan Pascanu
Fabio Viola
Hubert Soyer
Andy Ballard
...
Ross Goroshin
Laurent Sifre
Koray Kavukcuoglu
D. Kumaran
R. Hadsell
72
876
0
11 Nov 2016
Learning to Act by Predicting the Future
Alexey Dosovitskiy
V. Koltun
130
280
0
06 Nov 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
114
3,089
0
10 Jun 2016
Sequential Neural Models with Stochastic Layers
Marco Fraccaro
Søren Kaae Sønderby
Ulrich Paquet
Ole Winther
BDL
100
395
0
24 May 2016
Benchmarking Deep Reinforcement Learning for Continuous Control
Yan Duan
Xi Chen
Rein Houthooft
John Schulman
Pieter Abbeel
OffRL
66
1,689
0
22 Apr 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
168
8,805
0
04 Feb 2016
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
87
852
0
31 Jul 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
97
1,668
0
23 Jul 2015
Gradient Estimation Using Stochastic Computation Graphs
John Schulman
N. Heess
T. Weber
Pieter Abbeel
OffRL
123
391
0
17 Jun 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
45
3,368
0
08 Jun 2015
Variational Inference with Normalizing Flows
Danilo Jimenez Rezende
S. Mohamed
DRL
BDL
258
4,143
0
21 May 2015
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
245
6,722
0
19 Feb 2015
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
162
6,760
0
03 Sep 2014
Reinforcement and Imitation Learning via Interactive No-Regret Learning
Stéphane Ross
J. Andrew Bagnell
OffRL
85
262
0
23 Jun 2014
Generative Adversarial Networks
Ian Goodfellow
Jean Pouget-Abadie
M. Berk Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
GAN
130
2,191
0
10 Jun 2014
Online Planning Algorithms for POMDPs
Stéphane Ross
Joelle Pineau
Sébastien Paquet
B. Chaib-draa
73
584
0
15 Jan 2014
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
166
3,196
0
02 Nov 2010