arXiv:2109.06093
Direct Advantage Estimation
13 September 2021
Hsiao-Ru Pan
Nico Gürtler
Alexander Neitz
Bernhard Schölkopf
OffRL
CML
Papers citing "Direct Advantage Estimation"
7 / 7 papers shown
Title
Sequential Causal Imitation Learning with Unobserved Confounders
D. Kumor
Junzhe Zhang
Elias Bareinboim
CML
21
39
0
12 Aug 2022
Towards Causal Representation Learning
Bernhard Schölkopf
Francesco Locatello
Stefan Bauer
Nan Rosemary Ke
Nal Kalchbrenner
Anirudh Goyal
Yoshua Bengio
OOD
CML
AI4CE
84
320
0
22 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
80
1,780
0
08 Jun 2020
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
125
1,584
0
05 Feb 2018
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
103
611
0
08 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
148
8,805
0
04 Feb 2016
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
50
2,992
0
19 Jul 2012