Direct Advantage Estimation

13 September 2021

Papers citing "Direct Advantage Estimation"

7 / 7 papers shown

Title
Sequential Causal Imitation Learning with Unobserved Confounders D. Kumor Junzhe Zhang Elias Bareinboim CML 21 39 0 12 Aug 2022
Towards Causal Representation Learning Bernhard Schölkopf Francesco Locatello Stefan Bauer Nan Rosemary Ke Nal Kalchbrenner Anirudh Goyal Yoshua Bengio OOD CML AI4CE 84 320 0 22 Feb 2021
Conservative Q-Learning for Offline Reinforcement Learning Aviral Kumar Aurick Zhou George Tucker Sergey Levine OffRL OnRL 80 1,780 0 08 Jun 2020
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures L. Espeholt Hubert Soyer Rémi Munos Karen Simonyan Volodymyr Mnih ... Vlad Firoiu Tim Harley Iain Dunning Shane Legg Koray Kavukcuoglu 125 1,584 0 05 Feb 2018
Safe and Efficient Off-Policy Reinforcement Learning Rémi Munos T. Stepleton Anna Harutyunyan Marc G. Bellemare OffRL 103 611 0 08 Jun 2016
Asynchronous Methods for Deep Reinforcement Learning Volodymyr Mnih Adria Puigdomenech Badia M. Berk Mirza Alex Graves Timothy Lillicrap Tim Harley David Silver Koray Kavukcuoglu 148 8,805 0 04 Feb 2016
The Arcade Learning Environment: An Evaluation Platform for General Agents Marc G. Bellemare Yavar Naddaf J. Veness Michael Bowling 50 2,992 0 19 Jul 2012