v1v2v3v4 (latest)

Prioritized Experience Replay

18 November 2015

David Silver

Papers citing "Prioritized Experience Replay"

50 / 1,454 papers shown

Title
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States Peter Wolf Karl Kurzer Tobias Wingert Florian Kuhnt Johann Marius Zöllner 60 56 0 10 Sep 2018
Neural Guided Constraint Logic Programming for Program Synthesis Lisa Zhang Gregory Rosenblatt Ethan Fetaya Renjie Liao William E. Byrd M. Might R. Urtasun R. Zemel NAI 147 30 0 08 Sep 2018
Challenges of Context and Time in Reinforcement Learning: Introducing Space Fortress as a Benchmark Akshat Agarwal Ryan Hope Katia Sycara OffRL 34 9 0 06 Sep 2018
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay Sameera Lanka Tianfu Wu 66 30 0 06 Sep 2018
Sample-Efficient Imitation Learning via Generative Adversarial Nets Lionel Blondé Alexandros Kalousis GAN 67 47 0 06 Sep 2018
Goal-oriented Dialogue Policy Learning from Failures Keting Lu Shiqi Zhang Xiaoping Chen OffRL 46 29 0 20 Aug 2018
Reinforcement Learning for Autonomous Defence in Software-Defined Networking Yi Han Benjamin I. P. Rubinstein Tamas Abraham T. Alpcan O. Vel S. Erfani David Hubczenko C. Leckie Paul Montague AAML 55 69 0 17 Aug 2018
Small Sample Learning in Big Data Era Jun Shu Zongben Xu Deyu Meng 108 72 0 14 Aug 2018
A Survey of Machine and Deep Learning Methods for Internet of Things (IoT) Security M. Al-garadi Amr M. Mohamed A. Al-Ali Xiaojiang Du Mohsen Guizani 98 834 0 29 Jul 2018
FuzzerGym: A Competitive Framework for Fuzzing and Learning W. Drozd Michael D. Wagner 66 33 0 19 Jul 2018
Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms P. Tavallali G. Doran L. Mandrake 28 0 0 16 Jul 2018
Remember and Forget for Experience Replay G. Novati Petros Koumoutsakos OffRL 108 92 0 16 Jul 2018
Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem Hugo Penedones Damien Vincent Hartmut Maennel Sylvain Gelly Timothy A. Mann André Barreto AAML 38 7 0 09 Jul 2018
Memory Augmented Policy Optimization for Program Synthesis and Semantic Parsing Chen Liang Mohammad Norouzi Jonathan Berant Quoc V. Le Ni Lao 134 134 0 06 Jul 2018
Goal-oriented Trajectories for Efficient Exploration Fabio Pardo Vitaly Levdik Petar Kormushev 33 2 0 05 Jul 2018
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization Xiangxiang Chu 105 9 0 02 Jul 2018
Learning to Drive in a Day Alex Kendall Jeffrey Hawke David Janz Przemyslaw Mazur Daniele Reda John M. Allen Vinh-Dieu Lam Alex Bewley Amar Shah 115 659 0 01 Jul 2018
Deictic Image Maps: An Abstraction For Learning Pose Invariant Manipulation Policies Robert Platt Colin Kohler Marcus Gualtieri 93 11 0 26 Jun 2018
Q-DeckRec: A Fast Deck Recommendation System for Collectible Card Games Zhengxing Chen Chris Amato Truong-Huy D. Nguyen Seth Cooper Yizhou Sun M. S. El-Nasr BDL 61 40 0 26 Jun 2018
Learning-to-Ask: Knowledge Acquisition via 20 Questions Yihong Chen B. Chen Xuguang Duan Jian-Guang Lou Yue Wang Wenwu Zhu Yong Cao 54 15 0 22 Jun 2018
RUDDER: Return Decomposition for Delayed Rewards Jose A. Arjona-Medina Michael Gillhofer Michael Widrich Thomas Unterthiner Johannes Brandstetter Sepp Hochreiter 130 222 0 20 Jun 2018
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation J. Matas Stephen James Andrew J. Davison AI4CE 77 361 0 20 Jun 2018
Evolving simple programs for playing Atari games Dennis G. Wilson Sylvain Cussat-Blanc H. Luga J. Miller 74 62 0 14 Jun 2018
Self-Imitation Learning Junhyuk Oh Yijie Guo Satinder Singh Honglak Lee SSL 88 251 0 14 Jun 2018
Implicit Quantile Networks for Distributional Reinforcement Learning Will Dabney Georg Ostrovski David Silver Rémi Munos OffRL 174 535 0 14 Jun 2018
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network Wenjia Meng Qian Zheng L. Yang Pengfei Li Gang Pan 45 21 0 14 Jun 2018
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains Yangchen Pan M. Zaheer Adam White Andrew Patterson Martha White 80 46 0 12 Jun 2018
Multi-Agent Deep Reinforcement Learning with Human Strategies Thanh Nguyen Ngoc Duy Nguyen S. Nahavandi 82 12 0 12 Jun 2018
Deep Curiosity Loops in Social Environments Jonatan Barkan Goren Gordon 33 2 0 10 Jun 2018
Learning to Search in Long Documents Using Document Structure Mor Geva Jonathan Berant RALM 80 15 0 09 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning Ian Osband John Aslanides Albin Cassirer UQCV BDL 97 380 0 08 Jun 2018
Program Synthesis Through Reinforcement Learning Guided Tree Search Riley Simmons-Edler Anders Miltner Sebastian Seung 131 11 0 08 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions Fernando Martínez-Plumed S. Avin Miles Brundage Allan Dafoe Seán Ó hÉigeartaigh José Hernández-Orallo 64 3 0 02 Jun 2018
Deep Curiosity Search: Intra-Life Exploration Can Improve Performance on Challenging Deep Reinforcement Learning Problems C. Stanton Jeff Clune LRM 62 41 0 01 Jun 2018
Sequential Attacks on Agents for Long-Term Adversarial Goals E. Tretschk Seong Joon Oh Mario Fritz OnRL 409 48 1 31 May 2018
Transfer Learning for Related Reinforcement Learning Tasks via Image-to-Image Translation Shani Gamrian Yoav Goldberg 124 108 0 31 May 2018
Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update Su Young Lee Sung-Ik Choi Sae-Young Chung BDL 79 75 0 31 May 2018
Observe and Look Further: Achieving Consistent Performance on Atari Tobias Pohlen Bilal Piot Todd Hester M. G. Azar Dan Horgan ... John Quan Mel Vecerík Matteo Hessel Rémi Munos Olivier Pietquin 68 121 0 29 May 2018
Bayesian Inference with Anchored Ensembles of Neural Networks, and Application to Exploration in Reinforcement Learning Tim Pearce Nicolas Anastassacos Mohamed H. Zaki A. Neely BDL UQCV 83 17 0 29 May 2018
Meta-Gradient Reinforcement Learning Zhongwen Xu H. V. Hasselt David Silver 117 327 0 24 May 2018
Deep Reinforcement Learning For Sequence to Sequence Models Yaser Keneshloo Tian Shi Naren Ramakrishnan Chandan K. Reddy AIMat 3DV OffRL 92 211 0 24 May 2018
Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training Boyu Chen Wenlian Lu Ernest Fokoue 52 1 0 22 May 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning Elad Sarafian Aviv Tamar Sarit Kraus OffRL 60 11 0 20 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning Yinlam Chow Ofir Nachum Edgar A. Duénez-Guzmán Mohammad Ghavamzadeh 165 510 0 20 May 2018
Episodic Memory Deep Q-Networks Zichuan Lin Tianqi Zhao Guangwen Yang Lintao Zhang OffRL 61 87 0 19 May 2018
Learning Permutations with Sinkhorn Policy Gradient Patrick Emami Sanjay Ranka 65 55 0 18 May 2018
Language Expansion In Text-Based Games Ghulam Ahmed Ansari P. SagarJ. A. Chandar Balaraman Ravindran LLMAG 37 8 0 17 May 2018
Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning Tharindu Fernando Simon Denman Sridha Sridharan Clinton Fookes GAN 80 26 0 13 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes T. P. Le Ngo Anh Vien Abu Layek TaeChoong Chung 53 52 0 11 May 2018
Deep Reinforcement Learning for Optimal Control of Space Heating Ádám Nagy H. Kazmi Farah Cheaib Johan Driesen AI4CE 52 45 0 10 May 2018