An Introduction to Deep Reinforcement Learning

30 November 2018

Vincent François-Lavet

Papers citing "An Introduction to Deep Reinforcement Learning"


Title | Authors | Tags | Counts | Date
Learning Continuous Control Policies by Stochastic Value Gradients | N. Heess, Greg Wayne, David Silver, Timothy Lillicrap, Yuval Tassa, Tom Erez | - | 90 560 0 | 30 Oct 2015
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning | S. Mohamed, Danilo Jimenez Rezende | DRL SSL | 54 400 0 | 29 Sep 2015
Deep Reinforcement Learning with Double Q-learning | H. V. Hasselt, A. Guez, David Silver | OffRL | 134 7,590 0 | 22 Sep 2015
Recurrent Reinforcement Learning: A Hybrid Approach | Xiujun Li, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, Ji He | OffRL | 30 77 0 | 10 Sep 2015
Continuous control with deep reinforcement learning | Timothy Lillicrap, Jonathan J. Hunt, Alexander Pritzel, N. Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra | - | 210 13,174 0 | 09 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games | Junhyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard L. Lewis, Satinder Singh | - | 85 852 0 | 31 Jul 2015
Deep Recurrent Q-Learning for Partially Observable MDPs | Matthew J. Hausknecht, Peter Stone | - | 97 1,668 0 | 23 Jul 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models | Bradly C. Stadie, Sergey Levine, Pieter Abbeel | - | 76 502 0 | 03 Jul 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images | Manuel Watter, Jost Tobias Springenberg, Joschka Boedecker, Martin Riedmiller | BDL | 50 839 0 | 24 Jun 2015
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning | Y. Gal, Zoubin Ghahramani | UQCV BDL | 476 9,233 0 | 06 Jun 2015
End-to-End Training of Deep Visuomotor Policies | Sergey Levine, Chelsea Finn, Trevor Darrell, Pieter Abbeel | BDL | 235 3,418 0 | 02 Apr 2015
Trust Region Policy Optimization | John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel | - | 245 6,722 0 | 19 Feb 2015
Towards Biologically Plausible Deep Learning | Yoshua Bengio, Dong-Hyun Lee, J. Bornschein, Thomas Mesnard, Zhouhan Lin | DRL OOD | 54 349 0 | 14 Feb 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift | Sergey Ioffe, Christian Szegedy | OOD | 354 43,154 0 | 11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention | Ke Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio | DiffM | 286 10,034 0 | 10 Feb 2015
From Pixels to Torques: Policy Learning with Deep Dynamical Models | Niklas Wahlström, Thomas B. Schon, M. Deisenroth | - | 48 189 0 | 08 Feb 2015
Neural Turing Machines | Alex Graves, Greg Wayne, Ivo Danihelka | - | 81 2,318 0 | 20 Oct 2014
ImageNet Large Scale Visual Recognition Challenge | Olga Russakovsky, Jia Deng, Hao Su, J. Krause, S. Satheesh, ..., A. Karpathy, A. Khosla, Michael S. Bernstein, Alexander C. Berg, Li Fei-Fei | VLM ObjD | 1.1K 39,383 0 | 01 Sep 2014
Changing the Environment Based on Empowerment as Intrinsic Motivation | Christoph Salge, C. Glackin, Daniel Polani | - | 38 67 0 | 03 Jun 2014
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning | R. Ortner, Odalric-Ambrym Maillard, D. Ryabko | - | 131 27 0 | 12 May 2014
Deep Learning in Neural Networks: An Overview | Jürgen Schmidhuber | HAI | 179 16,311 0 | 30 Apr 2014
Hierarchical Solution of Markov Decision Processes using Macro-actions | Milos Hauskrecht, Nicolas Meuleau, L. Kaelbling, T. Dean, Craig Boutilier | - | 52 328 0 | 30 Jan 2013
Model-Based Bayesian Exploration | R. Dearden, N. Friedman, D. Andre | - | 72 288 0 | 23 Jan 2013
The Arcade Learning Environment: An Evaluation Platform for General Agents | Marc G. Bellemare, Yavar Naddaf, J. Veness, Michael Bowling | - | 82 2,992 0 | 19 Jul 2012
Learning Parameterized Skills | Bruno C. da Silva, George Konidaris, A. Barto | - | 94 207 0 | 27 Jun 2012
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods | Gergely Neu, Csaba Szepesvári | - | 52 244 0 | 20 Jun 2012
Planning to Be Surprised: Optimal Bayesian Exploration in Dynamic Environments | Yi Sun, Faustino J. Gomez, Jürgen Schmidhuber | - | 73 163 0 | 29 Mar 2011
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms | Lihong Li, Wei Chu, John Langford, Xuanhui Wang | OffRL | 152 574 0 | 31 Mar 2010