ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.12560
  4. Cited By
An Introduction to Deep Reinforcement Learning

An Introduction to Deep Reinforcement Learning

30 November 2018
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "An Introduction to Deep Reinforcement Learning"

28 / 178 papers shown
Title
Learning Continuous Control Policies by Stochastic Value Gradients
Learning Continuous Control Policies by Stochastic Value Gradients
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
90
560
0
30 Oct 2015
Variational Information Maximisation for Intrinsically Motivated
  Reinforcement Learning
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
54
400
0
29 Sep 2015
Deep Reinforcement Learning with Double Q-learning
Deep Reinforcement Learning with Double Q-learning
H. V. Hasselt
A. Guez
David Silver
OffRL
134
7,590
0
22 Sep 2015
Recurrent Reinforcement Learning: A Hybrid Approach
Recurrent Reinforcement Learning: A Hybrid Approach
Xiujun Li
Lihong Li
Jianfeng Gao
Xiaodong He
Jianshu Chen
Li Deng
Ji He
OffRL
30
77
0
10 Sep 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
210
13,174
0
09 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
85
852
0
31 Jul 2015
Deep Recurrent Q-Learning for Partially Observable MDPs
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
97
1,668
0
23 Jul 2015
Incentivizing Exploration In Reinforcement Learning With Deep Predictive
  Models
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
Bradly C. Stadie
Sergey Levine
Pieter Abbeel
76
502
0
03 Jul 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control
  from Raw Images
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
50
839
0
24 Jun 2015
Dropout as a Bayesian Approximation: Representing Model Uncertainty in
  Deep Learning
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
476
9,233
0
06 Jun 2015
End-to-End Training of Deep Visuomotor Policies
End-to-End Training of Deep Visuomotor Policies
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
235
3,418
0
02 Apr 2015
Trust Region Policy Optimization
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
245
6,722
0
19 Feb 2015
Towards Biologically Plausible Deep Learning
Towards Biologically Plausible Deep Learning
Yoshua Bengio
Dong-Hyun Lee
J. Bornschein
Thomas Mesnard
Zhouhan Lin
DRL
OOD
54
349
0
14 Feb 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
354
43,154
0
11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
286
10,034
0
10 Feb 2015
From Pixels to Torques: Policy Learning with Deep Dynamical Models
From Pixels to Torques: Policy Learning with Deep Dynamical Models
Niklas Wahlström
Thomas B. Schon
M. Deisenroth
48
189
0
08 Feb 2015
Neural Turing Machines
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
81
2,318
0
20 Oct 2014
ImageNet Large Scale Visual Recognition Challenge
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.1K
39,383
0
01 Sep 2014
Changing the Environment Based on Empowerment as Intrinsic Motivation
Changing the Environment Based on Empowerment as Intrinsic Motivation
Christoph Salge
C. Glackin
Daniel Polani
38
67
0
03 Jun 2014
Selecting Near-Optimal Approximate State Representations in
  Reinforcement Learning
Selecting Near-Optimal Approximate State Representations in Reinforcement Learning
R. Ortner
Odalric-Ambrym Maillard
D. Ryabko
131
27
0
12 May 2014
Deep Learning in Neural Networks: An Overview
Deep Learning in Neural Networks: An Overview
Jürgen Schmidhuber
HAI
179
16,311
0
30 Apr 2014
Hierarchical Solution of Markov Decision Processes using Macro-actions
Hierarchical Solution of Markov Decision Processes using Macro-actions
Milos Hauskrecht
Nicolas Meuleau
L. Kaelbling
T. Dean
Craig Boutilier
52
328
0
30 Jan 2013
Model-Based Bayesian Exploration
Model-Based Bayesian Exploration
R. Dearden
N. Friedman
D. Andre
72
288
0
23 Jan 2013
The Arcade Learning Environment: An Evaluation Platform for General
  Agents
The Arcade Learning Environment: An Evaluation Platform for General Agents
Marc G. Bellemare
Yavar Naddaf
J. Veness
Michael Bowling
82
2,992
0
19 Jul 2012
Learning Parameterized Skills
Learning Parameterized Skills
Bruno C. da Silva
George Konidaris
A. Barto
94
207
0
27 Jun 2012
Apprenticeship Learning using Inverse Reinforcement Learning and
  Gradient Methods
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods
Gergely Neu
Csaba Szepesvári
52
244
0
20 Jun 2012
Planning to Be Surprised: Optimal Bayesian Exploration in Dynamic
  Environments
Planning to Be Surprised: Optimal Bayesian Exploration in Dynamic Environments
Yi Sun
Faustino J. Gomez
Jürgen Schmidhuber
73
163
0
29 Mar 2011
Unbiased Offline Evaluation of Contextual-bandit-based News Article
  Recommendation Algorithms
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
152
574
0
31 Mar 2010
Previous
1234