ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.12560
  4. Cited By
An Introduction to Deep Reinforcement Learning

An Introduction to Deep Reinforcement Learning

30 November 2018
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
    OffRL
    AI4CE
ArXivPDFHTML

Papers citing "An Introduction to Deep Reinforcement Learning"

50 / 178 papers shown
Title
Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning
Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning
Thomas Budiarjo
Santana Yuda Pradata
Kadek Gemilang Santiyuda
Muhammad Alfian Amrizal
Reza Pulungan
Hiroyuki Takizawa
71
0
0
27 Feb 2025
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Learning more with the same effort: how randomization improves the robustness of a robotic deep reinforcement learning agent
Lucía Güitta-López
Jaime Boal
Álvaro J. López-López
78
5
0
24 Jan 2025
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
65
1
0
27 Oct 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
149
2
0
07 Jun 2024
Large Language Models for Cyber Security: A Systematic Literature Review
Large Language Models for Cyber Security: A Systematic Literature Review
HanXiang Xu
Shenao Wang
Ningke Li
Kaidi Wang
Yanjie Zhao
Kai Chen
Ting Yu
Yang Liu
Haoyu Wang
76
33
0
08 May 2024
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
102
5
0
13 Dec 2023
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
59
74
0
16 Jun 2020
Two Decades of AI4NETS-AI/ML for Data Networks: Challenges & Research
  Directions
Two Decades of AI4NETS-AI/ML for Data Networks: Challenges & Research Directions
P. Casas
GNN
48
8
0
03 Mar 2020
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
44
139
0
01 Nov 2018
Exploration by Random Network Distillation
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
95
1,310
0
30 Oct 2018
Social Influence as Intrinsic Motivation for Multi-Agent Deep
  Reinforcement Learning
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
Natasha Jaques
Angeliki Lazaridou
Edward Hughes
Çağlar Gülçehre
Pedro A. Ortega
D. Strouse
Joel Z Leibo
Nando de Freitas
48
57
0
19 Oct 2018
Deep Reinforcement Learning
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
94
144
0
15 Oct 2018
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with
  Real World Experience
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience
Yevgen Chebotar
Ankur Handa
Viktor Makoviychuk
Miles Macklin
J. Issac
Nathan D. Ratliff
Dieter Fox
66
503
0
12 Oct 2018
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
T. Paine
Sergio Gomez Colmenarejo
Ziyun Wang
Scott E. Reed
Y. Aytar
...
Matthew W. Hoffman
Gabriel Barth-Maron
Serkan Cabi
David Budden
Nando de Freitas
OffRL
40
25
0
11 Oct 2018
Episodic Curiosity through Reachability
Episodic Curiosity through Reachability
Nikolay Savinov
Anton Raichuk
Raphaël Marinier
Damien Vincent
Marc Pollefeys
Timothy Lillicrap
Sylvain Gelly
39
267
0
04 Oct 2018
Combined Reinforcement Learning via Abstract Representations
Combined Reinforcement Learning via Abstract Representations
Vincent François-Lavet
Yoshua Bengio
Doina Precup
Joelle Pineau
OffRL
47
89
0
12 Sep 2018
Multi-task Deep Reinforcement Learning with PopArt
Multi-task Deep Reinforcement Learning with PopArt
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
86
316
0
12 Sep 2018
Unity: A General Platform for Intelligent Agents
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
74
813
0
07 Sep 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic
  Manipulation
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
98
1,454
0
27 Jun 2018
A Dissection of Overfitting and Generalization in Continuous
  Reinforcement Learning
A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning
Amy Zhang
Nicolas Ballas
Joelle Pineau
CLL
OffRL
52
177
0
20 Jun 2018
Playing hard exploration games by watching YouTube
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
57
270
0
29 May 2018
Decoupling Dynamics and Reward for Transfer Learning
Decoupling Dynamics and Reward for Transfer Learning
Amy Zhang
Harsh Satija
Joelle Pineau
OOD
35
72
0
27 Apr 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
70
798
0
27 Apr 2018
A Study on Overfitting in Deep Reinforcement Learning
A Study on Overfitting in Deep Reinforcement Learning
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
OffRL
OnRL
48
386
0
18 Apr 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
68
446
0
28 Feb 2018
An Analysis of Categorical Distributional Reinforcement Learning
An Analysis of Categorical Distributional Reinforcement Learning
Mark Rowland
Marc G. Bellemare
Will Dabney
Rémi Munos
Yee Whye Teh
44
101
0
22 Feb 2018
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Nick Haber
Damian Mrowca
Li Fei-Fei
Daniel L. K. Yamins
LRM
55
117
0
21 Feb 2018
The Malicious Use of Artificial Intelligence: Forecasting, Prevention,
  and Mitigation
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation
Miles Brundage
S. Avin
Jack Clark
H. Toner
P. Eckersley
...
Owain Evans
Michael Page
Joanna J. Bryson
Roman V. Yampolskiy
Dario Amodei
71
698
0
20 Feb 2018
Learning Robust Dialog Policies in Noisy Environments
Learning Robust Dialog Policies in Noisy Environments
Maryam Fazel-Zarandi
Shang-Wen Li
Jin Cao
Jared Casale
Peter Henderson
David Whitney
A. Geramifard
65
24
0
11 Dec 2017
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep
  Reinforcement Learning
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
Gregory Farquhar
Tim Rocktaschel
Maximilian Igl
Shimon Whiteson
OffRL
54
71
0
31 Oct 2017
Distributional Reinforcement Learning with Quantile Regression
Distributional Reinforcement Learning with Quantile Regression
Will Dabney
Mark Rowland
Marc G. Bellemare
Rémi Munos
77
756
0
27 Oct 2017
Asymmetric Actor Critic for Image-Based Robot Learning
Asymmetric Actor Critic for Image-Based Robot Learning
Lerrel Pinto
Marcin Andrychowicz
Peter Welinder
Wojciech Zaremba
Pieter Abbeel
OffRL
47
367
0
18 Oct 2017
Rainbow: Combining Improvements in Deep Reinforcement Learning
Rainbow: Combining Improvements in Deep Reinforcement Learning
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
94
2,255
0
06 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
40
267
0
28 Sep 2017
The Consciousness Prior
The Consciousness Prior
Yoshua Bengio
DRL
AI4CE
37
229
0
25 Sep 2017
On overfitting and asymptotic bias in batch reinforcement learning with
  partial observability
On overfitting and asymptotic bias in batch reinforcement learning with partial observability
Vincent François-Lavet
Guillaume Rabusseau
Joelle Pineau
D. Ernst
R. Fonteneau
OffRL
32
33
0
22 Sep 2017
Deep Reinforcement Learning that Matters
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
108
1,940
0
19 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
63
549
0
18 Sep 2017
Learning with Opponent-Learning Awareness
Learning with Opponent-Learning Awareness
Jakob N. Foerster
Richard Y. Chen
Maruan Al-Shedivat
Shimon Whiteson
Pieter Abbeel
Igor Mordatch
74
536
0
13 Sep 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
53
868
0
16 Aug 2017
Benchmark Environments for Multitask Learning in Continuous Domains
Benchmark Environments for Multitask Learning in Continuous Domains
Peter Henderson
Wei-Di Chang
Florian Shkurti
Johanna Hansen
David Meger
Gregory Dudek
29
40
0
14 Aug 2017
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for
  Continuous Control
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
Riashat Islam
Peter Henderson
Maziar Gomrokchi
Doina Precup
BDL
OffRL
42
251
0
10 Aug 2017
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with
  Model-Free Fine-Tuning
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
Anusha Nagabandi
G. Kahn
R. Fearing
Sergey Levine
76
967
0
08 Aug 2017
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
I. Higgins
Arka Pal
Andrei A. Rusu
Loic Matthey
Christopher P. Burgess
Alexander Pritzel
M. Botvinick
Charles Blundell
Alexander Lerchner
DRL
100
413
0
26 Jul 2017
A Distributional Perspective on Reinforcement Learning
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
77
1,497
0
21 Jul 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
243
18,685
0
20 Jul 2017
Imagination-Augmented Agents for Deep Reinforcement Learning
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
70
552
0
19 Jul 2017
Learning model-based planning from scratch
Learning model-based planning from scratch
Razvan Pascanu
Yujia Li
Oriol Vinyals
N. Heess
Lars Buesing
S. Racanière
David P. Reichert
T. Weber
Daan Wierstra
Peter W. Battaglia
LM&Ro
86
97
0
19 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
123
551
0
13 Jul 2017
Value Prediction Network
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
65
332
0
11 Jul 2017
1234
Next