ResearchTrend.AI
Deep Reinforcement Learning

15 October 2018
Yuxi Li
    VLM, OffRL
arXiv:1810.06339

Papers citing "Deep Reinforcement Learning"

50 / 521 papers shown
Title
Imagination-Augmented Agents for Deep Reinforcement Learning
T. Weber
S. Racanière
David P. Reichert
Lars Buesing
A. Guez
...
Razvan Pascanu
Peter W. Battaglia
Demis Hassabis
David Silver
Daan Wierstra
LM&Ro
117
558
0
19 Jul 2017
On the State of the Art of Evaluation in Neural Language Models
Gábor Melis
Chris Dyer
Phil Blunsom
71
536
0
18 Jul 2017
Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
William Saunders
Girish Sastry
Andreas Stuhlmuller
Owain Evans
OffRL
67
231
0
17 Jul 2017
Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning
J. Supančič
Deva Ramanan
OffRL
55
109
0
17 Jul 2017
Human-Level Intelligence or Animal-Like Abilities?
Adnan Darwiche
ELM, VLM
30
91
0
13 Jul 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
178
554
0
13 Jul 2017
Value Prediction Network
Junhyuk Oh
Satinder Singh
Honglak Lee
84
333
0
11 Jul 2017
Robust Imitation of Diverse Behaviors
Ziyun Wang
J. Merel
Scott E. Reed
Greg Wayne
Nando de Freitas
N. Heess
83
198
0
10 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
206
938
0
07 Jul 2017
Learning human behaviors from motion capture by adversarial imitation
J. Merel
Yuval Tassa
TB Dhruva
S. Srinivasan
Jay Lemmon
Ziyun Wang
Greg Wayne
N. Heess
GAN
70
202
0
07 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
280
2,339
0
05 Jul 2017
ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games
Yuandong Tian
Qucheng Gong
Wenling Shang
Yuxin Wu
C. L. Zitnick
OffRL
61
126
0
04 Jul 2017
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
88
897
0
30 Jun 2017
Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog
Satwik Kottur
José M. F. Moura
Stefan Lee
Dhruv Batra
LLMAG
82
221
0
26 Jun 2017
Gradient Episodic Memory for Continual Learning
David Lopez-Paz
Marc'Aurelio Ranzato
VLM, CLL
131
2,738
0
26 Jun 2017
THUMT: An Open Source Toolkit for Neural Machine Translation
Jiacheng Zhang
Yanzhuo Ding
Shiqi Shen
Yong Cheng
Maosong Sun
Huanbo Luan
Yang Liu
41
10
0
20 Jun 2017
VAIN: Attentional Multi-agent Predictive Modeling
Yedid Hoshen
GNN
103
240
0
19 Jun 2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning
P. Sunehag
Guy Lever
A. Gruslys
Wojciech M. Czarnecki
V. Zambaldi
...
Marc Lanctot
Nicolas Sonnerat
Joel Z Leibo
K. Tuyls
T. Graepel
73
1,013
0
16 Jun 2017
One Model To Learn Them All
Lukasz Kaiser
Aidan Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
VLM, ViT
80
334
0
16 Jun 2017
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
M. Lewis
Denis Yarats
Yann N. Dauphin
Devi Parikh
Dhruv Batra
LLMAG
101
415
0
16 Jun 2017
An Overview of Multi-Task Learning in Deep Neural Networks
Sebastian Ruder
CVBM
159
2,831
0
15 Jun 2017
Expected Policy Gradients
K. Ciosek
Shimon Whiteson
72
58
0
15 Jun 2017
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics
Ken Kansky
Tom Silver
David A. Mély
Mohamed Eldawy
Miguel Lazaro-Gredilla
Xinghua Lou
N. Dorfman
Szymon Sidor
Scott Phoenix
Dileep George
AI4CE
93
236
0
14 Jun 2017
Hybrid Reward Architecture for Reinforcement Learning
H. V. Seijen
Mehdi Fatemi
Joshua Romoff
Romain Laroche
Tavian Barnes
Jeffrey Tsang
63
253
0
13 Jun 2017
Device Placement Optimization with Reinforcement Learning
Azalia Mirhoseini
Hieu H. Pham
Quoc V. Le
Benoit Steiner
Rasmus Larsen
Yuefeng Zhou
Naveen Kumar
Mohammad Norouzi
Samy Bengio
J. Dean
85
443
0
13 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
798
132,454
0
12 Jun 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
218
3,377
0
12 Jun 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan J. Lowe
Yi Wu
Aviv Tamar
J. Harb
Pieter Abbeel
Igor Mordatch
162
4,509
0
07 Jun 2017
Visual Interaction Networks
Nicholas Watters
Andrea Tacchetti
T. Weber
Razvan Pascanu
Peter W. Battaglia
Daniel Zoran
PINN, 3DH
107
279
0
05 Jun 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN, NAI
189
1,615
0
05 Jun 2017
On Unifying Deep Generative Models
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Eric Xing
DRL, GAN
106
127
0
02 Jun 2017
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning
S. Gu
Timothy Lillicrap
Zoubin Ghahramani
Richard Turner
Bernhard Schölkopf
Sergey Levine
OffRL
84
165
0
01 Jun 2017
End-to-End Differentiable Proving
Tim Rocktaschel
Sebastian Riedel
NAI
108
382
0
31 May 2017
The Cramer Distance as a Solution to Biased Wasserstein Gradients
Marc G. Bellemare
Ivo Danihelka
Will Dabney
S. Mohamed
Balaji Lakshminarayanan
Stephan Hoyer
Rémi Munos
GAN
85
344
0
30 May 2017
IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models
Jun Wang
Lantao Yu
Weinan Zhang
Yu Gong
Yinghui Xu
Benyou Wang
Peng Zhang
Dell Zhang
87
602
0
30 May 2017
Federated Multi-Task Learning
Virginia Smith
Chao-Kai Chiang
Maziar Sanjabi
Ameet Talwalkar
FedML
159
1,814
0
30 May 2017
Good Semi-supervised Learning that Requires a Bad GAN
Zihang Dai
Zhilin Yang
Fan Yang
William W. Cohen
Ruslan Salakhutdinov
GAN
75
484
0
27 May 2017
Predictive State Recurrent Neural Networks
Carlton Downey
Ahmed S. Hefny
Boyue Li
Byron Boots
Geoffrey J. Gordon
AI4TS
74
57
0
25 May 2017
Counterfactual Multi-Agent Policy Gradients
Jakob N. Foerster
Gregory Farquhar
Triantafyllos Afouras
Nantas Nardelli
Shimon Whiteson
151
2,090
0
24 May 2017
Safe Model-based Reinforcement Learning with Stability Guarantees
Felix Berkenkamp
M. Turchetta
Angela P. Schoellig
Andreas Krause
191
853
0
23 May 2017
Infrastructure for Usable Machine Learning: The Stanford DAWN Project
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
AI4TS
46
28
0
22 May 2017
Model-Based Planning with Discrete and Continuous Actions
Mikael Henaff
William F. Whitney
Yann LeCun
81
16
0
19 May 2017
Repeated Inverse Reinforcement Learning
Kareem Amin
Nan Jiang
Satinder Singh
123
76
0
15 May 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM, SSL
125
2,453
0
15 May 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
208
1,559
0
11 May 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
171
3,290
0
08 May 2017
Safe and Nested Subgame Solving for Imperfect-Information Games
Noam Brown
Tuomas Sandholm
100
183
0
08 May 2017
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood
Kelvin Guu
Panupong Pasupat
Emmy Liu
Percy Liang
80
190
0
25 Apr 2017
Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car
Mariusz Bojarski
Philip Yeres
A. Choromańska
K. Choromanski
Bernhard Firner
L. Jackel
Urs Muller
79
401
0
25 Apr 2017
Equivalence Between Policy Gradients and Soft Q-Learning
John Schulman
Xi Chen
Pieter Abbeel
OffRL
113
349
0
21 Apr 2017