ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.06339
  4. Cited By
Deep Reinforcement Learning

Deep Reinforcement Learning

15 October 2018
Yuxi Li
    VLMOffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning"

50 / 521 papers shown
Title
Inequity aversion improves cooperation in intertemporal social dilemmas
Inequity aversion improves cooperation in intertemporal social dilemmas
Edward Hughes
Joel Z Leibo
Matthew Phillips
K. Tuyls
Edgar A. Duénez-Guzmán
...
Tina Zhu
Kevin R. McKee
Raphael Köster
H. Roff
T. Graepel
62
209
0
23 Mar 2018
Emergence of grid-like representations by training recurrent neural
  networks to perform spatial localization
Emergence of grid-like representations by training recurrent neural networks to perform spatial localization
Christopher J. Cueva
Xue-Xin Wei
50
216
0
21 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
105
1,320
0
12 Mar 2018
Compositional Attention Networks for Machine Reasoning
Compositional Attention Networks for Machine Reasoning
Drew A. Hudson
Christopher D. Manning
BDLOODLRM
193
577
0
08 Mar 2018
Transfer Learning with Neural AutoML
Transfer Learning with Neural AutoML
Catherine Wong
N. Houlsby
Yifeng Lu
Andrea Gesmundo
58
114
0
07 Mar 2018
An Empirical Evaluation of Generic Convolutional and Recurrent Networks
  for Sequence Modeling
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
DRL
97
4,845
0
04 Mar 2018
Distributed Prioritized Experience Replay
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
151
741
0
02 Mar 2018
Reinforcement Learning to Rank in E-Commerce Search Engine:
  Formalization, Analysis, and Application
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application
Yujing Hu
Qing Da
Anxiang Zeng
Yang Yu
Yinghui Xu
71
180
0
02 Mar 2018
Hierarchical Imitation and Reinforcement Learning
Hierarchical Imitation and Reinforcement Learning
Hoang Minh Le
Nan Jiang
Alekh Agarwal
Miroslav Dudík
Yisong Yue
Hal Daumé
59
192
0
01 Mar 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
116
240
0
25 Feb 2018
An Analysis of Categorical Distributional Reinforcement Learning
An Analysis of Categorical Distributional Reinforcement Learning
Mark Rowland
Marc G. Bellemare
Will Dabney
Rémi Munos
Yee Whye Teh
62
102
0
22 Feb 2018
Machine Theory of Mind
Machine Theory of Mind
Neil C. Rabinowitz
Frank Perbet
H. F. Song
Chiyuan Zhang
S. M. Ali Eslami
M. Botvinick
AI4CE
128
479
0
21 Feb 2018
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Learning to Play with Intrinsically-Motivated Self-Aware Agents
Nick Haber
Damian Mrowca
Li Fei-Fei
Daniel L. K. Yamins
LRM
69
120
0
21 Feb 2018
Meta-Reinforcement Learning of Structured Exploration Strategies
Meta-Reinforcement Learning of Structured Exploration Strategies
Abhishek Gupta
Russell Mendonca
YuXuan Liu
Pieter Abbeel
Sergey Levine
OffRL
110
349
0
20 Feb 2018
Recommendations with Negative Feedback via Pairwise Deep Reinforcement
  Learning
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning
Xiangyu Zhao
Li Zhang
Zhuoye Ding
Long Xia
Jiliang Tang
Dawei Yin
92
335
0
19 Feb 2018
Evolved Policy Gradients
Evolved Policy Gradients
Rein Houthooft
Richard Y. Chen
Phillip Isola
Bradly C. Stadie
Filip Wolski
Jonathan Ho
Pieter Abbeel
98
227
0
13 Feb 2018
Learning to Search with MCTSnets
Learning to Search with MCTSnets
A. Guez
T. Weber
Ioannis Antonoglou
Karen Simonyan
Oriol Vinyals
Daan Wierstra
Rémi Munos
David Silver
78
89
0
13 Feb 2018
Reinforcement Learning for Solving the Vehicle Routing Problem
Reinforcement Learning for Solving the Vehicle Routing Problem
M. Nazari
Afshin Oroojlooy
L. Snyder
Martin Takáč
102
908
0
12 Feb 2018
Neural Architecture Search with Bayesian Optimisation and Optimal
  Transport
Neural Architecture Search with Bayesian Optimisation and Optimal Transport
Kirthevasan Kandasamy
Willie Neiswanger
J. Schneider
Barnabás Póczós
Eric Xing
82
609
0
11 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
100
1,349
0
10 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted
  Actor-Learner Architectures
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
240
1,607
0
05 Feb 2018
One-Shot Imitation from Observing Humans via Domain-Adaptive
  Meta-Learning
One-Shot Imitation from Observing Humans via Domain-Adaptive Meta-Learning
Tianhe Yu
Chelsea Finn
Annie Xie
Sudeep Dasari
Tianhao Zhang
Pieter Abbeel
Sergey Levine
70
361
0
05 Feb 2018
Regularized Evolution for Image Classifier Architecture Search
Regularized Evolution for Image Classifier Architecture Search
Esteban Real
A. Aggarwal
Yanping Huang
Quoc V. Le
177
3,039
0
05 Feb 2018
Visual Interpretability for Deep Learning: a Survey
Visual Interpretability for Deep Learning: a Survey
Quanshi Zhang
Song-Chun Zhu
FaMLHAI
144
822
0
02 Feb 2018
Obfuscated Gradients Give a False Sense of Security: Circumventing
  Defenses to Adversarial Examples
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
Anish Athalye
Nicholas Carlini
D. Wagner
AAML
243
3,194
0
01 Feb 2018
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
Recasting Gradient-Based Meta-Learning as Hierarchical Bayes
Erin Grant
Chelsea Finn
Sergey Levine
Trevor Darrell
Thomas Griffiths
BDL
95
510
0
26 Jan 2018
Learning to Evade Static PE Machine Learning Malware Models via
  Reinforcement Learning
Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning
Hyrum S. Anderson
Anant Kharkar
Bobby Filar
David Evans
P. Roth
AAML
73
210
0
26 Jan 2018
Deep Learning for Sentiment Analysis : A Survey
Deep Learning for Sentiment Analysis : A Survey
Lei Zhang
Shuai Wang
Bing-Quan Liu
VLM
93
1,623
0
24 Jan 2018
Scalable and accurate deep learning for electronic health records
Scalable and accurate deep learning for electronic health records
A. Rajkomar
Eyal Oren
Kai Chen
Andrew M. Dai
Nissan Hajaj
...
A. Butte
M. Howell
Claire Cui
Greg S. Corrado
Jeffrey Dean
OODBDL
177
2,151
0
24 Jan 2018
DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement
  Learning Systems for Multi-Agent Dense Traffic Navigation
DeepTraffic: Crowdsourced Hyperparameter Tuning of Deep Reinforcement Learning Systems for Multi-Agent Dense Traffic Navigation
Lex Fridman
Jack Terwilliger
Benedikt Jenik
60
24
0
09 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,420
0
04 Jan 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELMLM&RoBDL
150
1,144
0
02 Jan 2018
Deep Learning: A Critical Appraisal
Deep Learning: A Critical Appraisal
G. Marcus
HAIVLM
137
1,041
0
02 Jan 2018
Boosting the Actor with Dual Critic
Boosting the Actor with Dual Critic
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
64
46
0
29 Dec 2017
Sim2Real View Invariant Visual Servoing by Recurrent Control
Sim2Real View Invariant Visual Servoing by Recurrent Control
Fereshteh Sadeghi
Alexander Toshev
Eric Jang
Sergey Levine
53
99
0
20 Dec 2017
Adversarial Examples: Attacks and Defenses for Deep Learning
Adversarial Examples: Attacks and Defenses for Deep Learning
Xiaoyong Yuan
Pan He
Qile Zhu
Xiaolin Li
SILMAAML
99
1,625
0
19 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative
  for Training Deep Neural Networks for Reinforcement Learning
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
111
693
0
18 Dec 2017
Safe Mutations for Deep and Recurrent Neural Networks through Output
  Gradients
Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
57
93
0
18 Dec 2017
Improving Exploration in Evolution Strategies for Deep Reinforcement
  Learning via a Population of Novelty-Seeking Agents
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
80
349
0
18 Dec 2017
A Berkeley View of Systems Challenges for AI
A Berkeley View of Systems Challenges for AI
Ion Stoica
Basel Alomair
Raluca A. Popa
D. Patterson
Michael W. Mahoney
...
Joseph E. Gonzalez
Ken Goldberg
A. Ghodsi
David Culler
Pieter Abbeel
69
200
0
15 Dec 2017
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
Peter Henderson
T. Doan
Riashat Islam
David Meger
BDL
74
13
0
06 Dec 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
153
1,782
0
05 Dec 2017
The Case for Learned Index Structures
The Case for Learned Index Structures
Tim Kraska
Alex Beutel
Ed H. Chi
J. Dean
N. Polyzotis
85
1,046
0
04 Dec 2017
Progressive Neural Architecture Search
Progressive Neural Architecture Search
Chenxi Liu
Barret Zoph
Maxim Neumann
Jonathon Shlens
Wei Hua
Li Li
Li Fei-Fei
Alan Yuille
Jonathan Huang
Kevin Patrick Murphy
114
1,994
0
02 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
88
229
0
29 Nov 2017
Deep Reinforcement Learning for De-Novo Drug Design
Deep Reinforcement Learning for De-Novo Drug Design
Mariya Popova
Olexandr Isayev
Alexander Tropsha
96
1,033
0
29 Nov 2017
Are GANs Created Equal? A Large-Scale Study
Are GANs Created Equal? A Large-Scale Study
Mario Lucic
Karol Kurach
Marcin Michalski
Sylvain Gelly
Olivier Bousquet
EGVM
89
1,013
0
28 Nov 2017
Population Based Training of Neural Networks
Population Based Training of Neural Networks
Max Jaderberg
Valentin Dalibard
Simon Osindero
Wojciech M. Czarnecki
Jeff Donahue
...
Tim Green
Iain Dunning
Karen Simonyan
Chrisantha Fernando
Koray Kavukcuoglu
93
744
0
27 Nov 2017
Recurrent Relational Networks
Recurrent Relational Networks
Rasmus Berg Palm
Ulrich Paquet
Ole Winther
GNNReLMNAI
110
142
0
21 Nov 2017
Teaching a Machine to Read Maps with Deep Reinforcement Learning
Teaching a Machine to Read Maps with Deep Reinforcement Learning
Gino Brunner
Oliver Richter
Yuyi Wang
Roger Wattenhofer
3DV
62
52
0
20 Nov 2017
Previous
123456...91011
Next