ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.06339
  4. Cited By
Deep Reinforcement Learning

Deep Reinforcement Learning

15 October 2018
Yuxi Li
    VLMOffRL
ArXiv (abs)PDFHTML

Papers citing "Deep Reinforcement Learning"

50 / 521 papers shown
Title
Randomized Prior Functions for Deep Reinforcement Learning
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCVBDL
78
380
0
08 Jun 2018
Probabilistic Model-Agnostic Meta-Learning
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn
Kelvin Xu
Sergey Levine
BDL
278
672
0
07 Jun 2018
Path-Level Network Transformation for Efficient Architecture Search
Path-Level Network Transformation for Efficient Architecture Search
Han Cai
Jiacheng Yang
Weinan Zhang
Song Han
Yong Yu
68
211
0
07 Jun 2018
Model-free, Model-based, and General Intelligence
Model-free, Model-based, and General Intelligence
Hector Geffner
LRMELM
58
57
0
06 Jun 2018
TopRank: A practical algorithm for online stochastic ranking
TopRank: A practical algorithm for online stochastic ranking
Tor Lattimore
Branislav Kveton
Shuai Li
Csaba Szepesvári
LRM
44
71
0
06 Jun 2018
Relational Deep Reinforcement Learning
Relational Deep Reinforcement Learning
V. Zambaldi
David Raposo
Adam Santoro
V. Bapst
Yujia Li
...
Victoria Langston
Razvan Pascanu
M. Botvinick
Oriol Vinyals
Peter W. Battaglia
OffRL
159
222
0
05 Jun 2018
Relational recurrent neural networks
Relational recurrent neural networks
Adam Santoro
Ryan Faulkner
David Raposo
Jack W. Rae
Mike Chrzanowski
T. Weber
Daan Wierstra
Oriol Vinyals
Razvan Pascanu
Timothy Lillicrap
GNN
135
211
0
05 Jun 2018
Relational inductive biases, deep learning, and graph networks
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
V. Zambaldi
...
Pushmeet Kohli
M. Botvinick
Oriol Vinyals
Yujia Li
Razvan Pascanu
AI4CENAI
769
3,131
0
04 Jun 2018
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual
  Optimization
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization
Hoi-To Wai
Zhuoran Yang
Zhaoran Wang
Mingyi Hong
75
170
0
03 Jun 2018
Inference Aided Reinforcement Learning for Incentive Mechanism Design in
  Crowdsourcing
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing
Zehong Hu
Yitao Liang
Yang Liu
Jie Zhang
OffRL
44
24
0
01 Jun 2018
Reinforced Continual Learning
Reinforced Continual Learning
Ju Xu
Zhanxing Zhu
CLL
97
377
0
31 May 2018
How Does Batch Normalization Help Optimization?
How Does Batch Normalization Help Optimization?
Shibani Santurkar
Dimitris Tsipras
Andrew Ilyas
Aleksander Madry
ODL
105
1,546
0
29 May 2018
Playing hard exploration games by watching YouTube
Playing hard exploration games by watching YouTube
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
65
271
0
29 May 2018
Human-in-the-Loop Interpretability Prior
Human-in-the-Loop Interpretability Prior
Isaac Lage
A. Ross
Been Kim
S. Gershman
Finale Doshi-Velez
79
121
0
29 May 2018
Dual Policy Iteration
Dual Policy Iteration
Wen Sun
Geoffrey J. Gordon
Byron Boots
J. Andrew Bagnell
OffRL
91
57
0
28 May 2018
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for
  Reinforcement Learning
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning
Jing-Cheng Shi
Yang Yu
Qing Da
Shi-Yong Chen
Anxiang Zeng
OffRL
79
187
0
25 May 2018
Meta-Gradient Reinforcement Learning
Meta-Gradient Reinforcement Learning
Zhongwen Xu
H. V. Hasselt
David Silver
112
327
0
24 May 2018
AutoAugment: Learning Augmentation Policies from Data
AutoAugment: Learning Augmentation Policies from Data
E. D. Cubuk
Barret Zoph
Dandelion Mané
Vijay Vasudevan
Quoc V. Le
135
1,775
0
24 May 2018
Deep Reinforcement Learning of Marked Temporal Point Processes
Deep Reinforcement Learning of Marked Temporal Point Processes
U. Upadhyay
A. De
Manuel Gomez Rodriguez
BDLOffRL
59
112
0
23 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
60
75
0
23 May 2018
Do Better ImageNet Models Transfer Better?
Do Better ImageNet Models Transfer Better?
Simon Kornblith
Jonathon Shlens
Quoc V. Le
OODMLT
170
1,330
0
23 May 2018
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Ian Osband
Benjamin Van Roy
OffRL
57
25
0
23 May 2018
Confounding-Robust Policy Improvement
Confounding-Robust Policy Improvement
Nathan Kallus
Angela Zhou
CMLOffRL
332
153
0
22 May 2018
Learning Safe Policies with Expert Guidance
Learning Safe Policies with Expert Guidance
Je-chun Huang
Fa Wu
Doina Precup
Yang Cai
59
25
0
21 May 2018
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report
  Generation
Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation
Yuan Li
Xiaodan Liang
Zhiting Hu
Eric Xing
MedIm
56
336
0
21 May 2018
Data-Efficient Hierarchical Reinforcement Learning
Data-Efficient Hierarchical Reinforcement Learning
Ofir Nachum
S. Gu
Honglak Lee
Sergey Levine
OffRL
99
812
0
21 May 2018
Learning to Optimize Tensor Programs
Learning to Optimize Tensor Programs
Tianqi Chen
Lianmin Zheng
Eddie Q. Yan
Ziheng Jiang
T. Moreau
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
87
404
0
21 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
69
66
0
20 May 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
A Lyapunov-based Approach to Safe Reinforcement Learning
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
163
507
0
20 May 2018
Reinforcement Learning of Theorem Proving
Reinforcement Learning of Theorem Proving
C. Kaliszyk
Josef Urban
Henryk Michalewski
Miroslav Olsák
48
148
0
19 May 2018
Learning to Multitask
Learning to Multitask
Yu Zhang
Ying Wei
Qiang Yang
146
53
0
19 May 2018
Solving the Rubik's Cube Without Human Knowledge
Solving the Rubik's Cube Without Human Knowledge
Stephen Marcus McAleer
Forest Agostinelli
Alexander Shmakov
Pierre Baldi
43
41
0
18 May 2018
GAN Q-learning
GAN Q-learning
T. Doan
Bogdan Mazoure
Clare Lyle
OODOffRL
44
19
0
13 May 2018
Machine Learning in Compiler Optimisation
Machine Learning in Compiler Optimisation
Zheng Wang
Michael F. P. O'Boyle
VLM
51
77
0
09 May 2018
Exploring the Limits of Weakly Supervised Pretraining
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
205
1,370
0
02 May 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial
  and Review
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CEBDL
91
674
0
02 May 2018
Scalable Bilinear $π$ Learning Using State and Action Features
Scalable Bilinear πππ Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
72
46
0
27 Apr 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual
  Storytelling
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
61
159
0
24 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
90
480
0
23 Apr 2018
Subgoal Discovery for Hierarchical Dialogue Policy Learning
Subgoal Discovery for Hierarchical Dialogue Policy Learning
Da Tang
Xiujun Li
Jianfeng Gao
Chong-Jun Wang
Lihong Li
Tony Jebara
50
50
0
20 Apr 2018
PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement
  Learning for Robust Decision-Making
PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making
Fangkai Yang
Daoming Lyu
Bo Liu
Steven M. Gustafson
OffRL
47
136
0
20 Apr 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
On Learning Intrinsic Rewards for Policy Gradient Methods
Zeyu Zheng
Junhyuk Oh
Satinder Singh
61
208
0
17 Apr 2018
Reinforced Co-Training
Reinforced Co-Training
Jiawei Wu
Lei Li
William Yang Wang
OffRL
85
51
0
17 Apr 2018
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based
  Character Skills
DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills
Xue Bin Peng
Pieter Abbeel
Sergey Levine
M. van de Panne
AI4CE
248
499
0
08 Apr 2018
Universal Planning Networks
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
73
145
0
02 Apr 2018
Learning to Navigate in Cities Without a Map
Learning to Navigate in Cities Without a Map
Piotr Wojciech Mirowski
Matthew Koichi Grimes
Mateusz Malinowski
Karl Moritz Hermann
Keith Anderson
Denis Teplyashin
Karen Simonyan
Koray Kavukcuoglu
Andrew Zisserman
R. Hadsell
SSLHAI
101
320
0
31 Mar 2018
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent
  Reinforcement Learning
QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Tabish Rashid
Mikayel Samvelyan
Christian Schroeder de Witt
Gregory Farquhar
Jakob N. Foerster
Shimon Whiteson
162
1,676
0
30 Mar 2018
Iterative Visual Reasoning Beyond Convolutions
Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen
Li Li
Li Fei-Fei
Abhinav Gupta
LRMGNN
196
216
0
29 Mar 2018
Unsupervised Predictive Memory in a Goal-Directed Agent
Unsupervised Predictive Memory in a Goal-Directed Agent
Greg Wayne
Chia-Chun Hung
David Amos
M. Berk Mirza
Arun Ahuja
...
David Silver
Koray Kavukcuoglu
M. Botvinick
Demis Hassabis
Timothy Lillicrap
81
192
0
28 Mar 2018
Accelerating Learning in Constructive Predictive Frameworks with the
  Successor Representation
Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation
Craig Sherstan
Marlos C. Machado
P. Pilarski
49
10
0
23 Mar 2018
Previous
12345...91011
Next