ResearchTrend.AI
Sample Efficient Actor-Critic with Experience Replay

3 November 2016
Ziyun Wang
V. Bapst
N. Heess
Volodymyr Mnih
Rémi Munos
Koray Kavukcuoglu
Nando de Freitas

Papers citing "Sample Efficient Actor-Critic with Experience Replay"

50 / 136 papers shown
Variational Actor-Critic Algorithms
Yuhua Zhu
Lexing Ying
OffRL
15
0
0
03 Aug 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic with Linear Function Approximation
Zaiwei Chen
S. Khodadadian
S. T. Maguluri
OffRL
68
29
0
26 May 2021
Reinforcement Learning for Ridesharing: An Extended Survey
Zhiwei Qin
Hongtu Zhu
Jieping Ye
44
84
0
03 May 2021
Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference
Xiaocong Chen
Lina Yao
Xianzhi Wang
Aixin Sun
Wenjie Zhang
Quan Z. Sheng
22
8
0
03 May 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
47
24
0
23 Feb 2021
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
S. Khodadadian
Zaiwei Chen
S. T. Maguluri
CML
OffRL
74
26
0
18 Feb 2021
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios
Chengmin Zhou
Bingding Huang
Pasi Fränti
22
1
0
05 Feb 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
60
73
0
01 Jan 2021
Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search
Y. Fu
Zhongzhi Yu
Yongan Zhang
Yingyan Lin
24
4
0
24 Dec 2020
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Integrating LEO Satellites and Multi-UAV Reinforcement Learning for Hybrid FSO/RF Non-Terrestrial Networks
Ju-Hyung Lee
Jihong Park
M. Bennis
Young-Chai Ko
38
47
0
20 Oct 2020
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
K. Murugesan
Mattia Atzeni
Pavan Kapanipathi
Pushkar Shukla
Yara Rizk
Gerald Tesauro
Kartik Talamadupula
Mrinmaya Sachan
Murray Campbell
LM&Ro
LLMAG
OffRL
37
54
0
08 Oct 2020
Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems
Sandhya Saisubramanian
S. Zilberstein
Ece Kamar
20
21
0
24 Aug 2020
Queueing Network Controls via Deep Reinforcement Learning
J. Dai
Mark O. Gluzman
OffRL
32
50
0
31 Jul 2020
Off-Policy Multi-Agent Decomposed Policy Gradients
Yihan Wang
Beining Han
Tonghan Wang
Heng Dong
Chongjie Zhang
35
175
0
24 Jul 2020
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
30
235
0
13 Jul 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Stealing Deep Reinforcement Learning Models for Fun and Profit
Kangjie Chen
Shangwei Guo
Tianwei Zhang
Xiaofei Xie
Yang Liu
MLAU
MIACV
OffRL
24
45
0
09 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning
G. Novati
Hugues Lascombes de Laroussilhe
Petros Koumoutsakos
AI4CE
34
15
0
18 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
92
146
0
04 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
30
455
0
02 May 2020
A Survey of Deep Learning for Scientific Discovery
M. Raghu
Erica Schmidt
OOD
AI4CE
47
120
0
26 Mar 2020
Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics
Amir H. Mosavi
Pedram Ghamisi
Yaser Faghan
Puhong Duan
OffRL
27
152
0
21 Mar 2020
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft
Christian Scheller
Yanick Schraner
Manfred Vogel
29
27
0
12 Mar 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
24
8
0
27 Feb 2020
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
A Survey of Deep Learning Applications to Autonomous Vehicle Control
Sampo Kuutti
Richard Bowden
Yaochu Jin
P. Barber
Saber Fallah
36
507
0
23 Dec 2019
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
27
75
0
09 Nov 2019
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
104
80
0
18 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
22
541
0
01 Oct 2019
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019
Controlling an Autonomous Vehicle with Deep Reinforcement Learning
A. Folkers
Matthias Rick
C. Büskens
22
67
0
24 Sep 2019
X-ToM: Explaining with Theory-of-Mind for Gaining Justified Human Trust
Arjun Reddy Akula
Changsong Liu
Sari Saba-Sadiya
Hongjing Lu
S. Todorovic
J. Chai
Song-Chun Zhu
24
18
0
15 Sep 2019
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
Weixun Wang
Tianpei Yang
Y. Liu
Jianye Hao
Xiaotian Hao
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
AI4CE
24
110
0
06 Sep 2019
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
Ryuichi Takanobu
Hanlin Zhu
Minlie Huang
21
90
0
28 Aug 2019
Neural Simplex Architecture
Dung Phan
Radu Grosu
N. Jansen
Nicola Paoletti
S. Smolka
Scott D. Stoller
24
61
0
01 Aug 2019
Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges
Lei Lei
Yue Tan
Kan Zheng
Shiwen Liu
K. Zheng
Xuemin Shen
OffRL
23
202
0
22 Jul 2019
A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Oliver Kroemer
S. Niekum
George Konidaris
41
356
0
06 Jul 2019
Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets
Xiaofeng Liu
B. Kumar
Chao Yang
Qingming Tang
J. You
CVBM
23
42
0
05 Jul 2019
Modified Actor-Critics
Erinc Merdivan
S. Hanke
M. Geist
24
2
0
02 Jul 2019
Is the Policy Gradient a Gradient?
Chris Nota
Philip S. Thomas
8
57
0
17 Jun 2019
AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning
Lu Chen
Zhi Chen
Bowen Tan
Sishan Long
Milica Gasic
Kai Yu
19
35
0
27 May 2019
Policy Search by Target Distribution Learning for Continuous Control
Chuheng Zhang
Yuanqi Li
Jian Li
26
6
0
27 May 2019
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
29
51
0
05 May 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
26
17
0
11 Mar 2019
On Tiny Episodic Memories in Continual Learning
Arslan Chaudhry
Marcus Rohrbach
Mohamed Elhoseiny
Thalaiyasingam Ajanthan
P. Dokania
Philip Torr
Marc'Aurelio Ranzato
CLL
52
393
0
27 Feb 2019
On-Policy Trust Region Policy Optimisation with Replay Buffers
D. Kangin
N. Pugeault
OffRL
19
3
0
18 Jan 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
18
4
0
17 Jan 2019