ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for
  Continuous Control
Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
Zhiyuan Xu
Kun Wu
Zhengping Che
Jian Tang
Jieping Ye
CLLOffRL
109
49
0
15 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSLOffRL
110
60
0
15 Oct 2020
Average Cost Optimal Control of Stochastic Systems Using Reinforcement
  Learning
Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning
J. Lai
J. Xiong
18
0
0
13 Oct 2020
Deep Reinforcement Learning and Transportation Research: A Comprehensive
  Review
Deep Reinforcement Learning and Transportation Research: A Comprehensive Review
Nahid Parvez Farazi
T. Ahamed
Limon Barua
Bo Zou
AI4TS
69
18
0
13 Oct 2020
FedAT: A High-Performance and Communication-Efficient Federated Learning
  System with Asynchronous Tiers
FedAT: A High-Performance and Communication-Efficient Federated Learning System with Asynchronous Tiers
Zheng Chai
Yujing Chen
Ali Anwar
Liang Zhao
Yue Cheng
Huzefa Rangwala
FedML
82
124
0
12 Oct 2020
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC
  Placement Based on Availability and Energy Consumption
The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption
Guto Leoni Santos
Theo Lynn
J. Kelner
P. Endo
30
0
0
12 Oct 2020
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in
  Image Classification
Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification
Yulin Wang
Kangchen Lv
Rui Huang
Shiji Song
Le Yang
Gao Huang
3DH
65
151
0
11 Oct 2020
Instance Weighted Incremental Evolution Strategies for Reinforcement
  Learning in Dynamic Environments
Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic Environments
Zhi Wang
Chunlin Chen
D. Dong
51
12
0
09 Oct 2020
Learning Not to Learn: Nature versus Nurture in Silico
Learning Not to Learn: Nature versus Nurture in Silico
R. T. Lange
Henning Sprekeler
80
10
0
09 Oct 2020
Learning Value Functions in Deep Policy Gradients using Residual
  Variance
Learning Value Functions in Deep Policy Gradients using Residual Variance
Yannis Flet-Berliac
Reda Ouhamma
Odalric-Ambrym Maillard
Philippe Preux
OffRL
72
1
0
09 Oct 2020
Q-learning with Language Model for Edit-based Unsupervised Summarization
Q-learning with Language Model for Edit-based Unsupervised Summarization
Ryosuke Kohita
Akifumi Wachi
Yang Zhao
Ryuki Tachibana
KELM
52
4
0
09 Oct 2020
Text-based RL Agents with Commonsense Knowledge: New Challenges,
  Environments and Baselines
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
K. Murugesan
Mattia Atzeni
Pavan Kapanipathi
Pushkar Shukla
Yara Rizk
Gerald Tesauro
Kartik Talamadupula
Mrinmaya Sachan
Murray Campbell
LM&RoLLMAGOffRL
77
56
0
08 Oct 2020
Maximum Reward Formulation In Reinforcement Learning
Maximum Reward Formulation In Reinforcement Learning
S. Gottipati
Yashaswi Pathak
Rohan Nuttall
Sahir
Raviteja Chunduru
Ahmed Touati
Sriram Ganapathi Subramanian
Matthew E. Taylor
Sarath Chandar
118
14
0
08 Oct 2020
Regularized Inverse Reinforcement Learning
Regularized Inverse Reinforcement Learning
Wonseok Jeon
Chen-Yang Su
Paul Barde
T. Doan
Derek Nowrouzezahrai
Joelle Pineau
76
12
0
07 Oct 2020
Online Safety Assurance for Deep Reinforcement Learning
Online Safety Assurance for Deep Reinforcement Learning
Noga H. Rotman
Michael Schapira
Aviv Tamar
OffRL
96
5
0
07 Oct 2020
From Language Games to Drawing Games
From Language Games to Drawing Games
Chrisantha Fernando
D. Zenkova
Stanislav Nikolov
Simon Osindero
73
4
0
06 Oct 2020
Learning Diverse Options via InfoMax Termination Critic
Learning Diverse Options via InfoMax Termination Critic
Yuji Kanagawa
Tomoyuki Kaneko
66
1
0
06 Oct 2020
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for
  Language Model Adaptation
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang
Moonsu Han
Sung Ju Hwang
OOD
81
18
0
06 Oct 2020
Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment
  Mapping
Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping
Ceyer Wakilpoor
Patrick J. Martin
Carrie Rebhuhn
Amanda Vu
67
22
0
06 Oct 2020
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Jonathan Gray
Adam Lerer
A. Bakhtin
Noam Brown
123
51
0
06 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
236
877
0
05 Oct 2020
Offline Learning for Planning: A Summary
Offline Learning for Planning: A Summary
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
51
4
0
05 Oct 2020
The act of remembering: a study in partially observable reinforcement
  learning
The act of remembering: a study in partially observable reinforcement learning
Rodrigo Toro Icarte
Richard Valenzano
Toryn Q. Klassen
Phillip J. K. Christoffersen
Amir-massoud Farahmand
Sheila A. McIlraith
OffRL
40
11
0
05 Oct 2020
Mean-Variance Efficient Reinforcement Learning by Expected Quadratic
  Utility Maximization
Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization
Masahiro Kato
Kei Nakagawa
Kenshi Abe
Tetsuro Morimura
424
0
0
03 Oct 2020
Reinforcement Learning of Sequential Price Mechanisms
Reinforcement Learning of Sequential Price Mechanisms
Gianluca Brero
Alon Eden
M. Gerstgrasser
David C. Parkes
Duncan Rheingans-Yoo
OffRL
60
18
0
02 Oct 2020
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Shangtong Zhang
Romain Laroche
H. V. Seijen
Shimon Whiteson
Rémi Tachet des Combes
124
15
0
02 Oct 2020
Self-Play Reinforcement Learning for Fast Image Retargeting
Self-Play Reinforcement Learning for Fast Image Retargeting
Nobukatsu Kajiura
Satoshi Kosugi
Xueting Wang
T. Yamasaki
136
20
0
02 Oct 2020
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and
  Act in Fantasy Worlds
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds
Prithviraj Ammanabrolu
Jack Urbanek
Margaret Li
Arthur Szlam
Tim Rocktaschel
Jason Weston
LM&Ro
121
44
0
01 Oct 2020
Toolpath design for additive manufacturing using deep reinforcement
  learning
Toolpath design for additive manufacturing using deep reinforcement learning
M. Mozaffar
Ablodghani Ebrahimi
Jian Cao
AI4CE
40
7
0
30 Sep 2020
Cross Learning in Deep Q-Networks
Cross Learning in Deep Q-Networks
Xing Wang
A. Vinel
25
2
0
29 Sep 2020
Trust-Region Method with Deep Reinforcement Learning in Analog Design
  Space Exploration
Trust-Region Method with Deep Reinforcement Learning in Analog Design Space Exploration
Kai-En Yang
Chia-Yu Tsai
Hung-Hao Shen
Chen-Feng Chiang
Feng-Ming Tsai
Chunguang Wang
Yiju Ting
Chia-Shun Yeh
C. Lai
53
14
0
29 Sep 2020
Lucid Dreaming for Experience Replay: Refreshing Past States with the
  Current Policy
Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy
Yunshu Du
Garrett A. Warnell
A. Gebremedhin
Peter Stone
Matthew E. Taylor
58
11
0
29 Sep 2020
Enhancing Continuous Control of Mobile Robots for End-to-End Visual
  Active Tracking
Enhancing Continuous Control of Mobile Robots for End-to-End Visual Active Tracking
Alessandro Devo
Alberto Dionigi
G. Costante
29
27
0
28 Sep 2020
Normalization Techniques in Training DNNs: Methodology, Analysis and
  Application
Normalization Techniques in Training DNNs: Methodology, Analysis and Application
Lei Huang
Jie Qin
Yi Zhou
Fan Zhu
Li Liu
Ling Shao
AI4CE
176
278
0
27 Sep 2020
Lineage Evolution Reinforcement Learning
Lineage Evolution Reinforcement Learning
Zeyu Zhang
Guisheng Yin
29
0
0
26 Sep 2020
Symbolic Relational Deep Reinforcement Learning based on Graph Neural
  Networks and Autoregressive Policy Decomposition
Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks and Autoregressive Policy Decomposition
Jaromír Janisch
Tomávs Pevný
Viliam Lisý
AI4CE
93
3
0
25 Sep 2020
Deep Reinforcement Learning with a Stage Incentive Mechanism of Dense
  Reward for Robotic Trajectory Planning
Deep Reinforcement Learning with a Stage Incentive Mechanism of Dense Reward for Robotic Trajectory Planning
G. Peng
Jin Yang
Xinde Li
M. O. Khyam
47
11
0
25 Sep 2020
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a
  Survey
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
Wenshuai Zhao
Jorge Peña Queralta
Tomi Westerlund
OffRL
265
743
0
24 Sep 2020
Neurocoder: Learning General-Purpose Computation Using Stored Neural
  Programs
Neurocoder: Learning General-Purpose Computation Using Stored Neural Programs
Hung Le
Svetha Venkatesh
NAI
45
5
0
24 Sep 2020
Is Q-Learning Provably Efficient? An Extended Analysis
Is Q-Learning Provably Efficient? An Extended Analysis
Kushagra Rastogi
Jonathan Lee
Fabrice Harel-Canada
Aditya Sunil Joglekar
OffRL
28
1
0
22 Sep 2020
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Dynamic Horizon Value Estimation for Model-based Reinforcement Learning
Junjie Wang
Qichao Zhang
Dongbin Zhao
Mengchen Zhao
Jianye Hao
OffRL
55
5
0
21 Sep 2020
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent
  Policy Optimization
Learn to Exceed: Stereo Inverse Reinforcement Learning with Concurrent Policy Optimization
Feng Tao
Yongcan Cao
101
2
0
21 Sep 2020
Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear
  Decision Trees for Discrete Action Systems
Towards Interpretable-AI Policies Induction using Evolutionary Nonlinear Decision Trees for Discrete Action Systems
Yashesh D. Dhebar
Kalyanmoy Deb
S. Nageshrao
Ling Zhu
Dimitar Filev
68
16
0
20 Sep 2020
AI and Wargaming
AI and Wargaming
J. Goodman
S. Risi
Simon Lucas
VLM
125
14
0
18 Sep 2020
Efficient Reinforcement Learning Development with RLzoo
Efficient Reinforcement Learning Development with RLzoo
Zihan Ding
Tianyang Yu
Yanhua Huang
Hongming Zhang
Guo Li
Quancheng Guo
Kai Zou
Hao Dong
OffRLOnRL
44
6
0
18 Sep 2020
Reinforcement Learning for Weakly Supervised Temporal Grounding of
  Natural Language in Untrimmed Videos
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos
Jie Wu
Guanbin Li
Xiaoguang Han
Liang Lin
OffRLAI4TS
84
56
0
18 Sep 2020
Competitiveness of MAP-Elites against Proximal Policy Optimization on
  locomotion tasks in deterministic simulations
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations
Szymon Brych
Antoine Cully
67
4
0
17 Sep 2020
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Energy-based Surprise Minimization for Multi-Agent Value Factorization
Karush Suri
Xiaolong Shi
Konstantinos Plataniotis
Y. Lawryshyn
61
1
0
16 Sep 2020
Transfer Learning in Deep Reinforcement Learning: A Survey
Transfer Learning in Deep Reinforcement Learning: A Survey
Zhuangdi Zhu
Kaixiang Lin
Anil K. Jain
Jiayu Zhou
OffRLLRM
160
606
0
16 Sep 2020
Multimodal Safety-Critical Scenarios Generation for Decision-Making
  Algorithms Evaluation
Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation
Wenhao Ding
Baiming Chen
Yue Liu
Kim Ji Eun
Ding Zhao
AAML
102
106
0
16 Sep 2020
Previous
123...394041...707172
Next