ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,552 papers shown
Title
Flow-based Recurrent Belief State Learning for POMDPs
Flow-based Recurrent Belief State Learning for POMDPs
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
56
18
0
23 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
Towards biologically plausible Dreaming and Planning in recurrent
  spiking networks
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
31
7
0
20 May 2022
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement
  Learning-based Beam Search
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search
Tianlin Li
Zhe Chen
Bo Jiang
Jin Tang
Bin Luo
Dacheng Tao
50
18
0
19 May 2022
A2C is a special case of PPO
A2C is a special case of PPO
Shengyi Huang
Anssi Kanervisto
Antonin Raffin
Weixun Wang
Santiago Ontañón
Rousslan Fernand Julien Dossa
OffRL
39
25
0
18 May 2022
Generating Explanations from Deep Reinforcement Learning Using Episodic
  Memory
Generating Explanations from Deep Reinforcement Learning Using Episodic Memory
Sam Blakeman
D. Mareschal
32
3
0
18 May 2022
GraphMapper: Efficient Visual Navigation by Scene Graph Generation
GraphMapper: Efficient Visual Navigation by Scene Graph Generation
Zachary Seymour
Niluthpol Chowdhury Mithun
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
30
8
0
17 May 2022
Qualitative Differences Between Evolutionary Strategies and
  Reinforcement Learning Methods for Control of Autonomous Agents
Qualitative Differences Between Evolutionary Strategies and Reinforcement Learning Methods for Control of Autonomous Agents
Nicola Milano
S. Nolfi
28
0
0
16 May 2022
Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End
  Autonomous Driving
Bridging Sim2Real Gap Using Image Gradients for the Task of End-to-End Autonomous Driving
U. R. Nair
Sarthak Sharma
Udit Singh Parihar
M. S. Menon
Srikanth Vidapanakal
26
3
0
16 May 2022
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning
  Environments
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments
Ryan Sullivan
J. K. Terry
Benjamin Black
John P. Dickerson
30
8
0
14 May 2022
Learning to Guide Multiple Heterogeneous Actors from a Single Human
  Demonstration via Automatic Curriculum Learning in StarCraft II
Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II
Nicholas R. Waytowich
James Z. Hare
Vinicius G. Goecks
Mark R. Mittrick
John Richardson
Anjon Basak
Derrik E. Asher
43
2
0
11 May 2022
Efficient Distributed Framework for Collaborative Multi-Agent
  Reinforcement Learning
Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Xiaohan Hou
Jia-jia Zhang
Xinyu Wang
Jing Xiao
24
0
0
11 May 2022
Learning A Simulation-based Visual Policy for Real-world Peg In Unseen
  Holes
Learning A Simulation-based Visual Policy for Real-world Peg In Unseen Holes
Liangru Xie
Hongxiang Yu
Kechun Xu
Tong Yang
Minhang Wang
Haojian Lu
R. Xiong
Yue Wang
36
0
0
09 May 2022
Generative Evolutionary Strategy For Black-Box Optimizations
Generative Evolutionary Strategy For Black-Box Optimizations
C. Park
16
0
0
06 May 2022
Learning to Solve Vehicle Routing Problems: A Survey
Learning to Solve Vehicle Routing Problems: A Survey
Aigerim Bogyrbayeva
Meraryslan Meraliyev
Taukekhan Mustakhov
Bissenbay Dauletbayev
31
24
0
05 May 2022
Interactive Grounded Language Understanding in a Collaborative
  Environment: IGLU 2021
Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
I. Churin
Putra Manggala
Kata Naszádi
Michiel van der Meer
Taewoon Kim
LLMAG
33
30
0
05 May 2022
CCLF: A Contrastive-Curiosity-Driven Learning Framework for
  Sample-Efficient Reinforcement Learning
CCLF: A Contrastive-Curiosity-Driven Learning Framework for Sample-Efficient Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
34
12
0
02 May 2022
Rate-Constrained Remote Contextual Bandits
Rate-Constrained Remote Contextual Bandits
Francesco Pase
Deniz Gündüz
M. Zorzi
39
8
0
26 Apr 2022
A Survey of Traversability Estimation for Mobile Robots
A Survey of Traversability Estimation for Mobile Robots
Christos Sevastopoulos
S. Konstantopoulos
51
34
0
22 Apr 2022
Learning to Constrain Policy Optimization with Virtual Trust Region
Learning to Constrain Policy Optimization with Virtual Trust Region
Hung Le
Thommen Karimpanal George
Majid Abdolshah
D. Nguyen
Kien Do
Sunil R. Gupta
Svetha Venkatesh
36
3
0
20 Apr 2022
Network Topology Optimization via Deep Reinforcement Learning
Network Topology Optimization via Deep Reinforcement Learning
Zhuoran Li
Xing Wang
L. Pan
Lin Zhu
Zhendong Wang
Junlan Feng
Chao Deng
Longbo Huang
26
13
0
19 Apr 2022
Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on
  Unseen Shapes in Real World
Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World
Liang Xie
Hongxiang Yu
Yinghao Zhao
Haodong Zhang
Zhongxiang Zhou
Minhang Wang
Yue Wang
R. Xiong
22
9
0
16 Apr 2022
Reinforcement Learning Policy Recommendation for Interbank Network
  Stability
Reinforcement Learning Policy Recommendation for Interbank Network Stability
Alessio Brini
G. Tedeschi
Daniele Tantari
21
2
0
14 Apr 2022
Reinforcement learning on graphs: A survey
Reinforcement learning on graphs: A survey
Mingshuo Nie
Dongming Chen
Dongqi Wang
49
45
0
13 Apr 2022
Standardized feature extraction from pairwise conflicts applied to the
  train rescheduling problem
Standardized feature extraction from pairwise conflicts applied to the train rescheduling problem
Anikó Kopacz
Ágnes Mester
Sándor Kolumbán
Lehel Csató
15
0
0
06 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen
3DV
34
30
0
06 Apr 2022
Federated Reinforcement Learning with Environment Heterogeneity
Federated Reinforcement Learning with Environment Heterogeneity
Hao Jin
Yang Peng
Wenhao Yang
Shusen Wang
Zhihua Zhang
65
68
0
06 Apr 2022
A Comprehensive Survey on Automated Machine Learning for Recommendations
A Comprehensive Survey on Automated Machine Learning for Recommendations
Bo Chen
Xiangyu Zhao
Yejing Wang
Wenqi Fan
Huifeng Guo
Ruiming Tang
AI4TS
31
6
0
04 Apr 2022
Autonomous Highway Merging in Mixed Traffic Using Reinforcement Learning
  and Motion Predictive Safety Controller
Autonomous Highway Merging in Mixed Traffic Using Reinforcement Learning and Motion Predictive Safety Controller
Qianqian Liu
Fengying Dang
Xiaofan Wang
Xiaoqiang Ren
30
13
0
03 Apr 2022
Hysteresis-Based RL: Robustifying Reinforcement Learning-based Control
  Policies via Hybrid Control
Hysteresis-Based RL: Robustifying Reinforcement Learning-based Control Policies via Hybrid Control
Jan de Priester
R. Sanfelice
N. van de Wouw
27
2
0
01 Apr 2022
MOF: A Modular Framework for Rapid Application of Optimization
  Methodologies to General Engineering Design Problems
MOF: A Modular Framework for Rapid Application of Optimization Methodologies to General Engineering Design Problems
B. Andersen
G. Delipei
D. Kropaczek
J. Hou
6
5
0
01 Apr 2022
Unsupervised Learning of Temporal Abstractions with Slot-based
  Transformers
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
26
16
0
25 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a
  soft actor-critic approach
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
24
40
0
19 Mar 2022
Learning for Robot Decision Making under Distribution Shift: A Survey
Learning for Robot Decision Making under Distribution Shift: A Survey
Abhishek Paudel
OOD
OffRL
46
5
0
14 Mar 2022
The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its
  Warehouse Applications
The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications
Tim Tsz-Kit Lau
B. Sengupta
25
4
0
14 Mar 2022
Faithfulness in Natural Language Generation: A Systematic Survey of
  Analysis, Evaluation and Optimization Methods
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods
Wei Li
Wenhao Wu
Moye Chen
Jiachen Liu
Xinyan Xiao
Hua Wu
HILM
31
27
0
10 Mar 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
41
226
0
09 Mar 2022
Leveraging Randomized Smoothing for Optimal Control of Nonsmooth
  Dynamical Systems
Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Quentin Le Lidec
Fabian Schramm
Louis Montaut
Cordelia Schmid
Ivan Laptev
Justin Carpentier
38
24
0
08 Mar 2022
Online Learning of Reusable Abstract Models for Object Goal Navigation
Online Learning of Reusable Abstract Models for Object Goal Navigation
Tommaso Campari
Leonardo Lamanna
P. Traverso
Luciano Serafini
Lamberto Ballan
EgoV
15
19
0
04 Mar 2022
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning
Tianhao Wu
Fangwei Zhong
Yiran Geng
Hongchen Wang
Yongjian Zhu
Yizhou Wang
Hao Dong
27
8
0
04 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
Can Mean Field Control (MFC) Approximate Cooperative Multi Agent
  Reinforcement Learning (MARL) with Non-Uniform Interaction?
Can Mean Field Control (MFC) Approximate Cooperative Multi Agent Reinforcement Learning (MARL) with Non-Uniform Interaction?
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
34
9
0
28 Feb 2022
Avalanche RL: a Continual Reinforcement Learning Library
Avalanche RL: a Continual Reinforcement Learning Library
Nicolo Lucchesi
Antonio Carta
Vincenzo Lomonaco
Davide Bacciu
42
6
0
28 Feb 2022
Learning to Schedule Heuristics for the Simultaneous Stochastic
  Optimization of Mining Complexes
Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes
Yassine Yaakoubi
R. Dimitrakopoulos
35
10
0
25 Feb 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models
Spyridon Mouselinos
Henryk Michalewski
Mateusz Malinowski
26
3
0
24 Feb 2022
Think Global, Act Local: Dual-scale Graph Transformer for
  Vision-and-Language Navigation
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
36
139
0
23 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
38
9
0
23 Feb 2022
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object
  Tracking
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking
Zeyu Fang
Jian Zhao
Mingyu Yang
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
28
10
0
22 Feb 2022
Previous
123...101112...303132
Next