Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning
Sreejan Kumar
Ishita Dasgupta
Nathaniel D. Daw
Jonathan Cohen
Thomas Griffiths
80
10
0
04 Apr 2022
A Comprehensive Survey on Automated Machine Learning for Recommendations
Bo Chen
Xiangyu Zhao
Yejing Wang
Wenqi Fan
Huifeng Guo
Ruiming Tang
AI4TS
107
7
0
04 Apr 2022
Autonomous Highway Merging in Mixed Traffic Using Reinforcement Learning and Motion Predictive Safety Controller
Qianqian Liu
Fengying Dang
Xiaofan Wang
Xiaoqiang Ren
69
13
0
03 Apr 2022
Enhancing Digital Health Services: A Machine Learning Approach to Personalized Exercise Goal Setting
J. Fang
Vincent Cs Lee
Hao Ji
Haiyan Wang
21
5
0
03 Apr 2022
Hysteresis-Based RL: Robustifying Reinforcement Learning-based Control Policies via Hybrid Control
Jan de Priester
R. Sanfelice
N. van de Wouw
97
2
0
01 Apr 2022
MOF: A Modular Framework for Rapid Application of Optimization Methodologies to General Engineering Design Problems
B. Andersen
G. Delipei
D. Kropaczek
J. Hou
13
5
0
01 Apr 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Hongru Wang
Wei Liang
Jianbing Shen
Luc Van Gool
Wenguan Wang
97
58
0
30 Mar 2022
Acknowledging the Unknown for Multi-label Learning with Single Positive Labels
Donghao Zhou
Pengfei Chen
Qiong Wang
Guangyong Chen
Pheng-Ann Heng
70
31
0
30 Mar 2022
Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method
W. Ramos
M. Silva
Edson R. Araujo
Victor Moura
Keller Clayderman Martins de Oliveira
Leandro Soriano Marcolino
Erickson R. Nascimento
VGen
76
3
0
29 Mar 2022
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
114
16
0
25 Mar 2022
Dealing with Sparse Rewards Using Graph Neural Networks
Matvey Gerasyov
Ilya Makarov
66
2
0
25 Mar 2022
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
Xintong Wang
Gary Qiurui Ma
Alon Eden
Clara Li
Alexander R. Trott
Stephan Zheng
David C. Parkes
92
11
0
25 Mar 2022
Intelligent Systematic Investment Agent: an ensemble of deep learning and evolutionary strategies
Prasang Gupta
Shaz Hoda
Anand Srinivasa Rao
AIFin
25
0
0
24 Mar 2022
MERLIN -- Malware Evasion with Reinforcement LearnINg
Tony Quertier
Benjamin Marais
Stephane Morucci
Bertrand Fournel
AAML
73
19
0
24 Mar 2022
Object Memory Transformer for Object Goal Navigation
Rui Fukushima
Keita Ota
Asako Kanezaki
Y. Sasaki
Yusuke Yoshiyasu
64
35
0
24 Mar 2022
Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL
Akram Erraqabi
Marlos C. Machado
Mingde Zhao
Sainbayar Sukhbaatar
A. Lazaric
Ludovic Denoyer
Yoshua Bengio
OffRL
82
9
0
21 Mar 2022
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
316
267
0
21 Mar 2022
Self-Imitation Learning from Demonstrations
Georgiy Pshikhachev
Dmitry Ivanov
Vladimir Egorov
A. Shpilman
56
6
0
21 Mar 2022
Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes
Nick Zhang
Abhishek Gupta
Zefeng Chen
Yew-Soon Ong
54
7
0
21 Mar 2022
EdgeMatrix: A Resources Redefined Edge-Cloud System for Prioritized Services
Yuanming Ren
Shihao Shen
Yanli Ju
Xiaofei Wang
Wenyu Wang
Victor C. M. Leung
33
13
0
20 Mar 2022
Reinforcement learning for automatic quadrilateral mesh generation: a soft actor-critic approach
J. Pan
Jingwei Huang
G. Cheng
Yong Zeng
AI4CE
73
41
0
19 Mar 2022
Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Taisuke Kobayashi
62
5
0
18 Mar 2022
Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination
Derrik E. Asher
Anjon Basak
Rolando Fernandez
P. Sharma
Erin G. Zaroukian
...
Thomas Mahre
Gerardo Galindo
Luke Frerichs
J. Rogers
J. Fossaceca
AI4CE
59
5
0
17 Mar 2022
Adaptive Environment Modeling Based Reinforcement Learning for Collision Avoidance in Complex Scenes
Shuaijun Wang
R. Gao
Ruihua Han
Shengduo Chen
Chengyang Li
Qi Hao
69
13
0
15 Mar 2022
Learning for Robot Decision Making under Distribution Shift: A Survey
Abhishek Paudel
OOD
OffRL
102
6
0
14 Mar 2022
The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications
Tim Tsz-Kit Lau
B. Sengupta
57
4
0
14 Mar 2022
Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of Flexible-Joint Manipulator with Reference Correction
D. Pavlichenko
Sven Behnke
66
7
0
14 Mar 2022
Calibration of Derivative Pricing Models: a Multi-Agent Reinforcement Learning Perspective
N. Vadori
56
1
0
14 Mar 2022
A Deep Reinforcement Learning Environment for Particle Robot Navigation and Object Manipulation
Jeremy Shen
Erdong Xiao
Yuchen Liu
Chen Feng
71
7
0
12 Mar 2022
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods
Wei Li
Wenhao Wu
Moye Chen
Jiachen Liu
Xinyan Xiao
Hua Wu
HILM
149
29
0
10 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
101
256
0
09 Mar 2022
Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Quentin Le Lidec
Fabian Schramm
Louis Montaut
Cordelia Schmid
Ivan Laptev
Justin Carpentier
108
24
0
08 Mar 2022
A Survey on Reinforcement Learning Methods in Character Animation
Ariel Kwiatkowski
Eduardo Alvarado
Vicky Kalogeiton
Chenxi Liu
Julien Pettré
M. van de Panne
Marie-Paule Cani
AI4CE
97
46
0
07 Mar 2022
Graph Neural Networks for Image Classification and Reinforcement Learning using Graph representations
Naman Goyal
David F. Steiner
GNN
32
5
0
07 Mar 2022
Learning to Ground Decentralized Multi-Agent Communication with Contrastive Learning
Y. Lo
B. Sengupta
73
4
0
07 Mar 2022
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment
Diogo S. Carvalho
B. Sengupta
63
2
0
06 Mar 2022
Online Learning of Reusable Abstract Models for Object Goal Navigation
Tommaso Campari
Leonardo Lamanna
P. Traverso
Luciano Serafini
Lamberto Ballan
EgoV
78
19
0
04 Mar 2022
Targeted Data Poisoning Attack on News Recommendation System by Content Perturbation
Xudong Zhang
Zan Wang
Jingke Zhao
Lanjun Wang
AAML
34
10
0
04 Mar 2022
GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning
Tianhao Wu
Fangwei Zhong
Yiran Geng
Hongchen Wang
Yongjian Zhu
Yizhou Wang
Hao Dong
77
10
0
04 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
123
11
0
01 Mar 2022
Can Mean Field Control (MFC) Approximate Cooperative Multi Agent Reinforcement Learning (MARL) with Non-Uniform Interaction?
Washim Uddin Mondal
Vaneet Aggarwal
S. Ukkusuri
77
9
0
28 Feb 2022
Avalanche RL: a Continual Reinforcement Learning Library
Nicolo Lucchesi
Antonio Carta
Vincenzo Lomonaco
Davide Bacciu
82
6
0
28 Feb 2022
GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control
Abdolreza Taheri
Joni Pajarinen
R. Ghabcheloo
GP
49
3
0
28 Feb 2022
Domain Knowledge-Based Automated Analog Circuit Design with Deep Reinforcement Learning
Weidong Cao
M. Benosman
Xuan Zhang
Rui Ma
52
15
0
26 Feb 2022
Learning to Schedule Heuristics for the Simultaneous Stochastic Optimization of Mining Complexes
Yassine Yaakoubi
R. Dimitrakopoulos
73
10
0
25 Feb 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
76
9
0
24 Feb 2022
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models
Spyridon Mouselinos
Henryk Michalewski
Mateusz Malinowski
69
3
0
24 Feb 2022
Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration
Yuanfan Xu
Jincheng Yu
Jiahao Tang
Jiantao Qiu
Jian Wang
Yuan Shen
Yu Wang
Huazhong Yang
83
27
0
24 Feb 2022
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
92
150
0
23 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
125
18
0
23 Feb 2022
Previous
1
2
3
...
24
25
26
...
70
71
72
Next