1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
LARG, Language-based Automatic Reward and Goal Generation
Julien Perez
Denys Proux
Claude Roux
Michael Niemaz
LM&Ro
71
1
0
19 Jun 2023
Deep Reinforcement Learning for Flipper Control of Tracked Robots
Hainan Pan
Bailiang Chen
Kaihong Huang
Junkai Ren
Xieyuanli Chen
Huimin Lu
13
1
0
17 Jun 2023
Inroads into Autonomous Network Defence using Explained Reinforcement Learning
Myles Foley
Miaowei Wang
M. Zoe
Chris Hicks
V. Mavroudis
AAML
62
15
0
15 Jun 2023
Mediated Multi-Agent Reinforcement Learning
Dmitry Ivanov
Ilya Zisman
Kirill Chernyshev
76
9
0
14 Jun 2023
AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks
Alexander Tornede
Difan Deng
Theresa Eimer
Joseph Giovanelli
Aditya Mohan
...
Sarah Segel
Daphne Theodorakopoulos
Tanja Tornede
Henning Wachsmuth
Marius Lindauer
119
24
0
13 Jun 2023
Multi-Agent Reinforcement Learning Guided by Signal Temporal Logic Specifications
Jiangwei Wang
Shuo Yang
Ziyan An
Songyang Han
Zhili Zhang
Rahul Mangharam
Meiyi Ma
Fei Miao
90
9
0
11 Jun 2023
Zero-Shot Wireless Indoor Navigation through Physics-Informed Reinforcement Learning
Mingsheng Yin
Tao Li
Haozhe Lei
Yaqi Hu
S. Rangan
Quanyan Zhu
70
4
0
11 Jun 2023
Design Principles for Model Generalization and Scalable AI Integration in Radio Access Networks
Pablo Soldati
E. Ghadimi
Burak Demirel
Yu Wang
Raimundas Gaigalas
Mathias Sintorn
41
3
0
09 Jun 2023
Large Language Models Are Semi-Parametric Reinforcement Learning Agents
Danyang Zhang
Lu Chen
Situo Zhang
Hongshen Xu
Zihan Zhao
Kai Yu
LM&Ro
KELM
LLMAG
83
25
0
09 Jun 2023
Decision S4: Efficient Sequence-Based RL via State Spaces Layers
Shmuel Bar-David
Itamar Zimerman
Eliya Nachmani
Lior Wolf
OffRL
109
28
0
08 Jun 2023
RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control
Jonas Eschmann
Dario Albani
Giuseppe Loianno
OffRL
109
5
0
06 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRL
OnRL
114
17
0
05 Jun 2023
Improving Grammar-based Sequence-to-Sequence Modeling with Decomposition and Constraints
Chao Lou
Kewei Tu
89
1
0
05 Jun 2023
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
Banghua Zhu
Hiteshi Sharma
Felipe Vieira Frujeri
Shi Dong
Chenguang Zhu
Michael I. Jordan
Jiantao Jiao
OSLM
83
41
0
04 Jun 2023
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages
Andrew Jesson
Chris Xiaoxuan Lu
Gunshi Gupta
Angelos Filos
Jakob N. Foerster
Y. Gal
OffRL
86
8
0
02 Jun 2023
Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction
Quentin Delfosse
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
96
39
0
02 Jun 2023
Investigating Navigation Strategies in the Morris Water Maze through Deep Reinforcement Learning
A. Liu
Alla Borisyuk
48
7
0
01 Jun 2023
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRL
AI4CE
126
41
0
01 Jun 2023
Latent Exploration for Reinforcement Learning
A. Chiappa
Alessandro Marin Vargas
Ann Zixiang Huang
Alexander Mathis
90
18
0
31 May 2023
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Dongyoung Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
78
22
0
31 May 2023
Exploring the Promise and Limits of Real-Time Recurrent Learning
Kazuki Irie
Anand Gopalakrishnan
Jürgen Schmidhuber
75
16
0
30 May 2023
Subequivariant Graph Reinforcement Learning in 3D Environments
Runfa Chen
Jiaqi Han
Gang Hua
Wen-bing Huang
OffRL
76
11
0
30 May 2023
Policy Optimization for Continuous Reinforcement Learning
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
103
18
0
30 May 2023
Doing the right thing for the right reason: Evaluating artificial moral cognition by probing cost insensitivity
Yiran Mao
Madeline G. Reinecke
M. Kunesch
Edgar A. Duéñez-Guzmán
Ramona Comanescu
Julia Haas
Joel Z Leibo
66
2
0
29 May 2023
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments
Daniel Coelho
Miguel Oliveira
Vítor M. F. Santos
50
4
0
29 May 2023
DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm
Yunhao Tang
Tadashi Kozuno
Mark Rowland
Anna Harutyunyan
Rémi Munos
Bernardo Avila-Pires
Michal Valko
37
0
0
29 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
107
18
0
28 May 2023
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Mark Rowland
Yunhao Tang
Clare Lyle
Rémi Munos
Marc G. Bellemare
Will Dabney
96
11
0
28 May 2023
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu
Souradip Chakraborty
Yanchao Sun
Furong Huang
AAML
75
5
0
27 May 2023
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kai Zhang
Abhishek Gupta
OffRL
SSL
88
9
0
26 May 2023
NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Jaromír Janisch
Tomáš Pevný
Viliam Lisý
82
19
0
26 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
107
19
0
26 May 2023
Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
OffRL
71
3
0
25 May 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
175
844
0
25 May 2023
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning
V. Moschopoulos
Pantelis Kyriakidis
A. Lazaridis
I. Vlahavas
31
0
0
25 May 2023
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
66
52
0
24 May 2023
Masked Path Modeling for Vision-and-Language Navigation
Zi-Yi Dou
Feng Gao
Nanyun Peng
LM&Ro
83
3
0
23 May 2023
ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry
Chris Beeler
Sriram Ganapathi Subramanian
Kyle Sprague
Nouha Chatti
C. Bellinger
...
Amanuel Dawit
Zihan Yang
Xinkai Li
Mark Crowley
Isaac Tamblyn
OffRL
77
6
0
23 May 2023
Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning
Oswin So
Chuchu Fan
46
24
0
23 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
117
1
0
23 May 2023
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
Sumeet Batra
Bryon Tjanaka
Matthew C. Fontaine
Aleksei Petrenko
Stefanos Nikolaidis
Gaurav Sukhatme
OffRL
100
17
0
23 May 2023
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning
Kibeom Kim
Hyun-Dong Lee
Min Whoo Lee
Moonheon Lee
Minsu Lee
Byoung-Tak Zhang
79
1
0
23 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
119
4
0
22 May 2023
Road Planning for Slums via Deep Reinforcement Learning
Y. Zheng
Hongyuan Su
Jingtao Ding
Depeng Jin
Yong Li
80
14
0
22 May 2023
Continually Improving Extractive QA via Human Feedback
Ge Gao
Hung-Ting Chen
Yoav Artzi
Eunsol Choi
87
12
0
21 May 2023
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Wanqiao Xu
Shi Dong
Dilip Arumugam
Benjamin Van Roy
78
8
0
19 May 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Liuyi Wang
Chengju Liu
Zongtao He
Shu Li
Qingqing Yan
Huiyi Chen
Qi Chen
76
10
0
19 May 2023
Sharing Lifelong Reinforcement Learning Knowledge via Modulating Masks
Saptarshi Nath
Christos Peridis
Eseoghene Ben-Iwhiwhu
Xinran Liu
Shirin Dora
Cong Liu
Soheil Kolouri
Andrea Soltoggio
CLL
76
10
0
18 May 2023
Client Selection for Federated Policy Optimization with Environment Heterogeneity
Zhijie Xie
S. H. Song
63
4
0
18 May 2023
The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond
Jiin Woo
Gauri Joshi
Yuejie Chi
FedML
75
22
0
18 May 2023