Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Machine Learning for the Multi-Dimensional Bin Packing Problem: Literature Review and Empirical Evaluation
Wenjie Wu
Changjun Fan
Jin-Yu Huang
Zhong Liu
Junchi Yan
66
0
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
188
5
0
13 Dec 2023
Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models
Theodore Wolf
Nantas Nardelli
John Shawe-Taylor
Maria Perez-Ortiz
107
1
0
11 Dec 2023
FOSS: A Self-Learned Doctor for Query Optimizer
Kai Zhong
Luming Sun
Tao Ji
Cuiping Li
Hong Chen
53
0
0
11 Dec 2023
Mobile Edge Computing and AI Enabled Web3 Metaverse over 6G Wireless Communications: A Deep Reinforcement Learning Approach
Wen-li Yu
Terence Jie Chua
Jun Zhao
31
0
0
11 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
80
2
0
11 Dec 2023
Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning
Mélodie Hani Daniel Zakaria
Miguel Aranda
Laurent Lequievre
S. Lengagne
J. Corrales
Y. Mezouar
AI4CE
60
6
0
08 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
85
8
0
08 Dec 2023
Efficient Parallel Reinforcement Learning Framework using the Reactor Model
Jacky Kwok
Marten Lohstroh
Edward A. Lee
64
0
0
07 Dec 2023
Multi Actor-Critic DDPG for Robot Action Space Decomposition: A Framework to Control Large 3D Deformation of Soft Linear Objects
Mélodie Daniel
Aly Magassouba
Miguel Aranda
Laurent Lequievre
Juan Antonio Corrales Ramón
Roberto Iglesias Rodriguez
Y. Mezouar
AI4CE
85
5
0
07 Dec 2023
Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Kiana Ehsani
Tanmay Gupta
Rose Hendrix
Jordi Salvador
Luca Weihs
...
Alvaro Herrasti
Ranjay Krishna
Dustin Schwenk
Eli VanderBilt
Aniruddha Kembhavi
86
23
0
05 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
94
6
0
05 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
92
2
0
05 Dec 2023
I-PHYRE: Interactive Physical Reasoning
Shiqian Li
Ke Wu
Fangqiu Yi
Yixin Zhu
LRM
91
7
0
04 Dec 2023
Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach
Xingqiu He
Chaoqun You
Tony Q.S. Quek
47
10
0
01 Dec 2023
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
Xiangyuan Zhang
Weichao Mao
S. Mowlavi
M. Benosman
Tamer Basar
OffRL
AI4CE
84
3
0
30 Nov 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
85
4
0
30 Nov 2023
LiveTune: Dynamic Parameter Tuning for Feedback-Driven Optimization
Soheil Zibakhsh Shabgahi
Nojan Sheybani
Aiden Tabrizi
F. Koushanfar
55
0
0
28 Nov 2023
Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning
Changyu Chen
Ramesha Karunasena
Thanh Hong Nguyen
Arunesh Sinha
Pradeep Varakantham
98
9
0
26 Nov 2023
Evidential Active Recognition: Intelligent and Prudent Open-World Embodied Perception
Lei Fan
Mingfu Liang
Yunxuan Li
Gang Hua
Ying Wu
89
6
0
23 Nov 2023
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
Yangyang Yu
Haohang Li
Zhi Chen
Yuechen Jiang
Yang Li
Denghui Zhang
Rong Liu
Jordan W. Suchow
K. Khashanah
103
72
0
23 Nov 2023
Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series
Woosung Koh
Insu Choi
Yuntae Jang
Gimin Kang
Woo Chang Kim
62
1
0
22 Nov 2023
Probabilistic Inference in Reinforcement Learning Done Right
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDL
OffRL
88
4
0
22 Nov 2023
Nav-Q: Quantum Deep Reinforcement Learning for Collision-Free Navigation of Self-Driving Cars
Akash Sinha
A. Macaluso
Matthias Klusch
82
6
0
20 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
49
3
0
17 Nov 2023
The Next 700 ML-Enabled Compiler Optimizations
S. VenkataKeerthy
Siddharth Jain
Umesh Kalvakuntla
Pranav Sai Gorantla
R. Chitale
E. Brevdo
Albert Cohen
Mircea Trofin
Ramakrishna Upadrasta
51
3
0
17 Nov 2023
JaxMARL: Multi-Agent RL Environments in JAX
Alex Rutherford
Benjamin Ellis
Matteo Gallici
Jonathan Cook
Andrei Lupu
...
Bruno Lacerda
Nick Hawes
Tim Rocktaschel
Chris Xiaoxuan Lu
Jakob N. Foerster
122
20
0
16 Nov 2023
Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge
Sang-Hyun Lee
Seung-Woo Seo
ODL
CLL
SSL
77
3
0
15 Nov 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
62
2
0
14 Nov 2023
Untargeted Black-box Attacks for Social Recommendations
Wenqi Fan
Shijie Wang
Xiao Wei
Xiaowei Mei
Qing Li
MLAU
AAML
70
3
0
13 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
77
0
0
12 Nov 2023
An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning
Xubo Yang
Jian Gao
Ting Wang
Yaozhen He
57
0
0
11 Nov 2023
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization
Jared Markowitz
Edward W. Staley
OffRL
75
2
0
10 Nov 2023
Autonomous Advanced Aerial Mobility -- An End-to-end Autonomy Framework for UAVs and Beyond
Sakshi Mishra
Praveen Palanisamy
94
16
0
08 Nov 2023
Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments
Ryo Ueda
Tadahiro Taniguchi
59
7
0
08 Nov 2023
A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems
Cheng Yin
Yi Chen
39
0
0
07 Nov 2023
Hypothesis Network Planned Exploration for Rapid Meta-Reinforcement Learning Adaptation
Maxwell J. Jacobson
Yexiang Xue
88
0
0
07 Nov 2023
Environmental-Impact Based Multi-Agent Reinforcement Learning
Farinaz Alamiyan Harandi
Pouria Ramazi
125
1
0
06 Nov 2023
Kindness in Multi-Agent Reinforcement Learning
Farinaz Alamiyan Harandi
Mersad Hassanjani
Pouria Ramazi
41
0
0
06 Nov 2023
Causal Question Answering with Reinforcement Learning
Lukas Blübaum
Stefan Heindorf
CML
80
4
0
05 Nov 2023
Towards model-free RL algorithms that scale well with unstructured data
Joseph Modayil
Zaheer Abbas
OffRL
60
3
0
03 Nov 2023
High Probability Convergence of Adam Under Unbounded Gradients and Affine Variance Noise
Yusu Hong
Junhong Lin
67
9
0
03 Nov 2023
Optimistic Multi-Agent Policy Gradient
Wenshuai Zhao
Yi Zhao
Zhiyuan Li
Arno Solin
Joni Pajarinen
81
2
0
03 Nov 2023
Epidemic Decision-making System Based Federated Reinforcement Learning
Yangxi Zhou
Junping Du
Zhe Xue
Zhenhui Pan
Weikang Chen
50
0
0
03 Nov 2023
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression
Jiaming Guo
Rui Zhang
Shaohui Peng
Qi Yi
Xingui Hu
...
Zidong Du
Xishan Zhang
Ling Li
Qi Guo
Yunji Chen
OffRL
74
7
0
02 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
64
5
0
31 Oct 2023
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods
Zhengpeng Xie
Changdong Yu
Weizheng Qiao
98
1
0
31 Oct 2023
Handover Protocol Learning for LEO Satellite Networks: Access Delay and Collision Minimization
Ju-Hyung Lee
C. Park
Soohyun Park
A. Molisch
136
11
0
31 Oct 2023
Network Contention-Aware Cluster Scheduling with Reinforcement Learning
Junyeol Ryu
Jeongyoon Eo
GNN
37
0
0
31 Oct 2023
Asymmetric Diffusion Based Channel-Adaptive Secure Wireless Semantic Communications
Xintian Ren
Jun Wu
Hansong Xu
Qianqian Pan
DiffM
64
2
0
30 Oct 2023
Previous
1
2
3
...
10
11
12
...
70
71
72
Next