1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning
Jihyeon Seong
Sekwang Oh
Jaesik Choi
AI4TS
119
0
0
06 Jun 2024
Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning
Mohamed Elsayed
Homayoon Farrahi
Felix Dangel
A. Rupam Mahmood
73
4
0
05 Jun 2024
Speeding up Policy Simulation in Supply Chain RL
Vivek Farias
Joren Gijsbrechts
Aryan I. Khojandi
Tianyi Peng
A. Zheng
101
0
0
04 Jun 2024
CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework
Yiyang Zhao
Yunzhuo Liu
Bo Jiang
Tian Guo
101
3
0
03 Jun 2024
Learning-based legged locomotion; state of the art and future perspectives
Sehoon Ha
Joonho Lee
M. van de Panne
Zhaoming Xie
Wenhao Yu
Majid Khadiv
147
20
0
03 Jun 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
90
10
0
03 Jun 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
120
0
0
03 Jun 2024
SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems
Nathan Corecco
Giorgio Piatti
Luca A. Lanzendörfer
Flint Xiaofeng Fan
Roger Wattenhofer
OffRL
66
3
0
01 Jun 2024
SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents
Ethan Rathbun
Christopher Amato
Alina Oprea
OffRL
AAML
76
6
0
30 May 2024
A Deep Reinforcement Learning Approach for Trading Optimization in the Forex Market with Multi-Agent Asynchronous Distribution
Davoud Sarani
Parviz Rashidi Khazaee
31
0
0
30 May 2024
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
133
2
0
30 May 2024
Learning Latent Graph Structures and their Uncertainty
A. Manenti
Daniele Zambon
Cesare Alippi
BDL
166
1
0
30 May 2024
Offline Regularised Reinforcement Learning for Large Language Models Alignment
Pierre Harvey Richemond
Yunhao Tang
Daniel Guo
Daniele Calandriello
M. G. Azar
...
Gil Shamir
Rishabh Joshi
Tianqi Liu
Rémi Munos
Bilal Piot
OffRL
121
29
0
29 May 2024
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
123
9
0
29 May 2024
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
Ju-Seung Byun
Andrew Perrault
57
1
0
27 May 2024
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Shutong Ding
Ke Hu
Zhenhao Zhang
Kan Ren
Weinan Zhang
Jingyi Yu
Jingya Wang
Ye-ling Shi
115
21
0
25 May 2024
Pessimistic Backward Policy for GFlowNets
Hyosoon Jang
Yunhui Jang
Minsu Kim
Jinkyoo Park
SungSoo Ahn
118
7
0
25 May 2024
Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics
David Boetius
Stefan Leue
84
0
0
24 May 2024
Transmission Interface Power Flow Adjustment: A Deep Reinforcement Learning Approach based on Multi-task Attribution Map
Shunyu Liu
Wei Luo
Yanzhen Zhou
Kaixuan Chen
Quan Zhang
Huating Xu
Qinglai Guo
Mingli Song
58
15
0
24 May 2024
Leveraging Unknown Objects to Construct Labeled-Unlabeled Meta-Relationships for Zero-Shot Object Navigation
Yanwei Zheng
Changrui Li
Chuanlin Lan
Yaling Li
Xiao Zhang
Yifei Zou
Dongxiao Yu
Zhipeng Cai
74
0
0
24 May 2024
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
93
2
0
24 May 2024
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks
Jingchi Jiang
Rujia Shen
Boran Wang
Yi Guan
OffRL
BDL
73
1
0
23 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
109
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
335
54
0
23 May 2024
GASE: Graph Attention Sampling with Edges Fusion for Solving Vehicle Routing Problems
Zhenwei Wang
Ruibin Bai
Fazlullah Khan
Ender Özcan
Tiehua Zhang
GNN
101
2
0
21 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
François Fleuret
112
69
0
20 May 2024
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation
Suorong Yang
Peijia Li
Xin Xiong
Shen Furao
Jian Zhao
66
2
0
19 May 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
64
1
0
19 May 2024
Reinforcement learning
Florentin Wörgötter
100
2,526
0
16 May 2024
Stochastic Q-learning for Large Discrete Action Spaces
Fares Fourati
Vaneet Aggarwal
Mohamed-Slim Alouini
OffRL
78
4
0
16 May 2024
When Large Language Model Meets Optimization
Sen Huang
Kaixiang Yang
Sheng Qi
Rui Wang
102
14
0
16 May 2024
Deep Learning in Earthquake Engineering: A Comprehensive Review
Yazhou Xie
AI4CE
110
5
0
15 May 2024
CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning
Jingwen Wang
Dehui Du
Yida Li
Yiyang Li
Yikang Chen
AI4TS
CML
37
0
0
14 May 2024
Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning
M. Khan
Syed Hammad Ahmed
G. Sukthankar
68
0
0
14 May 2024
MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction
Haopeng Wang
Zijian Long
Haiwei Dong
Abdulmotaleb El Saddik
136
5
0
13 May 2024
Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models
Andrii Tytarenko
OffRL
97
1
0
13 May 2024
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds
Christopher Cui
Xiangyu Peng
Mark O. Riedl
LLMAG
OffRL
MoE
87
1
0
09 May 2024
Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning
Haobin Zhang
Zhuang Yang
68
0
0
08 May 2024
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
Albert Bou
Morgan Thomas
Sebastian Dittert
Carles Navarro Ramírez
Maciej Majewski
...
Mazen Ahmad
Vincent Moens
Woody Sherman
Simone Sciabola
Gianni De Fabritiis
96
9
0
07 May 2024
TorchDriveEnv: A Reinforcement Learning Benchmark for Autonomous Driving with Reactive, Realistic, and Diverse Non-Playable Characters
J. Lavington
Ke Zhang
Vasileios Lioutas
Matthew Niedoba
Yunpeng Liu
...
Xiaoxuan Liang
Setareh Dabiri
Adam Scibior
Berend Zwartsenberg
Frank Wood
87
5
0
07 May 2024
Pragmatist Intelligence: Where the Principle of Usefulness Can Take ANNs
Antonio Bikić
Sayan Mukherjee
82
0
0
07 May 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
99
11
0
07 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
107
4
0
06 May 2024
Artificial Intelligence in the Autonomous Navigation of Endovascular Interventions: A Systematic Review
Harry Robertshaw
Lennart Karstensen
Benjamin Jackson
Hadi Sadati
K. Rhode
Sebastien Ourselin
Alejandro Granados
Thomas C Booth
59
14
0
06 May 2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
82
0
0
04 May 2024
Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
Anton Plaksin
Vitaly Kalev
65
1
0
03 May 2024
Adversarial Attacks on Reinforcement Learning Agents for Command and Control
Ahaan Dabholkar
James Z. Hare
Mark R. Mittrick
John Richardson
Nick Waytowich
Priya Narayanan
Saurabh Bagchi
AAML
66
1
0
02 May 2024
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Skander Moalla
Andrea Miele
Razvan Pascanu
Çağlar Gülçehre
95
6
0
01 May 2024
HUGO -- Highlighting Unseen Grid Options: Combining Deep Reinforcement Learning with a Heuristic Target Topology Approach
Malte Lehna
Clara Holzhuter
Sven Tomforde
Christoph Scholz
70
6
0
01 May 2024
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
Anna-Lena Schlamp
Werner Huber
Stefanie Schmidtner
34
0
0
01 May 2024