Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Segmenting Action-Value Functions Over Time-Scales in SARSA via TD(
Δ
\Delta
Δ
)
Mahammad Humayoo
81
0
0
22 Nov 2024
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
98
1
0
21 Nov 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
106
1
0
20 Nov 2024
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
Kevin Godin-Dubois
Karine Miras
Anna V. Kononova
94
0
0
20 Nov 2024
Bitcoin Under Volatile Block Rewards: How Mempool Statistics Can Influence Bitcoin Mining
Roozbeh Sarenche
Alireza Aghabagherloo
S. Nikova
Bart Preneel
122
0
0
18 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
184
3
0
18 Nov 2024
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan
Anan Schütt
Jill-Jênn Vie
Arun-Balajiee Lekshmi-Narayanan
Elisabeth André
Nicolas Vayatis
AI4Ed
120
0
0
18 Nov 2024
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
Philips George John
Arnab Bhattacharyya
Silviu Maniu
Dimitrios Myrisiotis
Zhenan Wu
OffRL
83
0
0
16 Nov 2024
Multi-agent Path Finding for Timed Tasks using Evolutionary Games
Sheryl Paul
Anand Balakrishnan
Xin Qin
Jyotirmoy V. Deshmukh
57
0
0
15 Nov 2024
Innate-Values-driven Reinforcement Learning based Cognitive Modeling
Qin Yang
84
0
0
14 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
98
6
0
08 Nov 2024
Retentive Neural Quantum States: Efficient Ansätze for Ab Initio Quantum Chemistry
Oliver Knitter
Dan Zhao
J. Stokes
M. Ganahl
Stefan Leichenauer
S. Veerapaneni
59
2
0
06 Nov 2024
Hierarchical Orchestra of Policies
Thomas P Cannon
Özgür Simsek
CLL
55
0
0
05 Nov 2024
When to Localize? A Risk-Constrained Reinforcement Learning Approach
Chak Lam Shek
Kasra Torshizi
Troi Williams
Pratap Tokekar
123
2
0
05 Nov 2024
Accelerating Task Generalisation with Multi-Level Skill Hierarchies
Thomas P Cannon
Özgür Simsek
AI4CE
77
0
0
05 Nov 2024
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Bowen Li
Zhaoyu Li
Qiwei Du
Jinqi Luo
Wenshan Wang
...
Katia Sycara
Pradeep Kumar Ravikumar
Alexander G. Gray
X. Si
Sebastian A. Scherer
AI4CE
LRM
161
5
0
01 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
36
0
0
31 Oct 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
68
0
0
31 Oct 2024
AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection
Yujin Wang
Tianyi Xu
Fan Zhang
Tianfan Xue
Liang Feng
VLM
73
6
0
30 Oct 2024
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng
Zehao Li
Peng Jiang
Yijie Peng
57
0
0
28 Oct 2024
FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents
Jannis Weil
Jonas Ringsdorf
Julian Barthel
Yi-Ping Phoebe Chen
Tobias Meuser
OffRL
44
0
0
28 Oct 2024
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations
Eduardo C. Garrido-Merchán
Maria Coronado Vaca
Álvaro López-López
Carlos Martínez de Ibarreta
74
0
0
27 Oct 2024
Multi-agent cooperation through learning-aware policy gradients
Alexander Meulemans
Seijin Kobayashi
J. Oswald
Nino Scherrer
Eric Elmoznino
Blake A. Richards
Guillaume Lajoie
Blaise Agüera y Arcas
João Sacramento
79
1
0
24 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
183
11
0
23 Oct 2024
Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts
Sheryl Paul
Jyotirmoy V. Deshmukh
62
0
0
22 Oct 2024
LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded"
Som Sagar
Aditya Taparia
Ransalu Senanayake
39
0
0
22 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Wei Wei
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
36
0
0
22 Oct 2024
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Hanlin Yang
Jian Yao
Weiming Liu
Qing Wang
Hanmin Qin
...
Hongwu Chen
Juchao Zhuo
Qiang Fu
Yang Wei
Haobo Fu
63
1
0
21 Oct 2024
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution
Zijie Zhao
Roy E. Welsch
AIFin
112
1
0
19 Oct 2024
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments
Mariusz Wisniewski
Paraskevas Chatzithanos
Weisi Guo
Antonios Tsourdos
64
3
0
18 Oct 2024
Streaming Deep Reinforcement Learning Finally Works
Mohamed Elsayed
Gautham Vasan
A. R. Mahmood
OffRL
113
6
0
18 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
115
4
0
18 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yue Yang
79
5
0
18 Oct 2024
AERO: Softmax-Only LLMs for Efficient Private Inference
N. Jha
Brandon Reagen
107
5
0
16 Oct 2024
EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at Edge
Motahare Mounesan
Xiaojie Zhang
S. Debroy
56
1
0
16 Oct 2024
TradExpert: Revolutionizing Trading with Mixture of Expert LLMs
Qianggang Ding
Haochen Shi
Jiadong Guo
Bang Liu
AIFin
112
3
0
16 Oct 2024
Understanding Likelihood Over-optimisation in Direct Alignment Algorithms
Zhengyan Shi
Sander Land
Acyr Locatelli
Matthieu Geist
Max Bartolo
113
8
0
15 Oct 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
135
5
0
15 Oct 2024
Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning
Bokai Hu
Sai Ashish Somayajula
Xin Pan
Zihan Huang
OffRL
31
1
0
14 Oct 2024
Multi-Agent Actor-Critics in Autonomous Cyber Defense
Mingjun Wang
Remington Dechene
127
0
0
11 Oct 2024
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
94
2
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
240
0
0
10 Oct 2024
Effective Exploration Based on the Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
64
2
0
09 Oct 2024
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroñ
Kamil Faber
OffRL
61
1
0
08 Oct 2024
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
74
0
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
104
1
0
07 Oct 2024
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng
Hao Luan
Kevin Yuchen Ma
Harold Soh
83
2
0
03 Oct 2024
Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting
Alessio Russo
Alberto Maria Metelli
Marcello Restelli
58
0
0
02 Oct 2024
Criticality and Safety Margins for Reinforcement Learning
Alexander Grushin
Walt Woods
Alvaro Velasquez
Simon Khan
AAML
102
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
110
3
0
25 Sep 2024
Previous
1
2
3
4
5
...
70
71
72
Next