Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,517 papers shown
Title
Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots
Yu Wang
Wenchuan Jia
Yi Sun
Dong He
34
0
0
25 Sep 2024
Symbolic State Partitioning for Reinforcement Learning
Mohsen Ghaffari
Mahsa Varshosaz
E. Johnsen
Andrzej Wasowski
20
1
0
25 Sep 2024
Neural Coordination and Capacity Control for Inventory Management
Carson Eisenach
Udaya Ghai
Dhruv Madeka
Kari Torkkola
Dean Phillips Foster
Sham Kakade
29
0
0
24 Sep 2024
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
Chen Tessler
Yunrong Guo
Ofir Nabati
Gal Chechik
Xue Bin Peng
VGen
AI4CE
45
30
0
22 Sep 2024
Causal Reinforcement Learning for Optimisation of Robot Dynamics in Unknown Environments
Julian Gerald Dcruz
Sam Mahoney
Jia Yun Chua
Adoundeth Soukhabandith
John Mugabe
Weisi Guo
Miguel Arana-Catania
29
0
0
20 Sep 2024
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
Alihan Hüyük
A. R. Koblitz
Atefeh Mohajeri
M. Andrews
OffRL
40
0
0
19 Sep 2024
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
43
2
0
13 Sep 2024
Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies
Alexei Pisacane
Victor-Alexandru Darvariu
Mirco Musolesi
23
0
0
12 Sep 2024
GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning
Heng Xiong
Changrong Guo
Jian Peng
Kai Ding
Wenjie Chen
Xuchong Qiu
Long Bai
Jianfeng Xu
OffRL
34
4
0
09 Sep 2024
QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE
Junjie Zhao
Chengxi Zhang
Min Qin
Peng Yang
OOD
31
4
0
08 Sep 2024
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization
Gao Tianci
Dmitriev D. Dmitry
Konstantin A. Neusypin
Yang Bo
Rao Shengren
OffRL
33
1
0
02 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
37
0
0
02 Sep 2024
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU
Peng Zhu
Yuante Li
Yifan Hu
Qinyuan Liu
Dawei Cheng
Yuqi Liang
AIFin
AI4TS
46
4
0
26 Aug 2024
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
51
9
0
21 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
95
3
0
20 Aug 2024
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
Jianye Xu
Pan Hu
Bassam Alrifaee
44
5
0
14 Aug 2024
Parallel Distributional Deep Reinforcement Learning for Mapless Navigation of Terrestrial Mobile Robots
V. A. Kich
A. H. Kolling
J. C. Jesus
Gabriel V. Heisler
Hiago Jacobs
...
André da Silva Kelbouscas
Akihisa Ohya
Ricardo B. Grando
Paulo Lilles Jorge Drews-Jr
D. T. Gamarra
38
3
0
11 Aug 2024
Emergence in Multi-Agent Systems: A Safety Perspective
Philipp Altmann
Julian Schonberger
Steffen Illium
Maximilian Zorn
Fabian Ritz
Tom Haider
Simon Burton
Thomas Gabor
40
1
0
08 Aug 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
60
0
0
08 Aug 2024
Adversarial Safety-Critical Scenario Generation using Naturalistic Human Driving Priors
Kunkun Hao
Yonggang Luo
Wen Cui
Yuqiao Bai
Jucheng Yang
Songyang Yan
Yuxi Pan
Zijiang Yang
AAML
36
19
0
06 Aug 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
47
0
0
05 Aug 2024
SAPG: Split and Aggregate Policy Gradients
Jayesh Singla
Ananye Agarwal
Deepak Pathak
OffRL
OnRL
42
3
0
29 Jul 2024
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
30
0
0
23 Jul 2024
Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN
Norman Becker
Daniel Reti
Evridiki V. Ntagiou
M. Wallum
Hans D. Schotten
37
1
0
22 Jul 2024
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C
Neil De La Fuente
Daniel A. Vidal Guerra
OffRL
24
5
0
19 Jul 2024
Optimistic Q-learning for average reward and episodic reinforcement learning
Priyank Agrawal
Shipra Agrawal
56
4
0
18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
21
0
0
18 Jul 2024
Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas
L. Kuhlmann
Gideon Kowadlo
47
0
0
16 Jul 2024
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau
A. Baratin
Kisoo Kwon
Boris Knyazev
Yan Zhang
45
1
0
12 Jul 2024
RoboMorph: Evolving Robot Morphology using Large Language Models
Kevin Qiu
Krzysztof Ciebiera
Krzysztof Ciebiera
Marek Cygan
Marek Cygan
Łukasz Kuciński
LM&Ro
54
0
0
11 Jul 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Xuyang Chen
Lin Zhao
45
0
0
09 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
42
1
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
16
0
05 Jul 2024
A Role of Environmental Complexity on Representation Learning in Deep Reinforcement Learning Agents
Andrew Liu
Alla Borisyuk
37
1
0
03 Jul 2024
Reinforcement Learning-driven Data-intensive Workflow Scheduling for Volunteer Edge-Cloud
Motahare Mounesan
Mauro Lemus
H. Yeddulapalli
Prasad Calyam
S. Debroy
OffRL
19
6
0
01 Jul 2024
Towards shutdownable agents via stochastic choice
Elliott Thornley
Alexander Roman
Christos Ziakas
Leyton Ho
Louis Thomson
46
0
0
30 Jun 2024
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
Charlotte Beylier
Simon M. Hofmann
Nico Scherf
26
0
0
20 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
Yingnan Zhao
43
1
0
17 Jun 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
Yizhe Huang
Guy Van den Broeck
Fanqi Kong
Yaodong Yang
Song-Chun Zhu
Xue Feng
39
3
0
12 Jun 2024
CHARME: A chain-based reinforcement learning approach for the minor embedding problem
Hoang M. Ngo
Nguyen H K. Do
Minh Nhat Vu
Tamer Kahveci
My T. Thai
AI4CE
30
2
0
11 Jun 2024
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
Wichayaporn Wongkamjan
Feng Gu
Yanze Wang
Ulf Hermjakob
Jonathan May
Brandon M. Stewart
Jonathan K. Kummerfeld
Denis Peskoff
Jordan L. Boyd-Graber
53
3
0
07 Jun 2024
Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning
Jihyeon Seong
Sekwang Oh
Jaesik Choi
AI4TS
47
0
0
06 Jun 2024
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes
Johannes Muller
Semih Cayci
50
0
0
06 Jun 2024
Speeding up Policy Simulation in Supply Chain RL
Vivek Farias
Joren Gijsbrechts
Aryan I. Khojandi
Tianyi Peng
A. Zheng
49
0
0
04 Jun 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
47
9
0
03 Jun 2024
Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents
John L. Zhou
Weizhe Hong
Jonathan C. Kao
35
0
0
03 Jun 2024
SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents
Ethan Rathbun
Christopher Amato
Alina Oprea
OffRL
AAML
46
4
0
30 May 2024
Learning Latent Graph Structures and their Uncertainty
A. Manenti
Daniele Zambon
Cesare Alippi
BDL
38
1
0
30 May 2024
Previous
1
2
3
4
5
...
29
30
31
Next