ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXivPDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,546 papers shown
Title
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
Yan Yang
Bin Gao
Ya-xiang Yuan
50
2
0
30 May 2024
Offline Regularised Reinforcement Learning for Large Language Models
  Alignment
Offline Regularised Reinforcement Learning for Large Language Models Alignment
Pierre Harvey Richemond
Yunhao Tang
Daniel Guo
Daniele Calandriello
M. G. Azar
...
Gil Shamir
Rishabh Joshi
Tianqi Liu
Rémi Munos
Bilal Piot
OffRL
46
24
0
29 May 2024
Correctable Landmark Discovery via Large Models for Vision-Language
  Navigation
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
39
6
0
29 May 2024
Counterexample-Guided Repair of Reinforcement Learning Systems Using
  Safety Critics
Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics
David Boetius
Stefan Leue
28
0
0
24 May 2024
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Model-free reinforcement learning with noisy actions for automated experimental control in optics
Lea Richtmann
Viktoria-S. Schmiesing
Dennis Wilken
Jan Heine
Aaron Tranter
Avishek Anand
Tobias J. Osborne
M. Heurs
33
2
0
24 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
44
2
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
45
0
23 May 2024
ACEGEN: Reinforcement learning of generative chemical agents for drug
  discovery
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
Albert Bou
Morgan Thomas
Sebastian Dittert
Carles Navarro Ramírez
Maciej Majewski
...
Mazen Ahmad
Vincent Moens
Woody Sherman
Simone Sciabola
Gianni De Fabritiis
55
6
0
07 May 2024
SwiftRL: Towards Efficient Reinforcement Learning on Real
  Processing-In-Memory Systems
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Kailash Gogineni
Sai Santosh Dayapule
Juan Gómez Luna
Karthikeya Gogineni
Peng Wei
Tian-Shing Lan
Mohammad Sadrosadati
Onur Mutlu
Guru Venkataramani
55
11
0
07 May 2024
Adversarial Attacks on Reinforcement Learning Agents for Command and
  Control
Adversarial Attacks on Reinforcement Learning Agents for Command and Control
Ahaan Dabholkar
James Z. Hare
Mark R. Mittrick
John Richardson
Nick Waytowich
Priya Narayanan
Saurabh Bagchi
AAML
39
1
0
02 May 2024
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement
  Learning
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
42
1
0
30 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for
  Mobile Edge Computing, its Applications, and Future Research Trajectories
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
39
6
0
22 Apr 2024
A survey of air combat behavior modeling using machine learning
A survey of air combat behavior modeling using machine learning
Patrick Ribu Gorton
Andreas Strand
K. Brathen
AI4CE
37
8
0
22 Apr 2024
Effective Reinforcement Learning Based on Structural Information
  Principles
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
42
0
0
15 Apr 2024
TDANet: Target-Directed Attention Network For Object-Goal Visual
  Navigation With Zero-Shot Ability
TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability
Shiwei Lian
Feitian Zhang
39
3
0
12 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in
  Reinforcement Learning
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
58
4
0
12 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
53
6
0
09 Apr 2024
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Guangchen Lan
Dong-Jun Han
Abolfazl Hashemi
Vaneet Aggarwal
Christopher G. Brinton
124
15
0
09 Apr 2024
Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks
Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks
Xingran Chen
Navid Naderializadeh
Alejandro Ribeiro
Shirin Saeedi Bidokhti
177
1
0
04 Apr 2024
VLRM: Vision-Language Models act as Reward Models for Image Captioning
VLRM: Vision-Language Models act as Reward Models for Image Captioning
Maksim Dzabraev
Alexander Kunitsyn
Andrei Ivaniuta
VLM
MLLM
31
3
0
02 Apr 2024
Survey of Computerized Adaptive Testing: A Machine Learning Perspective
Survey of Computerized Adaptive Testing: A Machine Learning Perspective
Qi Liu
Zhuang Yan
Haoyang Bi
Zhenya Huang
Weizhe Huang
...
Z. Pardos
Haiping Ma
Mengxiao Zhu
Shijin Wang
Enhong Chen
AI4Ed
49
9
0
31 Mar 2024
One-Shot Averaging for Distributed TD($λ$) Under Markov Sampling
One-Shot Averaging for Distributed TD(λλλ) Under Markov Sampling
Haoxing Tian
I. Paschalidis
Alexander Olshevsky
OffRL
47
4
0
13 Mar 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep
  Reinforcement Learning
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
45
5
0
12 Mar 2024
Koopman-Assisted Reinforcement Learning
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
40
6
0
04 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
A Critical Evaluation of AI Feedback for Aligning Large Language Models
A Critical Evaluation of AI Feedback for Aligning Large Language Models
Archit Sharma
Sedrick Scott Keh
Eric Mitchell
Chelsea Finn
Kushal Arora
Thomas Kollar
ALM
LLMAG
29
23
0
19 Feb 2024
Self-evolving Autoencoder Embedded Q-Network
Self-evolving Autoencoder Embedded Q-Network
Ieee J. Senthilnath Senior Member
Zhen Bangjian Zhou
Wei Ng
Deeksha Aggarwal
Rajdeep Dutta
Ji Wei Yoon
Phyu Aung
Keyu Wu
Ieee Li Fellow
Xiaoli Li
64
1
0
18 Feb 2024
OptEx: Expediting First-Order Optimization with Approximately
  Parallelized Iterations
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
Yao Shu
Jiongfeng Fang
Y. He
Fei Richard Yu
35
0
0
18 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
34
1
0
15 Feb 2024
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement
  Learning
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning
A. S. Nipu
Siming Liu
Anthony Harris
27
4
0
12 Feb 2024
Training Large Language Models for Reasoning through Reverse Curriculum
  Reinforcement Learning
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Zhiheng Xi
Wenxiang Chen
Boyang Hong
Senjie Jin
Rui Zheng
...
Xinbo Zhang
Peng Sun
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
42
22
0
08 Feb 2024
COA-GPT: Generative Pre-trained Transformers for Accelerated Course of
  Action Development in Military Operations
COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations
Vinicius G. Goecks
Nicholas R. Waytowich
SLR
48
7
0
01 Feb 2024
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning:
  Theory, Algorithms and Implementations
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
Matthias Lehmann
46
0
0
24 Jan 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
45
1
0
17 Jan 2024
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
Multi-agent reinforcement learning using echo-state network and its
  application to pedestrian dynamics
Multi-agent reinforcement learning using echo-state network and its application to pedestrian dynamics
Hisato Komatsu
16
1
0
19 Dec 2023
Colored Noise in PPO: Improved Exploration and Performance through
  Correlated Action Sampling
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob J. Hollenstein
Georg Martius
J. Piater
22
3
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
28
10
0
18 Dec 2023
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human
  Preferences
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Minyoung Hwang
Luca Weihs
Chanwoo Park
Kimin Lee
Aniruddha Kembhavi
Kiana Ehsani
37
18
0
14 Dec 2023
Improve Robustness of Reinforcement Learning against Observation
  Perturbations via $l_\infty$ Lipschitz Policy Networks
Improve Robustness of Reinforcement Learning against Observation Perturbations via l∞l_\inftyl∞​ Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
48
4
0
14 Dec 2023
Adaptive parameter sharing for multi-agent reinforcement learning
Adaptive parameter sharing for multi-agent reinforcement learning
Dapeng Li
Na Lou
Bin Zhang
Zhiwei Xu
Guoliang Fan
32
3
0
14 Dec 2023
Machine Learning for the Multi-Dimensional Bin Packing Problem:
  Literature Review and Empirical Evaluation
Machine Learning for the Multi-Dimensional Bin Packing Problem: Literature Review and Empirical Evaluation
Wenjie Wu
Changjun Fan
Jin-Yu Huang
Zhong Liu
Junchi Yan
38
0
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Mobile Edge Computing and AI Enabled Web3 Metaverse over 6G Wireless
  Communications: A Deep Reinforcement Learning Approach
Mobile Edge Computing and AI Enabled Web3 Metaverse over 6G Wireless Communications: A Deep Reinforcement Learning Approach
Wen-li Yu
Terence Jie Chua
Jun Zhao
24
0
0
11 Dec 2023
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Jing Hou
Guang Chen
Ruiqi Zhang
Zhijun Li
Shangding Gu
Changjun Jiang
OffRL
32
2
0
11 Dec 2023
Robotic Control of the Deformation of Soft Linear Objects Using Deep
  Reinforcement Learning
Robotic Control of the Deformation of Soft Linear Objects Using Deep Reinforcement Learning
Mélodie Hani Daniel Zakaria
Miguel Aranda
Laurent Lequievre
S. Lengagne
J. Corrales
Y. Mezouar
AI4CE
20
6
0
08 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive
  Learning and Reinforced Incremental Clustering
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
39
5
0
08 Dec 2023
Towards a Standardized Reinforcement Learning Framework for AAM
  Contingency Management
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
19
2
0
17 Nov 2023
Autonomous Advanced Aerial Mobility -- An End-to-end Autonomy Framework
  for UAVs and Beyond
Autonomous Advanced Aerial Mobility -- An End-to-end Autonomy Framework for UAVs and Beyond
Sakshi Mishra
Praveen Palanisamy
36
16
0
08 Nov 2023
Previous
123456...293031
Next