ResearchTrend.AI
Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning
Zun Li
Michael P. Wellman
75
1
0
30 Apr 2024
Point Cloud Models Improve Visual Robustness in Robotic Learners
Skand Peri
Iain Lee
Chanho Kim
Fuxin Li
Tucker Hermans
Stefan Lee
3DPC
125
4
0
29 Apr 2024
SAFE-RL: Saliency-Aware Counterfactual Explainer for Deep Reinforcement Learning Policies
Amir Samadi
K. Koufos
Kurt Debattista
M. Dianati
85
5
0
28 Apr 2024
Research and application of artificial intelligence based webshell detection model: A literature review
Mingrui Ma
Lansheng Han
Chunjie Zhou
134
3
0
28 Apr 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao
Jonathan D. Chang
Wenhao Zhan
Owen Oertell
Gokul Swamy
Kianté Brantley
Thorsten Joachims
J. Andrew Bagnell
Jason D. Lee
Wen Sun
OffRL
85
41
0
25 Apr 2024
AFU: Actor-Free critic Updates in off-policy RL for continuous control
Nicolas Perrin-Gilbert
OffRL
108
0
0
24 Apr 2024
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories
Ning Yang
Shuo Chen
Haijun Zhang
Randall Berry
OffRL
106
9
0
22 Apr 2024
A survey of air combat behavior modeling using machine learning
Patrick Ribu Gorton
Andreas Strand
K. Brathen
AI4CE
71
8
0
22 Apr 2024
Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming
Jie Wang
Zhihai Wang
Xijun Li
Yufei Kuang
Zhihao Shi
Fangzhou Zhu
Mingxuan Yuan
Jianguo Zeng
Yongdong Zhang
Feng Wu
83
8
0
19 Apr 2024
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
Tianfu Wang
Qilin Fan
Chao Wang
Long Yang
Leilei Ding
Nicholas Jing Yuan
Hui Xiong
72
2
0
19 Apr 2024
Breaching the Bottleneck: Evolutionary Transition from Reward-Driven Learning to Reward-Agnostic Domain-Adapted Learning in Neuromodulated Neural Nets
S. Arnold
Reiji Suzuki
Takaya Arita
Kimitoshi Yamazaki
47
0
0
19 Apr 2024
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
Chen Gong
Kecen Li
Jin Yao
Tianhao Wang
OnRL
75
1
0
18 Apr 2024
Actor-Critic Reinforcement Learning with Phased Actor
Ruofan Wu
Junmin Zhong
Jennie Si
41
0
0
18 Apr 2024
Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Avisek Naug
Alexandre Frederic Julien Pichard
Mathieu Cocho
71
8
0
17 Apr 2024
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu
Wei Fu
Jiaxuan Gao
Wenjie Ye
Weiling Liu
Zhiyu Mei
Guangju Wang
Chao Yu
Yi Wu
162
165
0
16 Apr 2024
Hierarchical Decision Making Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
75
0
0
15 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
91
38
0
12 Apr 2024
TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability
Shiwei Lian
Feitian Zhang
100
3
0
12 Apr 2024
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Hui Bai
Ran Cheng
88
6
0
12 Apr 2024
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Yunxiang Li
Rui Yuan
Chen Fan
Mark Schmidt
Samuel Horváth
Robert Mansel Gower
Martin Takáč
72
0
0
11 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
121
8
0
09 Apr 2024
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Guangchen Lan
Dong-Jun Han
Abolfazl Hashemi
Vaneet Aggarwal
Christopher G. Brinton
232
16
0
09 Apr 2024
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
Yeda Song
Dongwook Lee
Gunhee Kim
OffRL
64
1
0
06 Apr 2024
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Corby Rosset
Ching-An Cheng
Arindam Mitra
Michael Santacroce
Ahmed Hassan Awadallah
Tengyang Xie
202
132
0
04 Apr 2024
Decentralized Learning Strategies for Estimation Error Minimization with Graph Neural Networks
Xingran Chen
Navid Naderializadeh
Alejandro Ribeiro
Shirin Saeedi Bidokhti
405
1
0
04 Apr 2024
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
Mengfei Du
Binhao Wu
Jiwen Zhang
Zhihao Fan
Zejun Li
Ruipu Luo
Xuanjing Huang
Zhongyu Wei
69
3
0
02 Apr 2024
VLRM: Vision-Language Models act as Reward Models for Image Captioning
Maksim Dzabraev
Alexander Kunitsyn
Andrei Ivaniuta
VLM, MLLM
73
3
0
02 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
100
44
0
01 Apr 2024
Survey of Computerized Adaptive Testing: A Machine Learning Perspective
Qi Liu
Zhuang Yan
Haoyang Bi
Zhenya Huang
Weizhe Huang
...
Z. Pardos
Haiping Ma
Mengxiao Zhu
Shijin Wang
Enhong Chen
AI4Ed
99
10
0
31 Mar 2024
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Duzhen Zhang
Qingyu Wang
Tielin Zhang
Bo Xu
331
2
0
29 Mar 2024
Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization
T. V. Marinov
Alekh Agarwal
Mircea Trofin
OffRL
63
0
0
28 Mar 2024
Disentangling Length from Quality in Direct Preference Optimization
Ryan Park
Rafael Rafailov
Stefano Ermon
Chelsea Finn
ALM
98
145
0
28 Mar 2024
Compressed Federated Reinforcement Learning with a Generative Model
Ali Beikmohammadi
Sarit Khirirat
Sindri Magnússon
FedML
125
3
0
26 Mar 2024
Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games
Yikuan Yan
Yaolun Zhang
Keman Huang
120
10
0
26 Mar 2024
Collaborative AI Teaming in Unknown Environments via Active Goal Deduction
Zuyuan Zhang
Hanhan Zhou
Mahdi Imani
Taeyoung Lee
Tian-Shing Lan
72
11
0
22 Mar 2024
DouRN: Improving DouZero by Residual Neural Networks
Yiquan Chen
Yingchao Lyu
Di Zhang
50
0
0
21 Mar 2024
Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight
Jiaxu Xing
Angel Romero
L. Bauersfeld
Davide Scaramuzza
98
16
0
18 Mar 2024
Phasic Diversity Optimization for Population-Based Reinforcement Learning
Jingcheng Jiang
Haiyin Piao
Yu Fu
Yihang Hao
Chuanlu Jiang
Ziqi Wei
Xin Yang
60
0
0
17 Mar 2024
Reinforcement Learning with Options and State Representation
Ayoub Ghriss
Masashi Sugiyama
A. Lazaric
35
0
0
16 Mar 2024
Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer
Mengying Lin
Yaran Chen
Dong Zhao
Zhaoran Wang
116
2
0
15 Mar 2024
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
E. Zelikman
Georges Harik
Yijia Shao
Varuna Jayasiri
Nick Haber
Noah D. Goodman
LLMAG, ReLM, LRM
137
151
0
14 Mar 2024
A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning
Nawazish Ali
Abdul Wahid
Rachael Shaw
Karl Mason
22
10
0
14 Mar 2024
One-Shot Averaging for Distributed TD(λ) Under Markov Sampling
Haoxing Tian
I. Paschalidis
Alexander Olshevsky
OffRL
94
4
0
13 Mar 2024
Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello
Daniel Guo
Rémi Munos
Mark Rowland
Yunhao Tang
...
Michal Valko
Tianqi Liu
Rishabh Joshi
Zeyu Zheng
Bilal Piot
110
67
0
13 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
99
14
0
13 Mar 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
65
7
0
12 Mar 2024
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
Huijie Tang
Federico Berto
Jinkyoo Park
108
4
0
12 Mar 2024
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Pan He
Quanyi Li
Xiaoyong Yuan
Bolei Zhou
55
0
0
11 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
68
1
0
11 Mar 2024
Tactical Decision Making for Autonomous Trucks by Deep Reinforcement Learning with Total Cost of Operation Based Reward
Deepthi Pathare
Leo Laine
M. Chehreghani
75
1
0
11 Mar 2024