ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
An Efficient Learning-based Solver Comparable to Metaheuristics for the
  Capacitated Arc Routing Problem
An Efficient Learning-based Solver Comparable to Metaheuristics for the Capacitated Arc Routing Problem
Runze Guo
Feng Xue
Anlong Ming
N. Sebe
132
0
0
11 Mar 2024
Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace
Zero-shot cross-modal transfer of Reinforcement Learning policies through a Global Workspace
Léopold Maytié
Benjamin Devillers
Alexandre Arnold
R. V. Rullen
OffRL
61
1
0
07 Mar 2024
Stop Regressing: Training Value Functions via Classification for
  Scalable Deep RL
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother
Jordi Orbay
Q. Vuong
Adrien Ali Taïga
Yevgen Chebotar
...
Sergey Levine
Pablo Samuel Castro
Aleksandra Faust
Aviral Kumar
Rishabh Agarwal
OffRL
105
66
0
06 Mar 2024
A Survey on Applications of Reinforcement Learning in Spatial Resource
  Allocation
A Survey on Applications of Reinforcement Learning in Spatial Resource Allocation
Di Zhang
Moyang Wang
Joseph D Mango
Xiang Li
Xianrui Xu
109
1
0
06 Mar 2024
Koopman-Assisted Reinforcement Learning
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
109
10
0
04 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language
  Models
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
78
2
0
04 Mar 2024
Towards Fair and Efficient Learning-based Congestion Control
Towards Fair and Efficient Learning-based Congestion Control
Xudong Liao
Han Tian
Chaoliang Zeng
Xinchen Wan
Kai Chen
48
7
0
04 Mar 2024
Towards Provable Log Density Policy Gradient
Towards Provable Log Density Policy Gradient
Pulkit Katdare
Anant Joshi
Katherine Driggs-Campbell
67
0
0
03 Mar 2024
Deep Reinforcement Learning for Solving Management Problems: Towards A
  Large Management Mode
Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode
Jinyang Jiang
Xiaotian Liu
Tao Ren
Qinghao Wang
Yi Zheng
Yufu Du
Yijie Peng
Cheng Zhang
OffRLAI4CE
37
0
0
01 Mar 2024
Curiosity-driven Red-teaming for Large Language Models
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
Tsun-Hsuan Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
116
45
0
29 Feb 2024
Aligning Knowledge Graph with Visual Perception for Object-goal
  Navigation
Aligning Knowledge Graph with Visual Perception for Object-goal Navigation
Nuo Xu
Wen Wang
Rong Yang
Mengjie Qin
Zheyuan Lin
Wei Song
Chunlong Zhang
J. Gu
Chao Li
100
9
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
88
4
0
29 Feb 2024
Learning to Program Variational Quantum Circuits with Fast Weights
Learning to Program Variational Quantum Circuits with Fast Weights
Samuel Yen-Chi Chen
124
14
0
27 Feb 2024
Deep Reinforcement Learning (DRL)-based Methods for Serverless Stream
  Processing Engines: A Vision, Architectural Elements, and Future Directions
Deep Reinforcement Learning (DRL)-based Methods for Serverless Stream Processing Engines: A Vision, Architectural Elements, and Future Directions
Maria R. Read
C. Dehury
S. Srirama
Rajkumar Buyya
AI4TSOffRL
45
1
0
27 Feb 2024
Structure-Based Drug Design via 3D Molecular Generative Pre-training and
  Sampling
Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling
Yuwei Yang
Siqi Ouyang
Xueyu Hu
Mingyue Zheng
Hao Zhou
Lei Li
89
2
0
22 Feb 2024
Automated Design and Optimization of Distributed Filtering Circuits via
  Reinforcement Learning
Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning
Peng Gao
Tao Yu
Fei Wang
Ruyue Yuan
39
1
0
22 Feb 2024
Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP
  Guided Reinforcement Learning
Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning
Antoine Chaffin
Ewa Kijak
Vincent Claveau
81
0
0
21 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
88
3
0
21 Feb 2024
Mastering the Game of Guandan with Deep Reinforcement Learning and
  Behavior Regulating
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
Yifan YangGong
Haojun Pan
Lei Wang
74
1
0
21 Feb 2024
Learning to Model Diverse Driving Behaviors in Highly Interactive
  Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning
Learning to Model Diverse Driving Behaviors in Highly Interactive Autonomous Driving Scenarios with Multi-Agent Reinforcement Learning
Weiwei Liu
Wenxuan Hu
Wei Jing
Lanxin Lei
Lingping Gao
Yong Liu
86
2
0
21 Feb 2024
Skill or Luck? Return Decomposition via Advantage Functions
Skill or Luck? Return Decomposition via Advantage Functions
Hsiao-Ru Pan
Bernhard Schölkopf
OffRL
43
5
0
20 Feb 2024
A Critical Evaluation of AI Feedback for Aligning Large Language Models
A Critical Evaluation of AI Feedback for Aligning Large Language Models
Archit Sharma
Sedrick Scott Keh
Eric Mitchell
Chelsea Finn
Kushal Arora
Thomas Kollar
ALMLLMAG
98
27
0
19 Feb 2024
Stochastic Approximation with Delayed Updates: Finite-Time Rates under
  Markovian Sampling
Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling
Arman Adibi
Nicolò Dal Fabbro
Luca Schenato
Sanjeev R. Kulkarni
H. Vincent Poor
George J. Pappas
Hamed Hassani
A. Mitra
158
9
0
19 Feb 2024
Self-evolving Autoencoder Embedded Q-Network
Self-evolving Autoencoder Embedded Q-Network
Ieee J. Senthilnath Senior Member
Zhen Bangjian Zhou
Wei Ng
Deeksha Aggarwal
Rajdeep Dutta
Ji Wei Yoon
Phyu Aung
Keyu Wu
Ieee Li Fellow
Xiaoli Li
92
1
0
18 Feb 2024
OptEx: Expediting First-Order Optimization with Approximately
  Parallelized Iterations
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
Yao Shu
Jiongfeng Fang
Y. He
Fei Richard Yu
66
0
0
18 Feb 2024
Modelling crypto markets by multi-agent reinforcement learning
Modelling crypto markets by multi-agent reinforcement learning
J. Lussange
Stefano Vrizzi
Stefano Palminteri
Boris Gutkin
AIFin
59
0
0
16 Feb 2024
Direct Preference Optimization with an Offset
Direct Preference Optimization with an Offset
Afra Amini
Tim Vieira
Ryan Cotterell
131
67
0
16 Feb 2024
Revisiting Experience Replayable Conditions
Revisiting Experience Replayable Conditions
Taisuke Kobayashi
102
3
0
15 Feb 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
92
2
0
15 Feb 2024
Learning Interpretable Policies in Hindsight-Observable POMDPs through
  Partially Supervised Reinforcement Learning
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning
Michael Lanier
Ying Xu
Nathan Jacobs
Chongjie Zhang
Yevgeniy Vorobeychik
64
2
0
14 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
66
6
0
14 Feb 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale
  Wireless Networks
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
Talha Bozkus
Urbashi Mitra
61
4
0
12 Feb 2024
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement
  Learning
MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning
Ayesha Siddika Nipu
Siming Liu
Anthony Harris
37
4
0
12 Feb 2024
Principled Penalty-based Methods for Bilevel Reinforcement Learning and
  RLHF
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
Han Shen
Zhuoran Yang
Tianyi Chen
OffRL
110
15
0
10 Feb 2024
NavFormer: A Transformer Architecture for Robot Target-Driven Navigation
  in Unknown and Dynamic Environments
NavFormer: A Transformer Architecture for Robot Target-Driven Navigation in Unknown and Dynamic Environments
Haitong Wang
Aaron Hao Tan
G. Nejat
100
14
0
09 Feb 2024
Reinforcement Learning for Blind Stair Climbing with Legged and
  Wheeled-Legged Robots
Reinforcement Learning for Blind Stair Climbing with Legged and Wheeled-Legged Robots
Simon Chamorro
Victor Klemm
M. I. Valls
Christopher Pal
Roland Siegwart
104
6
0
09 Feb 2024
Training Large Language Models for Reasoning through Reverse Curriculum
  Reinforcement Learning
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
Zhiheng Xi
Wenxiang Chen
Boyang Hong
Senjie Jin
Rui Zheng
...
Xinbo Zhang
Peng Sun
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
65
28
0
08 Feb 2024
Private Knowledge Sharing in Distributed Learning: A Survey
Private Knowledge Sharing in Distributed Learning: A Survey
Yasas Supeksala
Dinh C. Nguyen
Ming Ding
Thilina Ranbaduge
Calson Chua
Jun Zhang
Jun Li
H. Vincent Poor
96
0
0
08 Feb 2024
QGFN: Controllable Greediness with Action Values
QGFN: Controllable Greediness with Action Values
Elaine Lau
Stephen Zhewen Lu
Ling Pan
Doina Precup
Emmanuel Bengio
169
14
0
07 Feb 2024
Learning mirror maps in policy mirror descent
Learning mirror maps in policy mirror descent
Carlo Alfano
Sebastian Towers
Silvia Sapora
Chris Xiaoxuan Lu
Patrick Rebeschini
63
0
0
07 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqing Zhang
Ziwei Luo
Jens Sjölund
Thomas B. Schön
Per Mattsson
104
13
0
06 Feb 2024
In-context learning agents are asymmetric belief updaters
In-context learning agents are asymmetric belief updaters
Johannes A. Schubert
Akshay K. Jagadish
Marcel Binz
Eric Schulz
LLMAG
77
10
0
06 Feb 2024
Abstracted Trajectory Visualization for Explainability in Reinforcement
  Learning
Abstracted Trajectory Visualization for Explainability in Reinforcement Learning
Yoshiki Takagi
Roderick S. Tabalba
Nurit Kirshenbaum
Jason Leigh
35
0
0
05 Feb 2024
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised
  Environment Design
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
120
9
0
05 Feb 2024
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement
  Learning
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning
Carlos A. Velázquez-Vargas
Isaac Ray Christian
Jordan A. Taylor
Sreejan Kumar
52
0
0
05 Feb 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement
  Learning
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
Shengyi Huang
Quentin Gallouedec
Florian Felten
Antonin Raffin
Rousslan Fernand Julien Dossa
...
Alexander Nikulin
Xiao Hu
Tianlin Liu
Jongwook Choi
Brent Yi
OffRL
89
10
0
05 Feb 2024
Towards Optimal Adversarial Robust Q-learning with Bellman
  Infinity-error
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
AAML
135
2
0
03 Feb 2024
Learning the Market: Sentiment-Based Ensemble Trading Agents
Learning the Market: Sentiment-Based Ensemble Trading Agents
Andrew Ye
James Xu
Yi Wang
Yifan Yu
Daniel Yan
Ryan Chen
Bosheng Dong
Vipin Chaudhary
Shuai Xu
AIFin
25
1
0
02 Feb 2024
To the Max: Reinventing Reward in Reinforcement Learning
To the Max: Reinventing Reward in Reinforcement Learning
Grigorii Veviurko
Wendelin Bohmer
Mathijs de Weerdt
68
6
0
02 Feb 2024
Previous
123...8910...707172
Next