ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Bag of Policies for Distributional Deep Exploration
Bag of Policies for Distributional Deep Exploration
Asen Nachkov
Luchen Li
Giulia Luise
Filippo Valdettaro
Aldo A. Faisal
OffRL
84
0
0
03 Aug 2023
EdgeMatrix: A Resource-Redefined Scheduling Framework for SLA-Guaranteed
  Multi-Tier Edge-Cloud Computing Systems
EdgeMatrix: A Resource-Redefined Scheduling Framework for SLA-Guaranteed Multi-Tier Edge-Cloud Computing Systems
Shihao Shen
Yuanming Ren
Yanli Ju
Xiaofei Wang
Wenyu Wang
Victor C. M. Leung
51
16
0
01 Aug 2023
Reinforcement Learning for Generative AI: State of the Art,
  Opportunities and Open Research Challenges
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
139
22
0
31 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed
  Markov Decision Processes
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
44
3
0
29 Jul 2023
Dialogue Shaping: Empowering Agents through NPC Interaction
Dialogue Shaping: Empowering Agents through NPC Interaction
Wei Zhou
Xiangyu Peng
Mark O. Riedl
LLMAG
71
9
0
28 Jul 2023
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Amir Ramezani Dooraki
Alexandros Iosifidis
36
0
0
28 Jul 2023
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal
  Adversarial Masks
FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks
Buse G. A. Tekgul
Nadarajah Asokan
AAML
68
2
0
27 Jul 2023
FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on
  Staged Reinforcement Learning
FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
Leiming Chen
Kai Wang
Cihao Dong
Sibo Qiao
Ziling Huang
Yuming Nie
Zhaoxiang Hou
C. Tan
FedML
70
2
0
25 Jul 2023
Provable Benefits of Policy Learning from Human Preferences in
  Contextual Bandit Problems
Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems
Xiang Ji
Huazheng Wang
Minshuo Chen
Tuo Zhao
Mengdi Wang
OffRL
123
7
0
24 Jul 2023
Policy Gradient Optimal Correlation Search for Variance Reduction in
  Monte Carlo simulation and Maximum Optimal Transport
Policy Gradient Optimal Correlation Search for Variance Reduction in Monte Carlo simulation and Maximum Optimal Transport
Pierre Bras
Gilles Pagès
56
1
0
24 Jul 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
67
1
0
22 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
74
1
0
21 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&RoOffRL
122
5
0
20 Jul 2023
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Kanghoon Lee
Jiachen Li
David Isele
Jinkyoo Park
K. Fujimura
Mykel J. Kochenderfer
78
6
0
19 Jul 2023
Data Cross-Segmentation for Improved Generalization in Reinforcement
  Learning Based Algorithmic Trading
Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading
Vikram Duvvur
Aashay Mehta
Edward W. Sun
Bo Wu
Ken Yew Chan
J. Schneider
AIFin
79
0
0
18 Jul 2023
QMNet: Importance-Aware Message Exchange for Decentralized Multi-Agent
  Reinforcement Learning
QMNet: Importance-Aware Message Exchange for Decentralized Multi-Agent Reinforcement Learning
Xiufeng Huang
Sheng Zhou
120
1
0
18 Jul 2023
REX: Rapid Exploration and eXploitation for AI Agents
REX: Rapid Exploration and eXploitation for AI Agents
Rithesh Murthy
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Le Xue
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
OffRL
88
10
0
18 Jul 2023
Quarl: A Learning-Based Quantum Circuit Optimizer
Quarl: A Learning-Based Quantum Circuit Optimizer
Zikun Li
Jin-Ye Peng
Yixuan Mei
Sina Lin
Yi Wu
Oded Padon
Zhi-Long Jia
48
23
0
17 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
125
5
0
16 Jul 2023
Aeolus Ocean -- A simulation environment for the autonomous
  COLREG-compliant navigation of Unmanned Surface Vehicles using Deep
  Reinforcement Learning and Maritime Object Detection
Aeolus Ocean -- A simulation environment for the autonomous COLREG-compliant navigation of Unmanned Surface Vehicles using Deep Reinforcement Learning and Maritime Object Detection
A. Vekinis
S. Perantonis
67
0
0
13 Jul 2023
A Comprehensive Overview of Large Language Models
A Comprehensive Overview of Large Language Models
Humza Naveed
Asad Ullah Khan
Shi Qiu
Muhammad Saqib
Saeed Anwar
Muhammad Usman
Naveed Akhtar
Nick Barnes
Ajmal Mian
OffRL
261
629
0
12 Jul 2023
Maneuver Decision-Making Through Automatic Curriculum Reinforcement
  Learning Without Handcrafted Reward functions
Maneuver Decision-Making Through Automatic Curriculum Reinforcement Learning Without Handcrafted Reward functions
Hong-Peng Zhang
48
2
0
12 Jul 2023
PID-Inspired Inductive Biases for Deep Reinforcement Learning in
  Partially Observable Control Tasks
PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks
I. Char
J. Schneider
77
4
0
12 Jul 2023
A Survey From Distributed Machine Learning to Distributed Deep Learning
A Survey From Distributed Machine Learning to Distributed Deep Learning
Mohammad Dehghani
Zahra Yazdanparast
118
0
0
11 Jul 2023
Secrets of RLHF in Large Language Models Part I: PPO
Secrets of RLHF in Large Language Models Part I: PPO
Rui Zheng
Shihan Dou
Songyang Gao
Yuan Hua
Wei Shen
...
Hang Yan
Tao Gui
Qi Zhang
Xipeng Qiu
Xuanjing Huang
ALMOffRL
128
177
0
11 Jul 2023
Loss Dynamics of Temporal Difference Reinforcement Learning
Loss Dynamics of Temporal Difference Reinforcement Learning
Blake Bordelon
P. Masset
Henry Kuo
Cengiz Pehlevan
AI4CE
60
0
0
10 Jul 2023
ScriptWorld: Text Based Environment For Learning Procedural Knowledge
ScriptWorld: Text Based Environment For Learning Procedural Knowledge
Abhinav Joshi
A. Ahmad
Umang Pandey
Ashutosh Modi
51
6
0
08 Jul 2023
SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially
  Observable Multi-Agent Path Finding
SACHA: Soft Actor-Critic with Heuristic-Based Attention for Partially Observable Multi-Agent Path Finding
Qiushi Lin
Hang Ma
116
19
0
05 Jul 2023
Learning Symbolic Rules over Abstract Meaning Representations for
  Textual Reinforcement Learning
Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning
Subhajit Chaudhury
Sarathkrishna Swaminathan
Daiki Kimura
Prithviraj Sen
K. Murugesan
...
Michiaki Tatsubori
Achille Fokoue
Pavan Kapanipathi
Asim Munawar
Alexander G. Gray
NAI
67
7
0
05 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of
  Circular Cylinder with Sparse Surface Pressure Sensing
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
57
31
0
05 Jul 2023
Learning Multi-Agent Communication with Contrastive Learning
Learning Multi-Agent Communication with Contrastive Learning
Y. Lo
B. Sengupta
Jakob N. Foerster
Michael Noukhovitch
81
5
0
03 Jul 2023
RObotic MAnipulation Network (ROMAN) $\unicode{x2013}$ Hybrid
  Hierarchical Learning for Solving Complex Sequential Tasks
RObotic MAnipulation Network (ROMAN) \unicodex2013\unicode{x2013}\unicodex2013 Hybrid Hierarchical Learning for Solving Complex Sequential Tasks
Eleftherios Triantafyllidis
Fernando Acero
Zhaocheng Liu
Zhibin Li
100
0
0
30 Jun 2023
Thompson sampling for improved exploration in GFlowNets
Thompson sampling for improved exploration in GFlowNets
Jarrid Rector-Brooks
Kanika Madan
Moksh Jain
Maksym Korablyov
Cheng-Hao Liu
Sarath Chandar
Nikolay Malkin
Yoshua Bengio
93
30
0
30 Jun 2023
Enhancing training of physics-informed neural networks using
  domain-decomposition based preconditioning strategies
Enhancing training of physics-informed neural networks using domain-decomposition based preconditioning strategies
Alena Kopanicáková
Hardik Kothari
George Karniadakis
Rolf Krause
AI4CE
76
18
0
30 Jun 2023
Systematic Investigation of Sparse Perturbed Sharpness-Aware
  Minimization Optimizer
Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer
Peng Mi
Li Shen
Tianhe Ren
Yiyi Zhou
Tianshuo Xu
Xiaoshuai Sun
Tongliang Liu
Rongrong Ji
Dacheng Tao
AAML
73
2
0
30 Jun 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
102
3
0
29 Jun 2023
SARC: Soft Actor Retrospective Critic
SARC: Soft Actor Retrospective Critic
Sukriti Verma
Ayush Chopra
J. Subramanian
Mausoom Sarkar
Nikaash Puri
Piyush B. Gupta
Balaji Krishnamurthy
46
0
0
28 Jun 2023
Action and Trajectory Planning for Urban Autonomous Driving with
  Hierarchical Reinforcement Learning
Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning
Xinyang Lu
Flint Xiaofeng Fan
Tianying Wang
63
8
0
28 Jun 2023
Diversity is Strength: Mastering Football Full Game with Interactive
  Reinforcement Learning of Multiple AIs
Diversity is Strength: Mastering Football Full Game with Interactive Reinforcement Learning of Multiple AIs
Chenglu Sun
Shuo Shen
Sijia Xu
Weidong Zhang
52
1
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
103
30
0
27 Jun 2023
Augmenting Control over Exploration Space in Molecular Dynamics
  Simulators to Streamline De Novo Analysis through Generative Control Policies
Augmenting Control over Exploration Space in Molecular Dynamics Simulators to Streamline De Novo Analysis through Generative Control Policies
Paloma Gonzalez-Rojas
Andrew Emmel
L. Martínez
Neil Malur
G. Rutledge
AI4CE
73
0
0
26 Jun 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region
  Methods
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
85
2
0
25 Jun 2023
Maintaining Plasticity in Deep Continual Learning
Maintaining Plasticity in Deep Continual Learning
Shibhansh Dohare
J. F. Hernandez-Garcia
Parash Rahman
A. Rupam Mahmood
Richard S. Sutton
KELMCLL
97
30
0
23 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Correcting discount-factor mismatch in on-policy policy gradient methods
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
62
9
0
23 Jun 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning
  from Human Feedback?
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?
Akansha Kalra
Daniel S. Brown
108
0
0
22 Jun 2023
Decentralized Multi-Agent Reinforcement Learning with Global State
  Prediction
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
70
3
0
22 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for
  Search Engine Marketing Optimization
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
90
1
0
21 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman
  Theory
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
88
3
0
20 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy
  Guided Reinforcement Learning
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Qi Zhang
EGVM
152
10
0
20 Jun 2023
Cooperative Multi-Agent Learning for Navigation via Structured State
  Abstraction
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
67
5
0
20 Jun 2023
Previous
123...131415...707172
Next