ResearchTrend.AI
Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 1,515 papers shown
Title
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
75
0
0
25 Nov 2024
Creating Hierarchical Dispositions of Needs in an Agent
Tofara Moyo
91
0
0
23 Nov 2024
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong
Shufan Wang
Daniel Jiang
Jian Li
FedML
78
1
0
22 Nov 2024
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
64
1
0
21 Nov 2024
ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
82
0
0
20 Nov 2024
AMaze: An intuitive benchmark generator for fast prototyping of generalizable agents
Kevin Godin-Dubois
Karine Miras
Anna V. Kononova
76
0
0
20 Nov 2024
Bitcoin Under Volatile Block Rewards: How Mempool Statistics Can Influence Bitcoin Mining
Roozbeh Sarenche
Alireza Aghabagherloo
S. Nikova
Bart Preneel
77
0
0
18 Nov 2024
VLN-Game: Vision-Language Equilibrium Search for Zero-Shot Semantic Navigation
Bangguo Yu
Yuzhen Liu
Lei Han
Hamidreza Kasaei
Tingguang Li
M. Cao
LM&Ro
81
3
0
18 Nov 2024
A Pre-Trained Graph-Based Model for Adaptive Sequencing of Educational Documents
Jean Vassoyan
Anan Schütt
Jill-Jênn Vie
Arun-Balajiee Lekshmi-Narayanan
Elisabeth André
Nicolas Vayatis
AI4Ed
74
0
0
18 Nov 2024
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
Philips George John
Arnab Bhattacharyya
Silviu Maniu
Dimitrios Myrisiotis
Zhenan Wu
OffRL
36
0
0
16 Nov 2024
Multi-agent Path Finding for Timed Tasks using Evolutionary Games
Sheryl Paul
Anand Balakrishnan
Xin Qin
Jyotirmoy V. Deshmukh
31
0
0
15 Nov 2024
Rationality based Innate-Values-driven Reinforcement Learning
Qin Yang
18
0
0
14 Nov 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
29
2
0
08 Nov 2024
Retentive Neural Quantum States: Efficient Ansätze for Ab Initio Quantum Chemistry
Oliver Knitter
Dan Zhao
J. Stokes
M. Ganahl
Stefan Leichenauer
S. Veerapaneni
37
1
0
06 Nov 2024
Hierarchical Orchestra of Policies
Thomas P Cannon
Özgür Simsek
CLL
39
0
0
05 Nov 2024
When to Localize? A Risk-Constrained Reinforcement Learning Approach
Chak Lam Shek
Kasra Torshizi
Troi Williams
Pratap Tokekar
41
2
0
05 Nov 2024
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
Bowen Li
Zhaoyu Li
Qiwei Du
Jinqi Luo
Wenshan Wang
...
Katia Sycara
Pradeep Kumar Ravikumar
Alexander G. Gray
X. Si
Sebastian A. Scherer
AI4CE
LRM
81
3
0
01 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
26
0
0
31 Oct 2024
CALE: Continuous Arcade Learning Environment
Jesse Farebrother
Pablo Samuel Castro
ELM
38
0
0
31 Oct 2024
AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection
Yujin Wang
Tianyi Xu
Fan Zhang
Tianfan Xue
Liang Feng
VLM
31
4
0
30 Oct 2024
Dual-Agent Deep Reinforcement Learning for Dynamic Pricing and Replenishment
Yi Zheng
Zehao Li
Peng Jiang
Yijie Peng
24
0
0
28 Oct 2024
FairStream: Fair Multimedia Streaming Benchmark for Reinforcement Learning Agents
Jannis Weil
Jonas Ringsdorf
Julian Barthel
Yi-Ping Phoebe Chen
Tobias Meuser
OffRL
31
0
0
28 Oct 2024
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations
Eduardo C. Garrido-Merchán
Maria Coronado Vaca
Álvaro López-López
Carlos Martínez de Ibarreta
28
0
0
27 Oct 2024
Multi-agent cooperation through learning-aware policy gradients
Alexander Meulemans
Seijin Kobayashi
J. Oswald
Nino Scherrer
Eric Elmoznino
Blake A. Richards
Guillaume Lajoie
Blaise Agüera y Arcas
João Sacramento
56
0
0
24 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
85
6
0
23 Oct 2024
Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts
Sheryl Paul
Jyotirmoy V. Deshmukh
38
0
0
22 Oct 2024
LLM-Assisted Red Teaming of Diffusion Models through "Failures Are Fated, But Can Be Faded"
Som Sagar
Aditya Taparia
Ransalu Senanayake
20
0
0
22 Oct 2024
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Hanlin Yang
Jian Yao
Weiming Liu
Qing Wang
Hanmin Qin
...
Hongwu Chen
Juchao Zhuo
Qiang Fu
Yang Wei
Haobo Fu
34
1
0
21 Oct 2024
Hierarchical Reinforced Trader (HRT): A Bi-Level Approach for Optimizing Stock Selection and Execution
Zijie Zhao
Roy E. Welsch
AIFin
15
1
0
19 Oct 2024
Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments
Mariusz Wisniewski
Paraskevas Chatzithanos
Weisi Guo
Antonios Tsourdos
38
3
0
18 Oct 2024
Streaming Deep Reinforcement Learning Finally Works
Mohamed Elsayed
Gautham Vasan
A. R. Mahmood
OffRL
54
4
0
18 Oct 2024
TF-DDRL: A Transformer-enhanced Distributed DRL Technique for Scheduling IoT Applications in Edge and Cloud Computing Environments
Zhiyu Wang
M. Goudarzi
Rajkumar Buyya
OffRL
35
3
0
18 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yue Yang
40
3
0
18 Oct 2024
AERO: Softmax-Only LLMs for Efficient Private Inference
N. Jha
Brandon Reagen
32
1
0
16 Oct 2024
EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at Edge
Motahare Mounesan
Xiaojie Zhang
S. Debroy
28
1
0
16 Oct 2024
TradExpert: Revolutionizing Trading with Mixture of Expert LLMs
Qianggang Ding
Haochen Shi
Jiadong Guo
Bang Liu
AIFin
43
3
0
16 Oct 2024
Understanding Likelihood Over-optimisation in Direct Alignment Algorithms
Zhengyan Shi
Sander Land
Acyr Locatelli
Matthieu Geist
Max Bartolo
54
4
0
15 Oct 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
45
3
0
15 Oct 2024
Improving the Language Understanding Capabilities of Large Language Models Using Reinforcement Learning
Bokai Hu
Sai Ashish Somayajula
Xin Pan
Zihan Huang
Pengtao Xie
OffRL
21
1
0
14 Oct 2024
Multi-Agent Actor-Critics in Autonomous Cyber Defense
Mingjun Wang
Remington Dechene
31
0
0
11 Oct 2024
Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games
Fanqi Kong
Yizhe Huang
Song-Chun Zhu
Siyuan Qi
Xue Feng
36
2
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Effective Exploration Based on the Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
21
2
0
09 Oct 2024
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroń
Kamil Faber
OffRL
32
1
0
08 Oct 2024
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
28
0
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng
Hao Luan
Kevin Yuchen Ma
Harold Soh
32
2
0
03 Oct 2024
Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting
Alessio Russo
Alberto Maria Metelli
Marcello Restelli
36
0
0
02 Oct 2024
Criticality and Safety Margins for Reinforcement Learning
Alexander Grushin
Walt Woods
Alvaro Velasquez
Simon Khan
AAML
38
1
0
26 Sep 2024
A Survey for Deep Reinforcement Learning Based Network Intrusion Detection
Wanrong Yang
Alberto Acuto
Yihang Zhou
Dominik Wojtczak
OffRL
36
2
0
25 Sep 2024