ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Exterior Penalty Policy Optimization with Penalty Metric Network under
  Constraints
Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints
Shiqing Gao
Jiaxin Ding
Luoyi Fu
Xinbing Wang
Cheng Zhou
63
0
0
22 Jul 2024
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs
  A2C
A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C
Neil De La Fuente
Daniel A. Vidal Guerra
OffRL
34
7
0
19 Jul 2024
Track-MDP: Reinforcement Learning for Target Tracking with Controlled
  Sensing
Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
Adarsh M. Subramaniam
Argyrios Gerogiannis
James Z. Hare
Venugopal V. Veeravalli
134
0
0
19 Jul 2024
DeepClair: Utilizing Market Forecasts for Effective Portfolio Selection
DeepClair: Utilizing Market Forecasts for Effective Portfolio Selection
Donghee Choi
Jinkyu Kim
Mogan Gim
Jinho Lee
Jaewoo Kang
78
0
0
18 Jul 2024
PG-Rainbow: Using Distributional Reinforcement Learning in Policy
  Gradient Methods
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods
WooJae Jeon
KanJun Lee
Jeewoo Lee
OffRL
34
0
0
18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
262
3
0
18 Jul 2024
Optimistic Q-learning for average reward and episodic reinforcement learning
Optimistic Q-learning for average reward and episodic reinforcement learning
Priyank Agrawal
Shipra Agrawal
125
6
0
18 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
105
1
0
18 Jul 2024
Subequivariant Reinforcement Learning in 3D Multi-Entity Physical
  Environments
Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments
Runfa Chen
Ling Wang
Yu Du
Tianrui Xue
Gang Hua
Jianwei Zhang
Wenbing Huang
OffRL
103
1
0
17 Jul 2024
Graceful task adaptation with a bi-hemispheric RL agent
Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas
L. Kuhlmann
Gideon Kowadlo
72
0
0
16 Jul 2024
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
AlphaDou: High-Performance End-to-End Doudizhu AI Integrating Bidding
Chang Lei
Huan Lei
57
0
0
14 Jul 2024
Any-Property-Conditional Molecule Generation with Self-Criticism using
  Spanning Trees
Any-Property-Conditional Molecule Generation with Self-Criticism using Spanning Trees
Alexia Jolicoeur-Martineau
A. Baratin
Kisoo Kwon
Boris Knyazev
Yan Zhang
71
1
0
12 Jul 2024
Gradient Boosting Reinforcement Learning
Gradient Boosting Reinforcement Learning
Benjamin Fuhrer
Chen Tessler
Gal Dalal
OffRLAI4CE
185
3
0
11 Jul 2024
RoboMorph: Evolving Robot Morphology using Large Language Models
RoboMorph: Evolving Robot Morphology using Large Language Models
Kevin Qiu
Krzysztof Ciebiera
Krzysztof Ciebiera
Marek Cygan
Marek Cygan
Łukasz Kuciński
LM&Ro
160
1
0
11 Jul 2024
Mitigating Partial Observability in Sequential Decision Processes via
  the Lambda Discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen
Aaron Kirtland
Ruo Yu Tao
Sam Lobel
Daniel Scott
Nicholas Petrocelli
Omer Gottesman
Ronald E. Parr
M. L. Littman
George Konidaris
52
2
0
10 Jul 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal
  Reinforcement for Enhanced Financial Decision Making
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
Yangyang Yu
Zhiyuan Yao
Haohang Li
Zhiyang Deng
Yupeng Cao
...
Guojun Xiong
Yueru He
Jimin Huang
Dong Li
Qianqian Xie
AIFinLLMAG
95
32
0
09 Jul 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Xuyang Chen
Lin Zhao
73
0
0
09 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
91
0
0
08 Jul 2024
Communication and Control Co-Design in 6G: Sequential Decision-Making
  with LLMs
Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs
Xianfu Chen
Celimuge Wu
Yi Shen
Yusheng Ji
Tsutomu Yoshinaga
Qiang Ni
Charilaos C. Zarakovitis
Honggang Zhang
87
3
0
06 Jul 2024
The Impact of Quantization and Pruning on Deep Reinforcement Learning
  Models
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models
Heng Lu
Mehdi Alemi
Reza Rawassizadeh
100
1
0
05 Jul 2024
Simplifying Deep Temporal Difference Learning
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
170
26
0
05 Jul 2024
A Role of Environmental Complexity on Representation Learning in Deep Reinforcement Learning Agents
A Role of Environmental Complexity on Representation Learning in Deep Reinforcement Learning Agents
Andrew Liu
Alla Borisyuk
146
1
0
03 Jul 2024
Weight Clipping for Deep Continual and Reinforcement Learning
Weight Clipping for Deep Continual and Reinforcement Learning
Mohamed Elsayed
Qingfeng Lan
Clare Lyle
A. Rupam Mahmood
91
12
0
01 Jul 2024
Reinforcement Learning-driven Data-intensive Workflow Scheduling for
  Volunteer Edge-Cloud
Reinforcement Learning-driven Data-intensive Workflow Scheduling for Volunteer Edge-Cloud
Motahare Mounesan
Mauro Lemus
H. Yeddulapalli
Prasad Calyam
S. Debroy
OffRL
41
6
0
01 Jul 2024
Model-Free Active Exploration in Reinforcement Learning
Model-Free Active Exploration in Reinforcement Learning
Alessio Russo
Alexandre Proutiere
OffRL
59
3
0
30 Jun 2024
Disentangled Representations for Causal Cognition
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
101
2
0
30 Jun 2024
Towards shutdownable agents via stochastic choice
Towards shutdownable agents via stochastic choice
Elliott Thornley
Alexander Roman
Christos Ziakas
Leyton Ho
Louis Thomson
140
0
0
30 Jun 2024
Deep Reinforcement Learning Strategies in Finance: Insights into Asset
  Holding, Trading Behavior, and Purchase Diversity
Deep Reinforcement Learning Strategies in Finance: Insights into Asset Holding, Trading Behavior, and Purchase Diversity
Alireza Mohammadshafie
Akram Mirzaeinia
Haseebullah Jumakhan
Amir Mirzaeinia
AIFin
28
1
0
29 Jun 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
112
6
0
29 Jun 2024
Efficient World Models with Context-Aware Tokenization
Efficient World Models with Context-Aware Tokenization
Vincent Micheli
Eloi Alonso
François Fleuret
OffRLVLM
80
6
0
27 Jun 2024
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension
  via Deep Reinforcement Learning
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning
Nishesh Singh
Sidharth Ramesh
Abhishek Shankar
Jyotishka Duttagupta
Leander Stephen D'Souza
Sanjay Singh
32
0
0
27 Jun 2024
Understanding and Diagnosing Deep Reinforcement Learning
Understanding and Diagnosing Deep Reinforcement Learning
Ezgi Korkmaz
68
3
0
23 Jun 2024
Multistep Criticality Search and Power Shaping in Microreactors with
  Reinforcement Learning
Multistep Criticality Search and Power Shaping in Microreactors with Reinforcement Learning
M. Radaideh
Leo Tunkle
D. Price
Kamal Abdulraheem
Linyu Lin
Moutaz Elias
39
0
0
22 Jun 2024
Learning to Retrieve Iteratively for In-Context Learning
Learning to Retrieve Iteratively for In-Context Learning
Yunmo Chen
Tongfei Chen
Harsh Jhamtani
Patrick Xia
Richard Shin
Jason Eisner
Benjamin Van Durme
RALM
96
7
0
20 Jun 2024
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent
  poLicy for InTerpretability
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretability
Shuang Ao
Simon Khan
Haris Aziz
Flora D. Salim
123
0
0
20 Jun 2024
Two-Stage Depth Enhanced Learning with Obstacle Map For Object
  Navigation
Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation
Yanwei Zheng
Shaopu Feng
Bowen Huang
Changrui Li
Xiao Zhang
Dongxiao Yu
121
0
0
20 Jun 2024
Graph Neural Networks for Job Shop Scheduling Problems: A Survey
Graph Neural Networks for Job Shop Scheduling Problems: A Survey
Igor G. Smit
Jianan Zhou
Robbert Reijnen
Yaoxin Wu
Jian Chen
Cong Zhang
Zaharah Bukhsh
Wim P. M. Nuijten
Yingqian Zhang
GNNAI4CE
117
11
0
20 Jun 2024
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
Revealing the Learning Process in Reinforcement Learning Agents Through Attention-Oriented Metrics
Charlotte Beylier
Simon M. Hofmann
Nico Scherf
222
0
0
20 Jun 2024
Low-Redundant Optimization for Large Language Model Alignment
Low-Redundant Optimization for Large Language Model Alignment
Zhipeng Chen
Kun Zhou
Wayne Xin Zhao
Jingyuan Wang
Ji-Rong Wen
83
3
0
18 Jun 2024
A Super-human Vision-based Reinforcement Learning Agent for Autonomous
  Racing in Gran Turismo
A Super-human Vision-based Reinforcement Learning Agent for Autonomous Racing in Gran Turismo
Miguel Vasco
Takuma Seno
Kenta Kawamoto
K. Subramanian
Peter R. Wurman
Peter Stone
109
8
0
18 Jun 2024
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
An Imitative Reinforcement Learning Framework for Autonomous Dogfight
Siyuan Li
Rongchang Zuo
Peng Liu
Yingnan Zhao
Yingnan Zhao
112
1
0
17 Jun 2024
Exploration by Learning Diverse Skills through Successor State Measures
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
69
0
0
14 Jun 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
Haruka Kita
Sotetsu Koyamada
Yotaro Yamaguchi
Shin Ishii
73
0
0
14 Jun 2024
RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack
  against LLMs
RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs
Xuan Chen
Yuzhou Nie
Lu Yan
Yunshu Mao
Wenbo Guo
Xiangyu Zhang
63
7
0
13 Jun 2024
Efficient Adaptation in Mixed-Motive Environments via Hierarchical
  Opponent Modeling and Planning
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning
Yizhe Huang
Hoang Trung-Dung
Fanqi Kong
Yaodong Yang
Song-Chun Zhu
Xue Feng
84
3
0
12 Jun 2024
CHARME: A chain-based reinforcement learning approach for the minor
  embedding problem
CHARME: A chain-based reinforcement learning approach for the minor embedding problem
Hoang M. Ngo
Nguyen H K. Do
Minh Nhat Vu
Tamer Kahveci
My T. Thai
AI4CE
69
2
0
11 Jun 2024
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization
Jesse van Remmerden
Maurice Kenter
D. Roijers
Charalampos Andriotis
Yingqian Zhang
Zaharah Bukhsh
52
1
0
10 Jun 2024
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning:
  A Systematic Review
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review
Hafez Ghaemi
Shirin Jamshidi
Mohammad Mashreghi
M. N. Ahmadabadi
Hamed Kebriaei
106
1
0
10 Jun 2024
STARLING: Self-supervised Training of Text-based Reinforcement Learning
  Agent with Large Language Models
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
Shreyas Basavatia
K. Murugesan
Shivam Ratnakar
SyDaAI4CE
83
8
0
09 Jun 2024
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play
Wichayaporn Wongkamjan
Feng Gu
Yanze Wang
Ulf Hermjakob
Jonathan May
Brandon M. Stewart
Jonathan K. Kummerfeld
Denis Peskoff
Jordan L. Boyd-Graber
90
6
0
07 Jun 2024
Previous
123...567...707172
Next