ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Flexible Option Learning
Flexible Option Learning
Martin Klissarov
Doina Precup
OffRL
77
26
0
06 Dec 2021
SelectAugment: Hierarchical Deterministic Sample Selection for Data
  Augmentation
SelectAugment: Hierarchical Deterministic Sample Selection for Data Augmentation
Shiqi Lin
Zhizheng Zhang
Xin Li
Wenjun Zeng
Zhibo Chen
136
9
0
06 Dec 2021
MDPFuzz: Testing Models Solving Markov Decision Processes
MDPFuzz: Testing Models Solving Markov Decision Processes
Qi Pang
Yuanyuan Yuan
Shuai Wang
100
30
0
06 Dec 2021
Explainable Deep Learning in Healthcare: A Methodological Survey from an
  Attribution View
Explainable Deep Learning in Healthcare: A Methodological Survey from an Attribution View
Di Jin
Elena Sergeeva
W. Weng
Geeticka Chauhan
Peter Szolovits
OOD
120
58
0
05 Dec 2021
Deep Policy Iteration with Integer Programming for Inventory Management
Deep Policy Iteration with Integer Programming for Inventory Management
Pavithra Harsha
A. Jagmohan
Jayant Kalagnanam
Brian Quanz
Divya Singhvi
52
1
0
04 Dec 2021
Coupling Vision and Proprioception for Navigation of Legged Robots
Coupling Vision and Proprioception for Navigation of Legged Robots
Zipeng Fu
Ashish Kumar
Ananye Agarwal
Haozhi Qi
Jitendra Malik
Deepak Pathak
74
78
0
03 Dec 2021
Episodic Policy Gradient Training
Episodic Policy Gradient Training
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDLOffRL
68
6
0
03 Dec 2021
Learning Emergent Random Access Protocol for LEO Satellite Networks
Learning Emergent Random Access Protocol for LEO Satellite Networks
Ju-Hyung Lee
Hyowoon Seo
Jihong Park
M. Bennis
Young-Chai Ko
74
18
0
03 Dec 2021
Towards Interactive Reinforcement Learning with Intrinsic Feedback
Towards Interactive Reinforcement Learning with Intrinsic Feedback
Ben Poole
Minwoo Lee
OffRL
83
1
0
02 Dec 2021
Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data
  via Generative Bias-transformation
Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation
Yeonsung Jung
Hajin Shim
J. Yang
Eunho Yang
87
8
0
02 Dec 2021
A Survey on Scenario-Based Testing for Automated Driving Systems in
  High-Fidelity Simulation
A Survey on Scenario-Based Testing for Automated Driving Systems in High-Fidelity Simulation
Ziyuan Zhong
Yun Tang
Yuan Zhou
V. Neves
Yang Liu
Baishakhi Ray
131
64
0
02 Dec 2021
Risk-based implementation of COLREGs for autonomous surface vehicles
  using deep reinforcement learning
Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
T. N. Larsen
Amalie Heiberg
Eivind Meyer
Adil Rasheed
Omer San
Damiano Varagnolo
68
35
0
30 Nov 2021
Agent-Centric Relation Graph for Object Visual Navigation
Agent-Centric Relation Graph for Object Visual Navigation
X. Hu
Youfang Lin
Shuo Wang
Zhihao Wu
Kai Lv
99
20
0
29 Nov 2021
Explore the Potential Performance of Vision-and-Language Navigation
  Model: a Snapshot Ensemble Method
Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method
Wenda Qin
Teruhisa Misu
Derry Wijaya
UQCVLM&Ro
83
5
0
28 Nov 2021
A Reinforcement Learning Approach for the Continuous Electricity Market
  of Germany: Trading from the Perspective of a Wind Park Operator
A Reinforcement Learning Approach for the Continuous Electricity Market of Germany: Trading from the Perspective of a Wind Park Operator
Malte Lehna
Bjorn Hoppmann
René Heinrich
Christoph Scholz
42
17
0
26 Nov 2021
Reinforcement Explanation Learning
Reinforcement Explanation Learning
Siddhant Agarwal
Owais Iqbal
Sree Aditya Buridi
Madda Manjusha
Abir Das
FAtt
33
0
0
26 Nov 2021
Learning State Representations via Retracing in Reinforcement Learning
Learning State Representations via Retracing in Reinforcement Learning
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
84
8
0
24 Nov 2021
Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and
  Applications
Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and Applications
Khaled B. Letaief
Yuanming Shi
Jianmin Lu
Jianhua Lu
99
434
0
24 Nov 2021
Integrating Imitation Learning with Human Driving Data into
  Reinforcement Learning to Improve Training Efficiency for Autonomous Driving
Integrating Imitation Learning with Human Driving Data into Reinforcement Learning to Improve Training Efficiency for Autonomous Driving
Heidi Lu
17
1
0
23 Nov 2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space:
  Theory and Algorithms
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
123
85
0
22 Nov 2021
Deep Reinforced Attention Regression for Partial Sketch Based Image
  Retrieval
Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval
Dingrong Wang
Hitesh Sapkota
Xumin Liu
Qi Yu
87
5
0
21 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based
  Autonomous Driving
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
92
61
0
16 Nov 2021
Physics-informed neural networks via stochastic Hamiltonian dynamics
  learning
Physics-informed neural networks via stochastic Hamiltonian dynamics learning
Minh Nguyen
Chandrajit Bajaj
40
1
0
15 Nov 2021
VisualEnv: visual Gym environments with Blender
VisualEnv: visual Gym environments with Blender
A. Scorsoglio
R. Furfaro
AI4CE
50
1
0
15 Nov 2021
Interactive Medical Image Segmentation with Self-Adaptive Confidence
  Calibration
Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration
Wenhao Li
Qisen Xu
Chuyun Shen
Bin Hu
Fengping Zhu
Yuxin Li
Bo Jin
Xiangfeng Wang
95
5
0
15 Nov 2021
Modular Networks Prevent Catastrophic Interference in Model-Based
  Multi-Task Reinforcement Learning
Modular Networks Prevent Catastrophic Interference in Model-Based Multi-Task Reinforcement Learning
Robin Schiewer
Laurenz Wiskott
19
3
0
15 Nov 2021
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
AI in Human-computer Gaming: Techniques, Challenges and Opportunities
Qiyue Yin
Jun Yang
Kaiqi Huang
Meijing Zhao
Wancheng Ni
Bin Liang
Yan Huang
Shu Wu
Liangsheng Wang
61
21
0
15 Nov 2021
Reinforcement Learning of Self Enhancing Camera Image and Signal
  Processing
Reinforcement Learning of Self Enhancing Camera Image and Signal Processing
Minh Nguyen
Yi Wang
Yunhao Yang
36
2
0
15 Nov 2021
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning
  Approach
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach
Francisco Caio Lima Paiva
L. Felizardo
Reinaldo A. C. Bianchi
Anna Helena Reali Costa
AIFin
30
7
0
14 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep
  Reinforcement Learning
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
60
24
0
13 Nov 2021
Improving Experience Replay through Modeling of Similar Transitions'
  Sets
Improving Experience Replay through Modeling of Similar Transitions' Sets
Daniel Eugênio Neves
João Pedro Oliveira Batisteli
Eduardo Felipe Lopes
Lucila Ishitani
Zenilton K. G. Patrocínio
OffRL
31
1
0
12 Nov 2021
Collaboration Promotes Group Resilience in Multi-Agent AI
Collaboration Promotes Group Resilience in Multi-Agent AI
Sarah Keren
M. Gerstgrasser
Ofir Abu
J. Rosenschein
42
0
0
12 Nov 2021
Multi-agent Reinforcement Learning for Cooperative Lane Changing of
  Connected and Autonomous Vehicles in Mixed Traffic
Multi-agent Reinforcement Learning for Cooperative Lane Changing of Connected and Autonomous Vehicles in Mixed Traffic
Wei Zhou
Dong Chen
Jun Yan
Zhaojian Li
Huilin Yin
Wancheng Ge
99
85
0
11 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
65
1
0
11 Nov 2021
Multimodal Transformer with Variable-length Memory for
  Vision-and-Language Navigation
Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation
Chuang Lin
Yi Jiang
Jianfei Cai
Zhuang Li
Gholamreza Haffari
Zehuan Yuan
86
32
0
10 Nov 2021
Spatially and Seamlessly Hierarchical Reinforcement Learning for State
  Space and Policy space in Autonomous Driving
Spatially and Seamlessly Hierarchical Reinforcement Learning for State Space and Policy space in Autonomous Driving
Jaehyung Kim
Jaeseung Jeong
25
0
0
10 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at
  Scale
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
197
61
0
09 Nov 2021
Explainable Deep Reinforcement Learning for Portfolio Management: An
  Empirical Approach
Explainable Deep Reinforcement Learning for Portfolio Management: An Empirical Approach
Mao Guan
Xiao-Yang Liu
AIFinAI4TS
51
21
0
07 Nov 2021
FinRL: Deep Reinforcement Learning Framework to Automate Trading in
  Quantitative Finance
FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance
Xiao-Yang Liu
Hongyang Yang
Jiechao Gao
Chris Wang
AIFinOffRL
121
99
0
07 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient
  Methods
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
63
24
0
06 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon
  Reasoning
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
88
43
0
04 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
123
101
0
04 Nov 2021
Towards an Understanding of Default Policies in Multitask Policy
  Optimization
Towards an Understanding of Default Policies in Multitask Policy Optimization
Theodore H. Moskovitz
Michael Arbel
Jack Parker-Holder
Aldo Pacchiano
70
10
0
04 Nov 2021
Attacking Deep Reinforcement Learning-Based Traffic Signal Control
  Systems with Colluding Vehicles
Attacking Deep Reinforcement Learning-Based Traffic Signal Control Systems with Colluding Vehicles
Ao Qu
Yihong Tang
Wei-Ying Ma
45
10
0
04 Nov 2021
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
112
12
0
04 Nov 2021
Proximal Policy Optimization with Continuous Bounded Action Space via
  the Beta Distribution
Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution
Irving G. B. Petrazzini
Eric A. Antonelo
OffRL
59
13
0
03 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
72
19
0
03 Nov 2021
Learning to Explore by Reinforcement over High-Level Options
Learning to Explore by Reinforcement over High-Level Options
Juncheng Liu
B. McCane
S. Mills
EgoV
33
1
0
02 Nov 2021
Human-Level Control without Server-Grade Hardware
Human-Level Control without Server-Grade Hardware
Brett Daley
Chris Amato
BDLOffRL
36
0
0
01 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
100
17
0
30 Oct 2021
Previous
123...272829...707172
Next