Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,552 papers shown
Title
Combining Learning from Human Feedback and Knowledge Engineering to Solve Hierarchical Tasks in Minecraft
Vinicius G. Goecks
Nicholas R. Waytowich
David Watkins
Bharat Prakash
18
7
0
07 Dec 2021
Flexible Option Learning
Martin Klissarov
Doina Precup
OffRL
41
26
0
06 Dec 2021
Explainable Deep Learning in Healthcare: A Methodological Survey from an Attribution View
Di Jin
Elena Sergeeva
W. Weng
Geeticka Chauhan
Peter Szolovits
OOD
56
55
0
05 Dec 2021
Coupling Vision and Proprioception for Navigation of Legged Robots
Zipeng Fu
Ashish Kumar
Ananye Agarwal
Haozhi Qi
Jitendra Malik
Deepak Pathak
21
73
0
03 Dec 2021
Learning Emergent Random Access Protocol for LEO Satellite Networks
Ju-Hyung Lee
Hyowoon Seo
Jihong Park
M. Bennis
Young-Chai Ko
30
17
0
03 Dec 2021
Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation
Yeonsung Jung
Hajin Shim
J. Yang
Eunho Yang
34
8
0
02 Dec 2021
A Survey on Scenario-Based Testing for Automated Driving Systems in High-Fidelity Simulation
Ziyuan Zhong
Yun Tang
Yuan Zhou
V. Neves
Yang Liu
Baishakhi Ray
50
60
0
02 Dec 2021
Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
T. N. Larsen
Amalie Heiberg
Eivind Meyer
Adil Rasheed
Omer San
Damiano Varagnolo
32
35
0
30 Nov 2021
Agent-Centric Relation Graph for Object Visual Navigation
X. Hu
Youfang Lin
Shuo Wang
Zhihao Wu
Kai Lv
44
19
0
29 Nov 2021
A Reinforcement Learning Approach for the Continuous Electricity Market of Germany: Trading from the Perspective of a Wind Park Operator
Malte Lehna
Bjorn Hoppmann
René Heinrich
Christoph Scholz
29
16
0
26 Nov 2021
Learning State Representations via Retracing in Reinforcement Learning
Changmin Yu
Dong Li
Jianye Hao
Jun Wang
Neil Burgess
35
7
0
24 Nov 2021
Edge Artificial Intelligence for 6G: Vision, Enabling Technologies, and Applications
Khaled B. Letaief
Yuanming Shi
Jianmin Lu
Jianhua Lu
48
417
0
24 Nov 2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia
X. Zhou
OffRL
36
79
0
22 Nov 2021
Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval
Dingrong Wang
Hitesh Sapkota
Xumin Liu
Qi Yu
43
4
0
21 Nov 2021
GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving
Raphael Chekroun
Marin Toromanoff
Sascha Hornauer
Fabien Moutarde
41
60
0
16 Nov 2021
Physics-informed neural networks via stochastic Hamiltonian dynamics learning
Minh Nguyen
Chandrajit Bajaj
21
1
0
15 Nov 2021
Interactive Medical Image Segmentation with Self-Adaptive Confidence Calibration
Wenhao Li
Qisen Xu
Chuyun Shen
Bin Hu
Fengping Zhu
Yuxin Li
Bo Jin
Xiangfeng Wang
35
5
0
15 Nov 2021
Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning
Jueming Hu
Xuxi Yang
Weichang Wang
Peng Wei
Lei Ying
Yongming Liu
46
24
0
13 Nov 2021
Multi-agent Reinforcement Learning for Cooperative Lane Changing of Connected and Autonomous Vehicles in Mixed Traffic
Wei Zhou
Dong Chen
Jun Yan
Zhaojian Li
Huilin Yin
Wancheng Ge
44
80
0
11 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
37
1
0
11 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
97
59
0
09 Nov 2021
Explainable Deep Reinforcement Learning for Portfolio Management: An Empirical Approach
Mao Guan
Xiao-Yang Liu
AIFin
AI4TS
27
20
0
07 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
38
23
0
06 Nov 2021
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
37
41
0
04 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
45
93
0
04 Nov 2021
Towards an Understanding of Default Policies in Multitask Policy Optimization
Theodore H. Moskovitz
Michael Arbel
Jack Parker-Holder
Aldo Pacchiano
30
9
0
04 Nov 2021
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
35
10
0
04 Nov 2021
Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution
Irving G. B. Petrazzini
Eric A. Antonelo
OffRL
20
12
0
03 Nov 2021
Model-Based Episodic Memory Induces Dynamic Hybrid Controls
Hung Le
Thommen George Karimpanal
Majid Abdolshah
T. Tran
Svetha Venkatesh
30
19
0
03 Nov 2021
Mastering Atari Games with Limited Data
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
VLM
59
226
0
30 Oct 2021
Adaptive Discretization in Online Reinforcement Learning
Sean R. Sinclair
Siddhartha Banerjee
Chao Yu
OffRL
45
15
0
29 Oct 2021
Understanding the Effect of Stochasticity in Policy Optimization
Jincheng Mei
Bo Dai
Chenjun Xiao
Csaba Szepesvári
Dale Schuurmans
24
17
0
29 Oct 2021
Learning to Ground Multi-Agent Communication with Autoencoders
Toru Lin
Minyoung Huh
C. Stauffer
Ser-Nam Lim
Phillip Isola
AI4CE
48
52
0
28 Oct 2021
Bayesian Sequential Optimal Experimental Design for Nonlinear Models Using Policy Gradient Reinforcement Learning
Wanggang Shen
Xun Huan
13
40
0
28 Oct 2021
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
21
18
0
27 Oct 2021
Distributed Multi-Agent Deep Reinforcement Learning Framework for Whole-building HVAC Control
Vinay Hanumaiah
Sahika Genc
AI4CE
24
6
0
26 Oct 2021
EnTRPO: Trust Region Policy Optimization Method with Entropy Regularization
Sahar Roostaie
M. Ebadzadeh
16
3
0
26 Oct 2021
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
33
226
0
25 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
Kibeom Kim
Min Whoo Lee
Yoonsung Kim
Je-hwan Ryu
Minsu Lee
Byoung-Tak Zhang
24
8
0
25 Oct 2021
A Deep Reinforcement Learning Approach for Audio-based Navigation and Audio Source Localization in Multi-speaker Environments
Petros Giannakopoulos
Aggelos Pikrakis
Y. Cotronis
27
3
0
25 Oct 2021
An Economy of Neural Networks: Learning from Heterogeneous Experiences
A. Kuriksha
27
7
0
22 Oct 2021
Statistical discrimination in learning agents
Edgar A. Duénez-Guzmán
Kevin R. McKee
Yiran Mao
Ben Coppin
Silvia Chiappa
...
Yoram Bachrach
Suzanne Sadedin
William S. Isaac
K. Tuyls
Joel Z. Leibo
47
7
0
21 Oct 2021
On games and simulators as a platform for development of artificial intelligence for command and control
Vinicius G. Goecks
Nicholas R. Waytowich
Derrik E. Asher
Song Jun Park
Mark R. Mittrick
...
Anne Logie
Mark S. Dennison
T. Trout
Priya Narayanan
Alexander Kott
41
26
0
21 Oct 2021
Neuro-Symbolic Reinforcement Learning with First-Order Logic
Daiki Kimura
Masaki Ono
Subhajit Chaudhury
Ryosuke Kohita
Akifumi Wachi
Don Joven Agravante
Michiaki Tatsubori
Asim Munawar
Alexander G. Gray
NAI
31
37
0
21 Oct 2021
Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning
Wenzhuo Zhou
Ruoqing Zhu
Annie Qu
40
22
0
20 Oct 2021
Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm
Raghuram Bharadwaj Diddigi
Prateek Jain
P. J
S. Bhatnagar
CML
OffRL
19
3
0
19 Oct 2021
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
AI4CE
31
15
0
18 Oct 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
21
67
0
15 Oct 2021
A Framework for Learning to Request Rich and Contextually Useful Information from Humans
Khanh Nguyen
Yonatan Bisk
Hal Daumé
54
16
0
14 Oct 2021
NeurIPS 2021 Competition IGLU: Interactive Grounded Language Understanding in a Collaborative Environment
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
Arthur Szlam
Yuxuan Sun
Katja Hofmann
Michel Galley
Ahmed Hassan Awadallah
LLMAG
70
15
0
13 Oct 2021
Previous
1
2
3
...
12
13
14
...
30
31
32
Next