ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01783
  4. Cited By
Asynchronous Methods for Deep Reinforcement Learning
v1v2 (latest)

Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
ArXiv (abs)PDFHTML

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement
  Learning
Formal Policy Synthesis for Continuous-Space Systems via Reinforcement Learning
Milad Kazemi
Sadegh Soudjani
83
29
0
04 May 2020
Noise Pollution in Hospital Readmission Prediction: Long Document
  Classification with Reinforcement Learning
Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning
Liyan Xu
J. Hogan
R. Patzer
Jinho Choi
36
4
0
04 May 2020
Deep Reinforcement Learning for Intelligent Transportation Systems: A
  Survey
Deep Reinforcement Learning for Intelligent Transportation Systems: A Survey
Ammar Haydari
Y. Yilmaz
AI4TS
119
469
0
02 May 2020
Generalized Entropy Regularization or: There's Nothing Special about
  Label Smoothing
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing
Clara Meister
Elizabeth Salesky
Ryan Cotterell
UQCV
56
61
0
02 May 2020
Enhancing Text-based Reinforcement Learning Agents with Commonsense
  Knowledge
Enhancing Text-based Reinforcement Learning Agents with Commonsense Knowledge
K. Murugesan
Mattia Atzeni
Pushkar Shukla
Mrinmaya Sachan
Pavan Kapanipathi
Kartik Talamadupula
LLMAG
68
30
0
02 May 2020
Breaking (Global) Barriers in Parallel Stochastic Optimization with
  Wait-Avoiding Group Averaging
Breaking (Global) Barriers in Parallel Stochastic Optimization with Wait-Avoiding Group Averaging
Shigang Li
Tal Ben-Nun
Giorgi Nadiradze
Salvatore Di Girolamo
Nikoli Dryden
Dan Alistarh
Torsten Hoefler
75
15
0
30 Apr 2020
On the Spontaneous Emergence of Discrete and Compositional Signals
On the Spontaneous Emergence of Discrete and Compositional Signals
Nur Lan
Emmanuel Chemla
Shane Steinert-Threlkeld
LRM
75
8
0
30 Apr 2020
Improving Factual Consistency Between a Response and Persona Facts
Improving Factual Consistency Between a Response and Persona Facts
Mohsen Mesgar
Edwin Simpson
Iryna Gurevych
HILM
80
6
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement
  Learning
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDLDRLSSL
192
143
0
30 Apr 2020
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
Xiaoteng Ma
Junyao Chen
Li Xia
Jun Yang
Qianchuan Zhao
Zhengyuan Zhou
94
17
0
30 Apr 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
  Policy Optimization
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
102
27
0
29 Apr 2020
Molecular Design in Synthetically Accessible Chemical Space via Deep
  Reinforcement Learning
Molecular Design in Synthetically Accessible Chemical Space via Deep Reinforcement Learning
Julien Horwood
Emmanuel Noutahi
AI4CE
73
69
0
29 Apr 2020
Improving Target-driven Visual Navigation with Attention on 3D Spatial
  Relationships
Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships
Yunlian Lv
Ning Xie
Yimin Shi
Zijiao Wang
Jikang Cheng
34
0
0
29 Apr 2020
Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue
  Task
Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task
Katya Kudashkina
Valliappa Chockalingam
Graham W. Taylor
Michael Bowling
OffRLLLMAG
64
2
0
28 Apr 2020
Image Augmentation Is All You Need: Regularizing Deep Reinforcement
  Learning from Pixels
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
Ilya Kostrikov
Denis Yarats
Rob Fergus
OffRL
206
794
0
28 Apr 2020
Improving Sample Efficiency and Multi-Agent Communication in RL-based
  Train Rescheduling
Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling
Dano Roost
Ralph Meier
Stephan Huschauer
Erik Nygren
A. Egli
Andreas Weiler
Thilo Stadelmann
23
4
0
28 Apr 2020
Transferable Active Grasping and Real Embodied Dataset
Transferable Active Grasping and Real Embodied Dataset
Xiangyu Chen
Zelin Ye
Jiankai Sun
Yuda Fan
Fangwei Hu
Chenxi Wang
Cewu Lu
54
19
0
28 Apr 2020
The AI Economist: Improving Equality and Productivity with AI-Driven Tax
  Policies
The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies
Stephan Zheng
Alexander R. Trott
Sunil Srinivasa
Nikhil Naik
Melvin Gruesbeck
David C. Parkes
R. Socher
60
136
0
28 Apr 2020
First return, then explore
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
127
365
0
27 Apr 2020
GymFG: A Framework with a Gym Interface for FlightGear
GymFG: A Framework with a Gym Interface for FlightGear
A. Wood
Ali Sydney
Peter Chin
B. Thapa
Ryan Ross
16
2
0
26 Apr 2020
A Perspective on Deep Learning for Molecular Modeling and Simulations
A Perspective on Deep Learning for Molecular Modeling and Simulations
Jun Zhang
Yao-Kun Lei
Zhen Zhang
Junhan Chang
Maodong Li
Xu Han
Lijiang Yang
Yue Yang
Y. Gao
AI4CE
119
8
0
25 Apr 2020
A State Aggregation Approach for Solving Knapsack Problem with Deep
  Reinforcement Learning
A State Aggregation Approach for Solving Knapsack Problem with Deep Reinforcement Learning
Reza Refaei Afshar
Yingqian Zhang
M. Firat
U. Kaymak
OffRL
77
14
0
25 Apr 2020
CFR-RL: Traffic Engineering with Reinforcement Learning in SDN
CFR-RL: Traffic Engineering with Reinforcement Learning in SDN
Member Ieee Junjie Zhang
Minghao Ye
Senior Member Ieee Zehua Guo
Chen-Yu Yen
F. I. H. Jonathan Chao
37
140
0
24 Apr 2020
The Two Kinds of Free Energy and the Bayesian Revolution
The Two Kinds of Free Energy and the Bayesian Revolution
Sebastian Gottwald
Daniel A. Braun
3DV
47
33
0
24 Apr 2020
Learning Constrained Adaptive Differentiable Predictive Control Policies
  With Guarantees
Learning Constrained Adaptive Differentiable Predictive Control Policies With Guarantees
Ján Drgoňa
Aaron Tuor
D. Vrabie
118
18
0
23 Apr 2020
OF-VO: Efficient Navigation among Pedestrians Using Commodity Sensors
OF-VO: Efficient Navigation among Pedestrians Using Commodity Sensors
Jing Liang
Yi-Ling Qiao
Tianrui Guan
Tianyi Zhou
94
13
0
23 Apr 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
110
38
0
22 Apr 2020
Flexible and Efficient Long-Range Planning Through Curious Exploration
Flexible and Efficient Long-Range Planning Through Curious Exploration
Aidan Curtis
Minjian Xin
Dilip Arumugam
Kevin T. Feigelis
Daniel L. K. Yamins
36
6
0
22 Apr 2020
Policy Gradient from Demonstration and Curiosity
Policy Gradient from Demonstration and Curiosity
Jie Chen
Wenjun Xu
132
12
0
22 Apr 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage
  Decomposition
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition
Zihan Zhang
Yuanshuo Zhou
Xiangyang Ji
OffRL
87
158
0
21 Apr 2020
Real World Games Look Like Spinning Tops
Real World Games Look Like Spinning Tops
Wojciech M. Czarnecki
Gauthier Gidel
Brendan D. Tracey
K. Tuyls
Shayegan Omidshafiei
David Balduzzi
Max Jaderberg
82
101
0
20 Apr 2020
Modeling Survival in model-based Reinforcement Learning
Modeling Survival in model-based Reinforcement Learning
Saeed Moazami
P. Doerschuk
OffRL
29
1
0
18 Apr 2020
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
Matt Deitke
Winson Han
Alvaro Herrasti
Aniruddha Kembhavi
Eric Kolve
...
Eli VanderBilt
Matthew Wallingford
Luca Weihs
Mark Yatskar
Ali Farhadi
LM&Ro
154
241
0
14 Apr 2020
Extrapolation in Gridworld Markov-Decision Processes
Extrapolation in Gridworld Markov-Decision Processes
Eugene Charniak
34
0
0
14 Apr 2020
Reinforcement Learning Approach to Vibration Compensation for Dynamic
  Feed Drive Systems
Reinforcement Learning Approach to Vibration Compensation for Dynamic Feed Drive Systems
Ralf Gulde
Marc Tuscher
A. Csiszar
O. Riedel
A. Verl
AI4CE
33
1
0
14 Apr 2020
A Deep Reinforcement Learning Framework for Continuous Intraday Market
  Bidding
A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding
Ioannis Boukas
D. Ernst
Thibaut Théate
Adrien Bolland
A. Huynen
Martin Buchwald
Christelle Wynants
Bertrand Cornélusse
54
53
0
13 Apr 2020
Certifiable Robustness to Adversarial State Uncertainty in Deep
  Reinforcement Learning
Certifiable Robustness to Adversarial State Uncertainty in Deep Reinforcement Learning
Michael Everett
Bjorn Lutjens
Jonathan P. How
AAML
55
42
0
11 Apr 2020
A Review on Deep Learning Techniques for Video Prediction
A Review on Deep Learning Techniques for Video Prediction
Sergiu Oprea
P. Martinez-Gonzalez
Alberto Garcia-Garcia
John Alejandro Castro-Vargas
S. Orts-Escolano
Jose Garcia-Rodriguez
Antonis Argyros
112
256
0
10 Apr 2020
Solving the scalarization issues of Advantage-based Reinforcement
  Learning Algorithms
Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms
Federico A. Galatolo
M. G. Cimino
G. Vaglini
56
3
0
08 Apr 2020
Learning from Learners: Adapting Reinforcement Learning Agents to be
  Competitive in a Card Game
Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game
Pablo V. A. Barros
Ana Tanevska
A. Sciutti
72
21
0
08 Apr 2020
Optimistic Agent: Accurate Graph-Based Value Estimation for More
  Successful Visual Navigation
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation
M. Moghaddam
Qi Wu
Ehsan Abbasnejad
Javen Qinfeng Shi
68
4
0
07 Apr 2020
Using Generative Adversarial Nets on Atari Games for Feature Extraction
  in Deep Reinforcement Learning
Using Generative Adversarial Nets on Atari Games for Feature Extraction in Deep Reinforcement Learning
A. Aydin
Elif Surer
20
1
0
06 Apr 2020
Sub-Instruction Aware Vision-and-Language Navigation
Sub-Instruction Aware Vision-and-Language Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Qi Wu
Stephen Gould
136
72
0
06 Apr 2020
Learning Stabilizing Control Policies for a Tensegrity Hopper with
  Augmented Random Search
Learning Stabilizing Control Policies for a Tensegrity Hopper with Augmented Random Search
Vladislav Kurenkov
Hany Hamed
S. Savin
17
2
0
06 Apr 2020
CNN2Gate: Toward Designing a General Framework for Implementation of
  Convolutional Neural Networks on FPGA
CNN2Gate: Toward Designing a General Framework for Implementation of Convolutional Neural Networks on FPGA
Alireza Ghaffari
Yvon Savaria
31
9
0
06 Apr 2020
Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep
  Reinforcement Learning
Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning
P. Costa
Jason Rhuggenaath
Yingqian Zhang
A. Akçay
97
143
0
03 Apr 2020
A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air
  Traffic Control
A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air Traffic Control
Supriyo Ghosh
Sean Laguna
Shiau Hong Lim
L. Wynter
Hasan A. Poonawala
52
14
0
03 Apr 2020
Multi-agent Reinforcement Learning for Networked System Control
Multi-agent Reinforcement Learning for Networked System Control
Tianshu Chu
Sandeep P. Chinchali
Sachin Katti
79
112
0
03 Apr 2020
Action Space Shaping in Deep Reinforcement Learning
Action Space Shaping in Deep Reinforcement Learning
Anssi Kanervisto
Christian Scheller
Ville Hautamaki
80
84
0
02 Apr 2020
Average Reward Adjusted Discounted Reinforcement Learning:
  Near-Blackwell-Optimal Policies for Real-World Applications
Average Reward Adjusted Discounted Reinforcement Learning: Near-Blackwell-Optimal Policies for Real-World Applications
Manuel Schneckenreither
OffRL
34
5
0
02 Apr 2020
Previous
123...444546...707172
Next