ResearchTrend.AI


Asynchronous Methods for Deep Reinforcement Learning

4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
arXiv:1602.01783

Papers citing "Asynchronous Methods for Deep Reinforcement Learning"

50 / 3,591 papers shown
Title
Soft policy optimization using dual-track advantage estimator
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
31
3
0
15 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL, DRL
403
346
0
14 Sep 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
102
24
0
14 Sep 2020
Multi-Agent Reinforcement Learning in Cournot Games
Yuanyuan Shi
Baosen Zhang
65
7
0
14 Sep 2020
Efficient Competitive Self-Play Policy Optimization
Yuanyi Zhong
Yuanshuo Zhou
Jian Peng
20
2
0
13 Sep 2020
Pow-Wow: A Dataset and Study on Collaborative Communication in Pommerman
Takuma Yoneda
Matthew R. Walter
Jason Naradowsky
LLMAG
18
1
0
13 Sep 2020
Guided Policy Search Based Control of a High Dimensional Advanced Manufacturing Process
A. Surana
Kishore K. Reddy
M. Siopis
AI4CE
27
2
0
12 Sep 2020
Phasic Policy Gradient
K. Cobbe
Jacob Hilton
Oleg Klimov
John Schulman
OffRL
100
160
0
09 Sep 2020
Graph neural networks-based Scheduler for Production planning problems using Reinforcement Learning
Mohammed Sharafath Abdul Hameed
Andreas Schwung
28
25
0
08 Sep 2020
Evolutionary Reinforcement Learning via Cooperative Coevolutionary Negatively Correlated Search
Hu Zhang
Peng Yang
Yang Yu
Mingjiang Li
K. Tang
126
21
0
08 Sep 2020
Detecting and adapting to crisis pattern with context based Deep Reinforcement Learning
Eric Benhamou
David Saltiel
Jean-Jacques Ohana
Jamal Atif
67
19
0
07 Sep 2020
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
Sheng-Chun Kao
Geonhwa Jeong
T. Krishna
116
96
0
04 Sep 2020
Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling
Tsendsuren Munkhdalai
CLL, OffRL
54
4
0
03 Sep 2020
Action and Perception as Divergence Minimization
Danijar Hafner
Pedro A. Ortega
Jimmy Ba
Thomas Parr
Karl J. Friston
N. Heess
91
53
0
03 Sep 2020
Grounded Language Learning Fast and Slow
Felix Hill
O. Tieleman
Tamara von Glehn
Nathaniel Wong
Hamza Merzic
S. Clark
LM&Ro
172
81
0
03 Sep 2020
TAP-Net: Transport-and-Pack using Reinforcement Learning
Ruizhen Hu
Juzhan Xu
Bin Chen
Minglun Gong
Hao Zhang
Hui Huang
71
26
0
03 Sep 2020
Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics
Yanchao Sun
Da Huo
Furong Huang
AAML, OffRL, OnRL
114
52
0
02 Sep 2020
Latency and Throughput Optimization in Modern Networks: A Comprehensive Survey
A. Mirzaeinia
Mehdi Mirzaeinia
A. Rezgui
53
6
0
01 Sep 2020
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
Shreshth Tuli
Shashikant Ilager
K. Ramamohanarao
Rajkumar Buyya
69
179
0
01 Sep 2020
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning
Tan Tang
Renzhong Li
Xinke Wu
Shuhan Liu
Johannes Knittel
Steffen Koch
Thomas Ertl
Lingyun Yu
Peiran Ren
Yingcai Wu
95
54
0
01 Sep 2020
Deep Reinforcement Learning for Contact-Rich Skills Using Compliant Movement Primitives
Oren Spector
M. Zacksenhouse
64
12
0
30 Aug 2020
Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems
Vinicius G. Goecks
117
11
0
30 Aug 2020
Reinforcement Learning with Feedback-modulated TD-STDP
Stephen Chung
R. Kozma
33
3
0
29 Aug 2020
Real-world Video Adaptation with Reinforcement Learning
Hongzi Mao
Shannon Chen
Drew Dimmery
Shaun Singh
Drew Blaisdell
Yuandong Tian
Mohammad Alizadeh
E. Bakshy
OffRL
126
77
0
28 Aug 2020
AllenAct: A Framework for Embodied AI Research
Luca Weihs
Jordi Salvador
Klemen Kotar
Unnat Jain
Kuo-Hao Zeng
Roozbeh Mottaghi
Aniruddha Kembhavi
LM&Ro, AI4CE
80
75
0
28 Aug 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge Devices
Parth Mannan
A. Samajdar
T. Krishna
58
2
0
27 Aug 2020
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
107
52
0
26 Aug 2020
Selective Particle Attention: Visual Feature-Based Attention in Deep Reinforcement Learning
Sam Blakeman
D. Mareschal
68
1
0
26 Aug 2020
Model-Free Episodic Control with State Aggregation
R. Pinto
OffRL
32
3
0
21 Aug 2020
NANCY: Neural Adaptive Network Coding methodologY for video distribution over wireless networks
Paresh Saxena
Mandan Naresh
Manik Gupta
Anirudh Achanta
S. Kota
Smrati Gupta
8
8
0
21 Aug 2020
Exploiting Scene-specific Features for Object Goal Navigation
Tommaso Campari
Paolo Eccher
Luciano Serafini
Lamberto Ballan
95
29
0
21 Aug 2020
Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads
Deepak Narayanan
Keshav Santhanam
Fiodar Kazhamiaka
Amar Phanishayee
Matei A. Zaharia
83
216
0
20 Aug 2020
Reinforcement Learning for Low-Thrust Trajectory Design of Interplanetary Missions
Alessandro Zavoli
Lorenzo Federici
35
7
0
19 Aug 2020
Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning
Wenshuai Zhao
Jorge Peña Queralta
Qingqing Li
Tomi Westerlund
64
28
0
18 Aug 2020
Ubiquitous Distributed Deep Reinforcement Learning at the Edge: Analyzing Byzantine Agents in Discrete Action Spaces
Wenshuai Zhao
Jorge Peña Queralta
Qingqing Li
Tomi Westerlund
80
6
0
18 Aug 2020
Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards
Umer Siddique
Paul Weng
Matthieu Zimmer
FaML, OffRL
65
88
0
18 Aug 2020
Learning Complex Multi-Agent Policies in Presence of an Adversary
Siddharth Ghiya
Katia Sycara
25
3
0
18 Aug 2020
A Survey of Deep Learning for Data Caching in Edge Network
Yantong Wang
V. Friderikos
90
28
0
17 Aug 2020
Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs
Seowoo Jang
Soyoung Yoo
Namwoo Kang
AI4CE
124
74
0
17 Aug 2020
Playing Catan with Cross-dimensional Neural Network
Quentin Gendre
Tomoyuki Kaneko
BDL
34
4
0
17 Aug 2020
Reducing Sampling Error in Batch Temporal Difference Learning
Brahma S. Pavse
Ishan Durugkar
Josiah P. Hanna
Peter Stone
OffRL
71
12
0
15 Aug 2020
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
255
284
0
15 Aug 2020
Joint Policy Search for Multi-agent Collaboration with Imperfect Information
Yuandong Tian
Qucheng Gong
Tina Jiang
103
19
0
14 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL, OffRL
127
17
0
11 Aug 2020
Woodpecker-DL: Accelerating Deep Neural Networks via Hardware-Aware Multifaceted Optimizations
Yongchao Liu
Yue Jin
Yongqi Chen
Teng Teng
Hang Ou
Rui Zhao
Yao Zhang
110
1
0
11 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
82
72
0
08 Aug 2020
Convex Q-Learning, Part 1: Deterministic Optimal Control
P. Mehta
Sean P. Meyn
36
4
0
08 Aug 2020
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
49
5
0
07 Aug 2020
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
Ozsel Kilinc
Giovanni Montana
66
5
0
05 Aug 2020
Robust Deep Reinforcement Learning through Adversarial Loss
Tuomas P. Oikarinen
Wang Zhang
Alexandre Megretski
Luca Daniel
Tsui-Wei Weng
AAML
90
97
0
05 Aug 2020