Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01783
Cited By
v1
v2 (latest)
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 3,591 papers shown
Title
Embedding in Recommender Systems: A Survey
Xiangyu Zhao
Maolin Wang
Xinjian Zhao
Jiansheng Li
Shucheng Zhou
D. Yin
Qing Li
Jiliang Tang
Ruocheng Guo
AI4TS
90
12
0
28 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
113
4
0
27 Oct 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
168
8
0
27 Oct 2023
Social Contract AI: Aligning AI Assistants with Implicit Group Norms
Jan-Philipp Fränken
Sam Kwok
Peixuan Ye
Kanishk Gandhi
Dilip Arumugam
Jared Moore
Alex Tamkin
Tobias Gerstenberg
Noah D. Goodman
70
9
0
26 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
54
1
0
26 Oct 2023
Combining Behaviors with the Successor Features Keyboard
Wilka Carvalho
Andre Saraiva
Angelos Filos
Andrew Kyle Lampinen
Loic Matthey
Richard L. Lewis
Honglak Lee
Satinder Singh
Danilo Jimenez Rezende
Daniel Zoran
84
4
0
24 Oct 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
74
3
0
23 Oct 2023
Policy Gradient with Kernel Quadrature
Satoshi Hayakawa
Tetsuro Morimura
OffRL
BDL
101
1
0
23 Oct 2023
Learning to bag with a simulation-free reinforcement learning framework for robots
Francisco Munguia-Galeano
Jihong Zhu
Juan David Hernández
Ze Ji
54
0
0
22 Oct 2023
Reward Shaping for Happier Autonomous Cyber Security Agents
Elizabeth Bates
V. Mavroudis
Chris Hicks
77
15
0
20 Oct 2023
Absolute Policy Optimization
Weiye Zhao
Feihan Li
Yifan Sun
Rui Chen
Tianhao Wei
Changliu Liu
134
4
0
20 Oct 2023
Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions
Pedram Agand
Alexey Iskrov
Mo Chen
58
5
0
19 Oct 2023
Learning to Optimise Climate Sensor Placement using a Transformer
Chen Wang
Victoria Huang
Gang Chen
Hui Ma
Bryce Chen
Jochen Schmidt
61
0
0
18 Oct 2023
Fact-based Agent modeling for Multi-Agent Reinforcement Learning
Baofu Fang
Caiming Zheng
Hao Wang
OffRL
82
0
0
18 Oct 2023
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning
Rui Zheng
Wei Shen
Yuan Hua
Wenbin Lai
Shihan Dou
...
Xiao Wang
Haoran Huang
Tao Gui
Qi Zhang
Xuanjing Huang
109
17
0
18 Oct 2023
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
Chao Li
Chen Gong
Qiang He
Xinwen Hou
71
1
0
17 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
57
1
0
16 Oct 2023
Mimicking the Maestro: Exploring the Efficacy of a Virtual AI Teacher in Fine Motor Skill Acquisition
Hadar Mulian
Segev Shlomov
Lior Limonad
Alessia Noccaro
Silvia Buscaglione
25
4
0
16 Oct 2023
Leveraging Topological Maps in Deep Reinforcement Learning for Multi-Object Navigation
Simon Hakenes
Tobias Glasmachers
53
1
0
16 Oct 2023
Deep Reinforcement Learning with Explicit Context Representation
Francisco Munguia-Galeano
Ah-Hwee Tan
Ze Ji
OffRL
80
2
0
15 Oct 2023
Solving Max-Min Fair Resource Allocations Quickly on Large Graphs
Pooria Namyar
Behnaz Arzani
Srikanth Kandula
Santiago Segarra
Daniel Crankshaw
Umesh Krishnaswamy
Ramesh Govindan
Himanshu Raj
59
9
0
15 Oct 2023
Towards Semantic Communication Protocols for 6G: From Protocol Learning to Language-Oriented Approaches
Jihong Park
Seung-Woo Ko
Jinho Choi
Seong-Lyun Kim
M. Bennis
82
8
0
14 Oct 2023
Offline Reinforcement Learning for Optimizing Production Bidding Policies
D. Korenkevych
Frank Cheng
Artsiom Balakir
Alex Nikulkov
Lingnan Gao
Zhihao Cen
Zuobing Xu
Zheqing Zhu
OffRL
71
1
0
13 Oct 2023
Evading Community Detection via Counterfactual Neighborhood Search
Andrea Bernini
Fabrizio Silvestri
Gabriele Tolomei
BDL
80
1
0
13 Oct 2023
Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey
Gregory Palmer
Chris Parry
Daniel J.B. Harrold
Chris Willis
AI4CE
90
1
0
11 Oct 2023
Information Content Exploration
Jacob Chmura
Hasham Burhani
Xiao Qi Shi
129
0
0
10 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
212
150
0
10 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
57
0
0
09 Oct 2023
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network
Zhenyu Tao
Weihong Xu
Xiaohu You
OffRL
76
4
0
07 Oct 2023
Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning
Yinda Chen
Wei-Ping Huang
Shenglong Zhou
Qi Chen
Zhiwei Xiong
73
26
0
06 Oct 2023
A Survey of Multi-Robot Motion Planning
Hoang-Dung Bui
27
1
0
05 Oct 2023
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
Samuel Garcin
James Doran
Shangmin Guo
Christopher G. Lucas
Stefano V. Albrecht
80
0
0
05 Oct 2023
A Review of Deep Reinforcement Learning in Serverless Computing: Function Scheduling and Resource Auto-Scaling
Amjad Yousef Majid
Eduard Marin
OffRL
53
3
0
05 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
73
12
0
04 Oct 2023
ProGO: Probabilistic Global Optimizer
Xinyu Zhang
Sujit Ghosh
56
1
0
04 Oct 2023
Learning to Scale Logits for Temperature-Conditional GFlowNets
Minsu Kim
Joohwan Ko
Taeyoung Yun
Dinghuai Zhang
Ling Pan
W. Kim
Jinkyoo Park
Emmanuel Bengio
Yoshua Bengio
AI4CE
117
24
0
04 Oct 2023
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Matthew Jackson
Minqi Jiang
Jack Parker-Holder
Risto Vuorio
Chris Xiaoxuan Lu
Gregory Farquhar
Shimon Whiteson
Jakob N. Foerster
OOD
64
9
0
04 Oct 2023
Local Search GFlowNets
Minsu Kim
Taeyoung Yun
Emmanuel Bengio
Dinghuai Zhang
Yoshua Bengio
SungSoo Ahn
Jinkyoo Park
112
39
0
04 Oct 2023
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation
Benjamin Steenhoek
Michele Tufano
Neel Sundaresan
Alexey Svyatkovskiy
OffRL
ALM
153
22
0
03 Oct 2023
Solving the Quadratic Assignment Problem using Deep Reinforcement Learning
P. Bagga
Arthur Delarue
44
1
0
02 Oct 2023
Modularity in Deep Learning: A Survey
Haozhe Sun
Isabelle Guyon
MoMe
108
3
0
02 Oct 2023
Adapting LLM Agents with Universal Feedback in Communication
Kuan-Chieh Wang
Yadong Lu
Michael Santacroce
Yeyun Gong
Chao Zhang
Yelong Shen
LLMAG
84
9
0
01 Oct 2023
Order-Preserving GFlowNets
Yihang Chen
Lukas Mauch
128
12
0
30 Sep 2023
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment
Tianhao Wu
Banghua Zhu
Ruoyu Zhang
Zhaojin Wen
Kannan Ramchandran
Jiantao Jiao
108
61
0
30 Sep 2023
Reinforcement Learning for Node Selection in Branch-and-Bound
Alexander Mattick
Christopher Mutschler
53
2
0
29 Sep 2023
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Shengyi Huang
Jiayi Weng
Rujikorn Charakorn
Min Lin
Zhongwen Xu
Santiago Ontañón
92
3
0
29 Sep 2023
Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents
Marco Pleines
Matthias Pallasch
Frank Zimmer
Mike Preuss
OffRL
60
1
0
29 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
70
1
0
28 Sep 2023
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints
Chaoqi Wang
Yibo Jiang
Yuguang Yang
Han Liu
Yuxin Chen
90
108
0
28 Sep 2023
Learning to Terminate in Object Navigation
Yuhang Song
Anh Nguyen
Chun-Yi Lee
62
3
0
28 Sep 2023
Previous
1
2
3
...
11
12
13
...
70
71
72
Next