arXiv:1602.01783
Asynchronous Methods for Deep Reinforcement Learning
4 February 2016
Volodymyr Mnih
Adrià Puigdomènech Badia
Mehdi Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
Papers citing
"Asynchronous Methods for Deep Reinforcement Learning"
50 / 1,546 papers shown
Title
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
122
7
0
27 Oct 2023
Learning to bag with a simulation-free reinforcement learning framework for robots
Francisco Munguia-Galeano
Jihong Zhu
Juan David Hernández
Ze Ji
30
0
0
22 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
38
1
0
16 Oct 2023
Towards Semantic Communication Protocols for 6G: From Protocol Learning to Language-Oriented Approaches
Jihong Park
Seung-Woo Ko
Jinho Choi
Seong-Lyun Kim
M. Bennis
39
7
0
14 Oct 2023
Offline Reinforcement Learning for Optimizing Production Bidding Policies
D. Korenkevych
Frank Cheng
Artsiom Balakir
Alex Nikulkov
Lingnan Gao
Zhihao Cen
Zuobing Xu
Zheqing Zhu
OffRL
31
1
0
13 Oct 2023
Evading Community Detection via Counterfactual Neighborhood Search
Andrea Bernini
Fabrizio Silvestri
Gabriele Tolomei
BDL
41
1
0
13 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Deep reinforcement learning for machine scheduling: Methodology, the state-of-the-art, and future directions
Maziyar Khadivi
Todd Charter
Marjan Yaghoubi
Masoud Jalayer
Maryam Ahang
Ardeshir Shojaeinasab
Homayoun Najjaran
35
11
0
04 Oct 2023
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation
Benjamin Steenhoek
Michele Tufano
Neel Sundaresan
Alexey Svyatkovskiy
OffRL
ALM
55
18
0
03 Oct 2023
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment
Tianhao Wu
Banghua Zhu
Ruoyu Zhang
Zhaojin Wen
Kannan Ramchandran
Jiantao Jiao
44
55
0
30 Sep 2023
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints
Chaoqi Wang
Yibo Jiang
Yuguang Yang
Han Liu
Yuxin Chen
42
82
0
28 Sep 2023
Gray-box Adversarial Attack of Deep Reinforcement Learning-based Trading Agents
Foozhan Ataiefard
Hadi Hemmati
AAML
29
2
0
26 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
37
9
0
25 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
Trip Planning for Autonomous Vehicles with Wireless Data Transfer Needs Using Reinforcement Learning
Yousef AlSaqabi
Bhaskar Krishnamachari
28
2
0
21 Sep 2023
Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces
Andrea Angiuli
J. Fouque
Ruimeng Hu
Alan Raydan
37
5
0
19 Sep 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
34
9
0
05 Sep 2023
Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection
Yazhou Xing
Amrita Mazumdar
Anjul Patney
Chao Liu
Hongxu Yin
Qifeng Chen
Jan Kautz
I. Frosio
52
1
0
29 Aug 2023
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences
Zhishen Huang
31
1
0
28 Aug 2023
Target-independent XLA optimization using Reinforcement Learning
Milan Ganai
Haichen Li
Theodore Enns
Yida Wang
Randy Huang
44
0
0
28 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan Z. Sheng
Julian McAuley
Lina Yao
SyDa
53
10
0
28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning
Jacob Wiebe
Ranwa Al Mallah
Li Li
AAML
36
3
0
25 Aug 2023
A Deep Reinforcement Learning based Algorithm for Time and Cost Optimized Scaling of Serverless Applications
Anupama Mampage
S. Karunasekera
Rajkumar Buyya
44
3
0
22 Aug 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
38
0
0
13 Aug 2023
Bag of Policies for Distributional Deep Exploration
Asen Nachkov
Luchen Li
Giulia Luise
Filippo Valdettaro
Aldo A. Faisal
OffRL
43
0
0
03 Aug 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
27
3
0
29 Jul 2023
Dialogue Shaping: Empowering Agents through NPC Interaction
Wei Zhou
Xiangyu Peng
Mark O. Riedl
LLMAG
38
8
0
28 Jul 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
26
1
0
22 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
45
5
0
20 Jul 2023
Robust Driving Policy Learning with Guided Meta Reinforcement Learning
Kanghoon Lee
Jiachen Li
David Isele
Jinkyoo Park
K. Fujimura
Mykel J. Kochenderfer
31
5
0
19 Jul 2023
Data Cross-Segmentation for Improved Generalization in Reinforcement Learning Based Algorithmic Trading
Vikram Duvvur
Aashay Mehta
Edward W. Sun
Bo Wu
Ken Yew Chan
J. Schneider
AIFin
32
0
0
18 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
40
4
0
16 Jul 2023
A Survey From Distributed Machine Learning to Distributed Deep Learning
Mohammad Dehghani
Zahra Yazdanparast
26
0
0
11 Jul 2023
Secrets of RLHF in Large Language Models Part I: PPO
Rui Zheng
Shihan Dou
Songyang Gao
Yuan Hua
Wei Shen
...
Hang Yan
Tao Gui
Qi Zhang
Xipeng Qiu
Xuanjing Huang
ALM
OffRL
55
160
0
11 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
Decentralized Multi-Agent Reinforcement Learning with Global State Prediction
Josh Bloom
Pranjal Paliwal
Apratim Mukherjee
Carlo Pinciroli
30
3
0
22 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
35
1
0
21 Jun 2023
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
36
4
0
20 Jun 2023
NASimEmu: Network Attack Simulator & Emulator for Training Agents Generalizing to Novel Scenarios
Jaromír Janisch
Tomáš Pevný
Viliam Lisý
23
13
0
26 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
36
19
0
26 May 2023
Lucy-SKG: Learning to Play Rocket League Efficiently Using Deep Reinforcement Learning
V. Moschopoulos
Pantelis Kyriakidis
A. Lazaridis
I. Vlahavas
23
0
0
25 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Road Planning for Slums via Deep Reinforcement Learning
Y. Zheng
Hongyuan Su
Jingtao Ding
Depeng Jin
Yong Li
19
13
0
22 May 2023
Continually Improving Extractive QA via Human Feedback
Ge Gao
Hung-Ting Chen
Yoav Artzi
Eunsol Choi
31
12
0
21 May 2023
Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction
Shulu Chen
A. Evans
Marc Brittain
Peng Wei
30
15
0
17 May 2023
Graph Reinforcement Learning for Network Control via Bi-Level Optimization
Daniele Gammelli
James Harrison
Kaidi Yang
Marco Pavone
Filipe Rodrigues
Francisco Câmara Pereira
AI4CE
46
6
0
16 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
39
3
0
11 May 2023
Optimizing Memory Mapping Using Deep Reinforcement Learning
Pengming Wang
Mikita Sazanovich
Berkin Ilbeyi
P. Phothilimthana
Manish Purohit
...
R. Tung
Paula Kurylowicz
Kieran Milan
Oriol Vinyals
D. Mankowitz
22
4
0
11 May 2023
Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas
H. Khadilkar
32
0
0
10 May 2023