Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.08946
Cited By
Action Branching Architectures for Deep Reinforcement Learning
24 November 2017
Arash Tavakoli
Fabio Pardo
Petar Kormushev
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Action Branching Architectures for Deep Reinforcement Learning"
31 / 31 papers shown
Title
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN
AI4CE
41
0
0
05 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
45
0
0
30 Apr 2025
Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids
Caio Fabio Oliveira da Silva
Azita Dabiri
B. de Schutter
50
4
0
17 Sep 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
92
3
0
20 Aug 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
19
0
0
13 Feb 2024
Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in Autonomous Edge Systems
Cheng Zhang
Yinuo Deng
Hailiang Zhao
Tianlv Chen
Shuiguang Deng
23
1
0
28 Dec 2023
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
Terence Jie Chua
Wen-li Yu
Junfeng Zhao
Kwok-Yan Lam
FedML
29
5
0
26 Oct 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
25
5
0
03 Aug 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
53
39
0
02 May 2023
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments
Drew Penney
Bin Li
Lizhong Chen
J. Sydir
Anna Drewek-Ossowicka
R. Illikkal
Charlie Tai
R. Iyer
Andrew J. Herdrich
31
1
0
10 Apr 2023
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
Allen Z. Ren
Hongkai Dai
Benjamin Burchfiel
Anirudha Majumdar
27
14
0
09 Feb 2023
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
35
22
0
22 Oct 2022
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
25
87
0
08 May 2022
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
26
44
0
03 Nov 2021
Learning Large Neighborhood Search Policy for Integer Programming
Yaoxin Wu
Wen Song
Zhiguang Cao
Jie Zhang
27
40
0
01 Nov 2021
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
28
12
0
17 Sep 2021
Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran
Ashlesha Akella
Maxwell Standen
Junae Kim
David Bowman
Toby J. Richer
Chin-Teng Lin Institution One
46
38
0
14 Sep 2021
Learning Practically Feasible Policies for Online 3D Bin Packing
Hang Zhao
Chenyang Zhu
Xin Xu
Hui Huang
Kai Xu
OffRL
18
80
0
31 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
NavTuner: Learning a Scene-Sensitive Family of Navigation Policies
Haoxin Ma
Justin S. Smith
Patricio A. Vela
23
4
0
02 Mar 2021
Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks
Pierre Aumjaud
David McAuliffe
Francisco J. Rodríguez-Lera
P. Cardiff
19
15
0
11 Nov 2020
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling
Tegg Taekyong Sung
J. Ha
Jeewoo Kim
Alex Yahja
Chae-Bong Sohn
Bo Ryu
21
9
0
15 May 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
26
60
0
22 Jan 2020
Discrete and Continuous Action Representation for Practical RL in Video Games
Olivier Delalleau
Maxim Peter
Eloi Alonso
Adrien Logut
17
52
0
23 Dec 2019
MACS: Deep Reinforcement Learning based SDN Controller Synchronization Policy Design
Ziyao Zhang
Liang Ma
Konstantinos Poularakis
K. Leung
J. Tucker
A. Swami
11
14
0
19 Sep 2019
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
38
0
28 Jun 2019
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
118
0
29 Jan 2019
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks
Fabio Pardo
Vitaly Levdik
Petar Kormushev
25
4
0
06 Oct 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
13
2
0
26 Jul 2018
1