ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.08946
  4. Cited By
Action Branching Architectures for Deep Reinforcement Learning

Action Branching Architectures for Deep Reinforcement Learning

24 November 2017
Arash Tavakoli
Fabio Pardo
Petar Kormushev
ArXivPDFHTML

Papers citing "Action Branching Architectures for Deep Reinforcement Learning"

31 / 31 papers shown
Title
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework
Andrzej Mizera
Jakub Zarzycki
GNN
AI4CE
41
0
0
05 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
45
0
0
30 Apr 2025
Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids
Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids
Caio Fabio Oliveira da Silva
Azita Dabiri
B. de Schutter
50
4
0
17 Sep 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
92
3
0
20 Aug 2024
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Andrzej Mizera
Jakub Zarzycki
19
0
0
13 Feb 2024
Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in
  Autonomous Edge Systems
Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in Autonomous Edge Systems
Cheng Zhang
Yinuo Deng
Hailiang Zhao
Tianlv Chen
Shuiguang Deng
23
1
0
28 Dec 2023
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine
  Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation
  Models with Mobile Edge Computing
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
Terence Jie Chua
Wen-li Yu
Junfeng Zhao
Kwok-Yan Lam
FedML
29
5
0
26 Oct 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning
  Agents
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
25
5
0
03 Aug 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement
  Learning in Healthcare
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
53
39
0
02 May 2023
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud
  Environments
RAPID: Enabling Fast Online Policy Learning in Dynamic Public Cloud Environments
Drew Penney
Bin Li
Lizhong Chen
J. Sydir
Anna Drewek-Ossowicka
R. Illikkal
Charlie Tai
R. Iyer
Andrew J. Herdrich
31
1
0
10 Apr 2023
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer
Allen Z. Ren
Hongkai Dai
Benjamin Burchfiel
Anirudha Majumdar
27
14
0
09 Feb 2023
Solving Continuous Control via Q-learning
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
35
22
0
22 Oct 2022
MAN: Multi-Action Networks Learning
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
21
3
0
19 Sep 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by
  Reconfigurable Intelligent Surfaces
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
25
87
0
08 May 2022
Is Bang-Bang Control All You Need? Solving Continuous Control with
  Bernoulli Policies
Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies
Tim Seyde
Igor Gilitschenski
Wilko Schwarting
Bartolomeo Stellato
Martin Riedmiller
Markus Wulfmeier
Daniela Rus
26
44
0
03 Nov 2021
Learning Large Neighborhood Search Policy for Integer Programming
Learning Large Neighborhood Search Policy for Integer Programming
Yaoxin Wu
Wen Song
Zhiguang Cao
Jie Zhang
27
40
0
01 Nov 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
Soft Actor-Critic With Integer Actions
Soft Actor-Critic With Integer Actions
Ting-Han Fan
Yubo Wang
28
12
0
17 Sep 2021
Deep hierarchical reinforcement agents for automated penetration testing
Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran
Ashlesha Akella
Maxwell Standen
Junae Kim
David Bowman
Toby J. Richer
Chin-Teng Lin Institution One
46
38
0
14 Sep 2021
Learning Practically Feasible Policies for Online 3D Bin Packing
Learning Practically Feasible Policies for Online 3D Bin Packing
Hang Zhao
Chenyang Zhu
Xin Xu
Hui Huang
Kai Xu
OffRL
18
80
0
31 Aug 2021
Implicitly Regularized RL with Implicit Q-Values
Implicitly Regularized RL with Implicit Q-Values
Nino Vieillard
Marcin Andrychowicz
Anton Raichuk
Olivier Pietquin
M. Geist
OffRL
24
9
0
16 Aug 2021
NavTuner: Learning a Scene-Sensitive Family of Navigation Policies
NavTuner: Learning a Scene-Sensitive Family of Navigation Policies
Haoxin Ma
Justin S. Smith
Patricio A. Vela
23
4
0
02 Mar 2021
Reinforcement Learning Experiments and Benchmark for Solving Robotic
  Reaching Tasks
Reinforcement Learning Experiments and Benchmark for Solving Robotic Reaching Tasks
Pierre Aumjaud
David McAuliffe
Francisco J. Rodríguez-Lera
P. Cardiff
19
15
0
11 Nov 2020
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC)
  Resource Scheduling
DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling
Tegg Taekyong Sung
J. Ha
Jeewoo Kim
Alex Yahja
Chae-Bong Sohn
Bo Ryu
21
9
0
15 May 2020
Q-Learning in enormous action spaces via amortized approximate
  maximization
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
26
60
0
22 Jan 2020
Discrete and Continuous Action Representation for Practical RL in Video
  Games
Discrete and Continuous Action Representation for Practical RL in Video Games
Olivier Delalleau
Maxim Peter
Eloi Alonso
Adrien Logut
17
52
0
23 Dec 2019
MACS: Deep Reinforcement Learning based SDN Controller Synchronization
  Policy Design
MACS: Deep Reinforcement Learning based SDN Controller Synchronization Policy Design
Ziyao Zhang
Liang Ma
Konstantinos Poularakis
K. Leung
J. Tucker
A. Swami
11
14
0
19 Sep 2019
Growing Action Spaces
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
38
0
28 Jun 2019
Discretizing Continuous Action Space for On-Policy Optimization
Discretizing Continuous Action Space for On-Policy Optimization
Yunhao Tang
Shipra Agrawal
OffRL
26
118
0
29 Jan 2019
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional
  Neural Networks
Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks
Fabio Pardo
Vitaly Levdik
Petar Kormushev
25
4
0
06 Oct 2018
ToriLLE: Learning Environment for Hand-to-Hand Combat
ToriLLE: Learning Environment for Hand-to-Hand Combat
Anssi Kanervisto
Ville Hautamaki
13
2
0
26 Jul 2018
1