ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.14171
  4. Cited By
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

25 June 2020
Shengyi Huang
Santiago Ontañón
ArXivPDFHTML

Papers citing "A Closer Look at Invalid Action Masking in Policy Gradient Algorithms"

50 / 82 papers shown
Title
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
A Goal-Oriented Reinforcement Learning-Based Path Planning Algorithm for Modular Self-Reconfigurable Satellites
Bofei Liu
Dong Ye
Zunhao Yao
Zhaowei Sun
33
0
0
04 May 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
35
0
0
16 Apr 2025
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Mirko Stappert
Bernhard Lutz
Niklas Goby
Dirk Neumann
OffRL
31
0
0
03 Apr 2025
Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control
Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control
Eloy Anguiano Batanero
Ángela Fernández
Álvaro Barbero
72
0
0
26 Mar 2025
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree
Optimizing Navigation And Chemical Application in Precision Agriculture With Deep Reinforcement Learning And Conditional Action Tree
Mahsa Khosravi
Zhanhong Jiang
Joshua R. Waite
Sarah Jonesc
Hernan Torres
Arti Singh
Baskar Ganapathysubramanian
Asheesh Kumar Singh
S. Sarkar
41
0
0
23 Mar 2025
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
Minori Narita
Ryo Kuroiwa
J. Christopher Beck
49
0
0
20 Mar 2025
Embodied Escaping: End-to-End Reinforcement Learning for Robot Navigation in Narrow Environment
Han Zheng
Jun Zhang
Mingyang Jiang
Peiyuan Liu
Danni Liu
Tong Qin
Ming Yang
166
0
0
05 Mar 2025
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
Fangqi Liu
Rishav Sen
J. P. Talusan
Ava Pettet
Aaron Kandel
Yoshinori Suzue
Ayan Mukhopadhyay
A. Dubey
OffRL
39
0
0
24 Feb 2025
Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning
Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning
Austin Yubo He
Zi-Wen Liu
97
3
0
21 Feb 2025
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?
Michael Doherty
Robin Matzner
Rasoul Sadeghi
Polina Bayvel
Alejandra Beghelli
65
0
0
18 Feb 2025
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Shilong Zhang
Wenbo Li
Shoufa Chen
Chongjian Ge
Peize Sun
Yunke Zhang
Yi-Xin Jiang
Zehuan Yuan
Binyue Peng
Ping Luo
DiffM
VGen
101
3
0
07 Feb 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
50
0
0
24 Jan 2025
Integrating Transit Signal Priority into Multi-Agent Reinforcement
  Learning based Traffic Signal Control
Integrating Transit Signal Priority into Multi-Agent Reinforcement Learning based Traffic Signal Control
Dickness Kwesiga
Suyash Chandra Vishnoi
Angshuman Guin
Michael Hunter
73
0
0
28 Nov 2024
Effective Analog ICs Floorplanning with Relational Graph Neural Networks
  and Reinforcement Learning
Effective Analog ICs Floorplanning with Relational Graph Neural Networks and Reinforcement Learning
Davide Basso
Luca Bortolussi
Mirjana Videnovic-Misic
Husni M. Habal
60
1
0
20 Nov 2024
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh
  Smoothing
GNNRL-Smoothing: A Prior-Free Reinforcement Learning Model for Mesh Smoothing
Zhichao Wang
Xinhai Chen
Chunye Gong
Bo Yang
Liang Deng
Yufei Sun
Yufei Pang
Jie Liu
AI4CE
32
0
0
19 Oct 2024
Multi-Agent Actor-Critics in Autonomous Cyber Defense
Multi-Agent Actor-Critics in Autonomous Cyber Defense
Mingjun Wang
Remington Dechene
31
0
0
11 Oct 2024
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile Robots
Milad Farjadnasab
Shahin Sirouspour
38
0
0
08 Oct 2024
Climate Adaptation with Reinforcement Learning: Experiments with
  Flooding and Transportation in Copenhagen
Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen
Miguel Costa
Morten W. Petersen
Arthur Vandervoort
Martin Drews
Karyn Morrissey
Francisco C. Pereira
AI4CE
27
0
0
27 Sep 2024
Revisiting Space Mission Planning: A Reinforcement Learning-Guided
  Approach for Multi-Debris Rendezvous
Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous
Agni Bandyopadhyay
Guenther Waxenegger-Wilfing
26
0
0
25 Sep 2024
Applying Action Masking and Curriculum Learning Techniques to Improve
  Data Efficiency and Overall Performance in Operational Technology Cyber
  Security using Reinforcement Learning
Applying Action Masking and Curriculum Learning Techniques to Improve Data Efficiency and Overall Performance in Operational Technology Cyber Security using Reinforcement Learning
Alec Wilson
William Holmes
Ryan Menzies
Kez Smithson Whitehead
33
0
0
13 Sep 2024
Cooperative Path Planning with Asynchronous Multiagent Reinforcement
  Learning
Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning
Jiaming Yin
Weixiong Rao
Yu Xiao
Keshuang Tang
21
0
0
01 Sep 2024
DECAF: a Discrete-Event based Collaborative Human-Robot Framework for
  Furniture Assembly
DECAF: a Discrete-Event based Collaborative Human-Robot Framework for Furniture Assembly
Giulio Giacomuzzo
Matteo Terreran
Siddarth Jain
Diego Romeres
23
1
0
28 Aug 2024
Earth Observation Satellite Scheduling with Graph Neural Networks
Earth Observation Satellite Scheduling with Graph Neural Networks
Antoine Jacquet
Guillaume Infantes
Nicolas Meuleau
Emmanuel Benazera
Stéphanie Roussel
Vincent Baudoui
Jonathan Guerra
25
0
0
27 Aug 2024
Scenario-based Thermal Management Parametrization Through Deep
  Reinforcement Learning
Scenario-based Thermal Management Parametrization Through Deep Reinforcement Learning
Thomas Rudolf
Philip Muhl
Sören Hohmann
Lutz Eckstein
29
0
0
04 Aug 2024
Field Deployment of Multi-Agent Reinforcement Learning Based Variable
  Speed Limit Controllers
Field Deployment of Multi-Agent Reinforcement Learning Based Variable Speed Limit Controllers
Yuhang Zhang
Zhiyao Zhang
Marcos Quiñones-Grueiro
William Barbour
Clay Weston
Gautam Biswas
Daniel Work
27
4
0
10 Jul 2024
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha
  Factors
AlphaForge: A Framework to Mine and Dynamically Combine Formulaic Alpha Factors
Hao Shi
Weili Song
Xinting Zhang
Jiahe Shi
Cuicui Luo
Xiang Ao
Hamid Arian
Luis Seco
34
2
0
26 Jun 2024
Injecting Combinatorial Optimization into MCTS: Application to the Board
  Game boop
Injecting Combinatorial Optimization into MCTS: Application to the Board Game boop
Florian Richoux
29
2
0
13 Jun 2024
Excluding the Irrelevant: Focusing Reinforcement Learning through
  Continuous Action Masking
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking
Roland Stolz
Hanna Krasowski
Jakob Thumm
Michael Eichelbeck
Philipp Gassert
Matthias Althoff
CLL
30
2
0
06 Jun 2024
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios
Mingyang Jiang
Yueyuan Li
Songan Zhang
Siyuan Chen
Chunxiang Wang
Ming Yang
51
4
0
31 May 2024
Safety through Permissibility: Shield Construction for Fast and Safe
  Reinforcement Learning
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning
A. Politowicz
Sahisnu Mazumder
Bing-Quan Liu
31
0
0
29 May 2024
Egret: Reinforcement Mechanism for Sequential Computation Offloading in
  Edge Computing
Egret: Reinforcement Mechanism for Sequential Computation Offloading in Edge Computing
Haosong Peng
Yufeng Zhan
Dihua Zhai
Xiaopu Zhang
Yuanqing Xia
33
1
0
14 Apr 2024
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning
Shang Wang
Deepak Ranganatha Sastry Mamillapalli
Tianpei Yang
Matthew E. Taylor
36
0
0
11 Apr 2024
Deep Reinforcement Learning-Based Approach for a Single Vehicle
  Persistent Surveillance Problem with Fuel Constraints
Deep Reinforcement Learning-Based Approach for a Single Vehicle Persistent Surveillance Problem with Fuel Constraints
Manav Mishra
Hritik Bana
Saswata Sarkar
Sujeevraja Sanjeevi
PB Sujit
K. Sundar
21
0
0
09 Apr 2024
Intervention-Assisted Policy Gradient Methods for Online Stochastic
  Queuing Network Optimization: Technical Report
Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report
Jerrod Wigmore
B. Shrader
E. Modiano
OffRL
32
1
0
05 Apr 2024
Solving a Real-World Optimization Problem Using Proximal Policy
  Optimization with Curriculum Learning and Reward Engineering
Solving a Real-World Optimization Problem Using Proximal Policy Optimization with Curriculum Learning and Reward Engineering
Abhijeet Pendyala
Asma Atamna
Tobias Glasmachers
OffRL
27
1
0
03 Apr 2024
Scaling Team Coordination on Graphs with Reinforcement Learning
Scaling Team Coordination on Graphs with Reinforcement Learning
Manshi Limbu
Zechen Hu
Xuan Wang
Daigo Shishika
Xuesu Xiao
28
4
0
09 Mar 2024
Learning to Solve Job Shop Scheduling under Uncertainty
Learning to Solve Job Shop Scheduling under Uncertainty
Guillaume Infantes
Stéphanie Roussel
Pierre Pereira
Antoine Jacquet
Emmanuel Benazera
35
3
0
04 Mar 2024
Circuit Partitioning for Multi-Core Quantum Architectures with Deep
  Reinforcement Learning
Circuit Partitioning for Multi-Core Quantum Architectures with Deep Reinforcement Learning
Arnau Pastor
Pau Escofet
Sahar Ben Rached
Eduard Alarcón
Pere Barlet-Ros
S. Abadal
GNN
39
5
0
31 Jan 2024
Introducing PetriRL: An Innovative Framework for JSSP Resolution
  Integrating Petri nets and Event-based Reinforcement Learning
Introducing PetriRL: An Innovative Framework for JSSP Resolution Integrating Petri nets and Event-based Reinforcement Learning
Sofiene Lassoued
Andreas Schwung
OffRL
18
5
0
23 Jan 2024
Generative Modelling of Stochastic Actions with Arbitrary Constraints in
  Reinforcement Learning
Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning
Changyu Chen
Ramesha Karunasena
Thanh Hong Nguyen
Arunesh Sinha
Pradeep Varakantham
23
9
0
26 Nov 2023
MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable
  Speed Limits
MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable Speed Limits
Yuhang Zhang
Marcos Quiñones-Grueiro
Zhiyao Zhang
Yanbing Wang
William Barbour
Gautam Biswas
Dan Work
38
5
0
18 Oct 2023
Learning to Recharge: UAV Coverage Path Planning through Deep
  Reinforcement Learning
Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning
Mirco Theile
Harald Bayerlein
Marco Caccamo
Alberto L. Sangiovanni-Vincentelli
29
5
0
06 Sep 2023
The Impact of Overall Optimization on Warehouse Automation
The Impact of Overall Optimization on Warehouse Automation
H. Yoshitake
Pieter Abbeel
OffRL
31
1
0
11 Aug 2023
Reinforcement Learning -based Adaptation and Scheduling Methods for
  Multi-source DASH
Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASH
Nghia T. Nguyen
Long Luu
Phuong Vo
Sang Nguyen
Cuong T. Do
Ngoc-Thanh Nguyen
AI4TS
23
1
0
25 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement
  Learning
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
50
2
0
21 Jul 2023
Learning Hierarchical Interactive Multi-Object Search for Mobile
  Manipulation
Learning Hierarchical Interactive Multi-Object Search for Mobile Manipulation
F. Schmalstieg
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
24
14
0
12 Jul 2023
A Framework for dynamically meeting performance objectives on a service
  mesh
A Framework for dynamically meeting performance objectives on a service mesh
Forough Shahab Samani
Rolf Stadler
25
3
0
25 Jun 2023
Generating Synergistic Formulaic Alpha Collections via Reinforcement
  Learning
Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning
Shuo Yu
Hongyan Xue
Xiang Ao
Feiyang Pan
Jia He
Dandan Tu
Qing He
AIFin
35
11
0
25 May 2023
MARC: A multi-agent robots control framework for enhancing reinforcement
  learning in construction tasks
MARC: A multi-agent robots control framework for enhancing reinforcement learning in construction tasks
Kangkang Duan
C. W. Suen
Zhengbo Zou
20
1
0
23 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
12
Next