ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.03950
  4. Cited By
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning

Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

6 October 2020
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
    OffRL
ArXivPDFHTML

Papers citing "Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning"

50 / 111 papers shown
Title
HypRL: Reinforcement Learning of Control Policies for Hyperproperties
HypRL: Reinforcement Learning of Control Policies for Hyperproperties
Tzu-Han Hsu
Arshia Rafieioskouei
Borzoo Bonakdarpour
36
0
0
07 Apr 2025
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
A Survey of Reinforcement Learning-Based Motion Planning for Autonomous Driving: Lessons Learned from a Driving Task Perspective
Zhuoren Li
Guizhe Jin
Ran Yu
Z. Chen
Nan I. Li
...
Lu Xiong
Bo Leng
Jia Hu
I. Kolmanovsky
Dimitar Filev
51
0
0
31 Mar 2025
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
Pierrick Lorang
Hong Lu
Matthias Scheutz
43
0
0
06 Mar 2025
Graph Learning for Numeric Planning
Graph Learning for Numeric Planning
Dillon Z. Chen
Sylvie Thiébaux
42
0
0
08 Jan 2025
Adaptive Reward Design for Reinforcement Learning in Complex Robotic
  Tasks
Adaptive Reward Design for Reinforcement Learning in Complex Robotic Tasks
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
67
2
0
14 Dec 2024
Accelerating Proximal Policy Optimization Learning Using Task Prediction
  for Solving Environments with Delayed Rewards
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
A. Ahmad
Mehdi Kermanshah
Kevin J. Leahy
Zachary Serlin
H. Siu
Makai Mann
C. Vasile
Roberto Tron
C. Belta
OffRL
66
0
0
26 Nov 2024
Model Checking for Reinforcement Learning in Autonomous Driving: One Can
  Do More Than You Think!
Model Checking for Reinforcement Learning in Autonomous Driving: One Can Do More Than You Think!
Rong Gu
68
0
0
21 Nov 2024
Pluralistic Alignment Over Time
Pluralistic Alignment Over Time
Toryn Q. Klassen
P. A. Alamdari
Sheila A. McIlraith
31
1
0
16 Nov 2024
Multi-agent Path Finding for Timed Tasks using Evolutionary Games
Multi-agent Path Finding for Timed Tasks using Evolutionary Games
Sheryl Paul
Anand Balakrishnan
Xin Qin
Jyotirmoy V. Deshmukh
26
0
0
15 Nov 2024
Show, Don't Tell: Learning Reward Machines from Demonstrations for
  Reinforcement Learning-Based Cardiac Pacemaker Synthesis
Show, Don't Tell: Learning Reward Machines from Demonstrations for Reinforcement Learning-Based Cardiac Pacemaker Synthesis
John Komp
Dananjay Srinivas
Maria Leonor Pacheco
Ashutosh Trivedi
AI4TS
29
0
0
04 Nov 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
Beyazit Yalcinkaya
Niklas Lauffer
Marcell Vazquez-Chanlatte
S. Seshia
AI4CE
47
5
0
31 Oct 2024
Neural Model Checking
Neural Model Checking
Mirco Giacobbe
Daniel Kroening
Abhinandan Pal
Michael Tautschnig
NAI
24
1
0
31 Oct 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Robot Policy Learning with Temporal Optimal Transport Reward
Yuwei Fu
Haichao Zhang
Di Wu
Wei-ping Xu
Benoit Boulet
OffRL
37
1
0
29 Oct 2024
Sample-Efficient Reinforcement Learning with Temporal Logic Objectives:
  Leveraging the Task Specification to Guide Exploration
Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration
Y. Kantaros
Jun Wang
27
5
0
16 Oct 2024
Generalization of Compositional Tasks with Logical Specification via
  Implicit Planning
Generalization of Compositional Tasks with Logical Specification via Implicit Planning
Duo Xu
Faramarz Fekri
OffRL
23
1
0
13 Oct 2024
Deep Learning for Generalised Planning with Background Knowledge
Deep Learning for Generalised Planning with Background Knowledge
Dillon Z. Chen
Rostislav Horčík
Gustav Šír
35
1
0
10 Oct 2024
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
Mathias Jackermeier
Alessandro Abate
OffRL
40
1
0
06 Oct 2024
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models
AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models
Yifei Yao
Wentao He
Chenyu Gu
Jiaheng Du
Fuwei Tan
Zhen Zhu
Junguo Lu
OffRL
31
2
0
13 Sep 2024
Learning Task Specifications from Demonstrations as Probabilistic
  Automata
Learning Task Specifications from Demonstrations as Probabilistic Automata
Mattijs Baert
Sam Leroux
Pieter Simoens
40
1
0
11 Sep 2024
Beyond Preferences in AI Alignment
Beyond Preferences in AI Alignment
Tan Zhi-Xuan
Micah Carroll
Matija Franklin
Hal Ashton
35
16
0
30 Aug 2024
Efficient Reinforcement Learning in Probabilistic Reward Machines
Efficient Reinforcement Learning in Probabilistic Reward Machines
Xiaofeng Lin
Xuezhou Zhang
54
0
0
19 Aug 2024
Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full
  Version)
Synthesis of Reward Machines for Multi-Agent Equilibrium Design (Full Version)
Muhammad Najib
Giuseppe Perelli
18
0
0
19 Aug 2024
Directed Exploration in Reinforcement Learning from Linear Temporal
  Logic
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
Marco Bagatella
Andreas Krause
Georg Martius
OffRL
31
1
0
18 Aug 2024
Neural Reward Machines
Neural Reward Machines
Elena Umili
F. Argenziano
Roberto Capobianco
NAI
25
2
0
16 Aug 2024
Maximally Permissive Reward Machines
Maximally Permissive Reward Machines
Giovanni Varricchione
N. Alechina
Mehdi Dastani
Brian Logan
33
0
0
15 Aug 2024
LLMs as Probabilistic Minimally Adequate Teachers for DFA Learning
LLMs as Probabilistic Minimally Adequate Teachers for DFA Learning
Lekai Chen
Ashutosh Trivedi
Alvaro Velasquez
21
1
0
06 Aug 2024
Three Dogmas of Reinforcement Learning
Three Dogmas of Reinforcement Learning
David Abel
Mark K. Ho
A. Harutyunyan
38
5
0
15 Jul 2024
Towards Adapting Reinforcement Learning Agents to New Tasks: Insights
  from Q-Values
Towards Adapting Reinforcement Learning Agents to New Tasks: Insights from Q-Values
Ashwin Ramaswamy
Ransalu Senanayake
19
0
0
14 Jul 2024
Bayesian Inverse Reinforcement Learning for Non-Markovian Rewards
Bayesian Inverse Reinforcement Learning for Non-Markovian Rewards
Noah Topper
Alvaro Velasquez
George K. Atia
BDL
OffRL
23
0
0
20 Jun 2024
Towards Real-World Efficiency: Domain Randomization in Reinforcement
  Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots
Bahador Beigomi
Zheng H. Zhu
32
0
0
10 Jun 2024
Reward Machines for Deep RL in Noisy and Uncertain Environments
Reward Machines for Deep RL in Noisy and Uncertain Environments
Andrew C. Li
Zizhao Chen
Toryn Q. Klassen
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
48
6
0
31 May 2024
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement
  Learning with Reward Machine
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine
Yuanliang Li
Hanzheng Dai
Jun Yan
36
3
0
24 May 2024
Numeric Reward Machines
Numeric Reward Machines
Kristina Levina
Nikolaos Pappas
Athanasios Karapantelakis
Aneta Vulgarakis Feljan
Jendrik Seipp
38
1
0
30 Apr 2024
LTL-Constrained Policy Optimization with Cycle Experience Replay
LTL-Constrained Policy Optimization with Cycle Experience Replay
Ameesh Shah
Cameron Voloshin
Chenxi Yang
Abhinav Verma
Swarat Chaudhuri
S. Seshia
29
1
0
17 Apr 2024
A Review of Reward Functions for Reinforcement Learning in the context
  of Autonomous Driving
A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving
Ahmed Abouelazm
Jonas Michel
J. M. Zöllner
38
6
0
12 Apr 2024
Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Guillermo Infante
David Kuric
Anders Jonsson
Vicencc Gómez
H. V. Hoof
OffRL
27
2
0
22 Mar 2024
Meta-operators for Enabling Parallel Planning Using Deep Reinforcement
  Learning
Meta-operators for Enabling Parallel Planning Using Deep Reinforcement Learning
Ángel Aso-Mollar
Eva Onaindia
OffRL
19
0
0
13 Mar 2024
Explainable Session-based Recommendation via Path Reasoning
Explainable Session-based Recommendation via Path Reasoning
Yang Cao
Shuo Shang
Jun Wang
Wei Zhang
20
2
0
28 Feb 2024
Transformable Gaussian Reward Function for Socially-Aware Navigation
  with Deep Reinforcement Learning
Transformable Gaussian Reward Function for Socially-Aware Navigation with Deep Reinforcement Learning
Jinyeob Kim
Sumin Kang
Sungwoo Yang
Beomjoon Kim
Jargalbaatar Yura
Donghan Kim
116
1
0
22 Feb 2024
Logical Specifications-guided Dynamic Task Sampling for Reinforcement
  Learning Agents
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Robert Wright
Alvaro Velasquez
Jivko Sinapov
18
0
0
06 Feb 2024
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient
  Minimum Radiation Exposure Pathway
RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway
B. Sadhu
Trijit Sadhu
S. Anand
AI4CE
17
0
0
01 Feb 2024
On the Limitations of Markovian Rewards to Express Multi-Objective,
  Risk-Sensitive, and Modal Tasks
On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks
Joar Skalse
Alessandro Abate
19
9
0
26 Jan 2024
Sample Efficient Reinforcement Learning by Automatically Learning to
  Compose Subtasks
Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks
Shuai Han
Mehdi Dastani
Shihan Wang
OffRL
42
1
0
25 Jan 2024
Detecting Hidden Triggers: Mapping Non-Markov Reward Functions to Markov
Detecting Hidden Triggers: Mapping Non-Markov Reward Functions to Markov
Gregory Hyde
Eugene Santos
19
0
0
20 Jan 2024
Counting Reward Automata: Sample Efficient Reinforcement Learning
  Through the Exploitation of Reward Function Structure
Counting Reward Automata: Sample Efficient Reinforcement Learning Through the Exploitation of Reward Function Structure
Tristan Bester
Benjamin Rosman
Steven D. James
Geraud Nangue Tasse
11
1
0
18 Dec 2023
Omega-Regular Decision Processes
Omega-Regular Decision Processes
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
19
0
0
14 Dec 2023
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision
  Making
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making
P. A. Alamdari
Toryn Q. Klassen
Elliot Creager
Sheila A. McIlraith
13
4
0
08 Dec 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Jun Wang
Hosein Hasanbeig
Kaiyuan Tan
Zihe Sun
Y. Kantaros
29
3
0
28 Nov 2023
Assessing the Robustness of Intelligence-Driven Reinforcement Learning
Assessing the Robustness of Intelligence-Driven Reinforcement Learning
Lorenzo Nodari
Federico Cerutti
11
1
0
15 Nov 2023
General Policies, Subgoal Structure, and Planning Width
General Policies, Subgoal Structure, and Planning Width
Blai Bonet
Hector Geffner
18
2
0
09 Nov 2023
123
Next