Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

6 October 2020
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
    OffRL

Papers citing "Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning"

50 / 111 papers shown
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
S. Nikolaidis
38
1
0
09 Nov 2023
Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Suraj Singireddy
Andre Beckus
George K. Atia
Sumit Kumar Jha
Alvaro Velasquez
OffRL
22
0
0
29 Oct 2023
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
15
10
0
29 Oct 2023
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Alvaro Velasquez
Robert Wright
Jivko Sinapov
22
9
0
14 Oct 2023
Verifiable Reinforcement Learning Systems via Compositionality
Cyrus Neary
Aryaman Singh Samyal
Christos K. Verginis
Murat Cubuktepe
Ufuk Topcu
OffRL
CoGe
17
0
0
09 Sep 2023
Omega-Regular Reward Machines
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
OffRL
11
1
0
14 Aug 2023
On the Convergence of Bounded Agents
David Abel
André Barreto
Hado van Hasselt
Benjamin Van Roy
Doina Precup
Satinder Singh
20
4
0
20 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
31
1
0
11 Jul 2023
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning
E. Hurwitz
N. Peace
G. Cevora
OffRL
4
0
0
03 Jul 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?
Akansha Kalra
Daniel S. Brown
16
0
0
22 Jun 2023
Designing Equilibria in Concurrent Games with Social Welfare and Temporal Logic Constraints
Julian Gutierrez
Muhammad Najib
Giuseppe Perelli
Michael Wooldridge
9
0
0
05 Jun 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Cevahir Köprülü
Ufuk Topcu
10
3
0
25 May 2023
Latest Trends in Artificial Intelligence Technology: A Scoping Review
Teemu Niskanen
T. Sipola
Olli Väänänen
17
1
0
08 May 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees
Daqian Shao
M. Kwiatkowska
OffRL
21
6
0
02 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
14
3
0
30 Apr 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Chao Yu
Xuejing Zheng
H. Zhuo
OffRL
LRM
55
7
0
24 Apr 2023
Learning Reward Machines in Cooperative Multi-Agent Tasks
Leo Ardon
Daniel Furelos-Blanco
A. Russo
15
5
0
24 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP
A. Falah
Shibashis Guha
Ashutosh Trivedi
9
0
0
16 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey
Ankit Shah
G. Konidaris
25
3
0
09 Mar 2023
Eventual Discounting Temporal Logic Counterfactual Experience Replay
Cameron Voloshin
Abhinav Verma
Yisong Yue
OffRL
24
11
0
03 Mar 2023
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications
Zikang Xiong
Daniel Lawson
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
11
0
0
02 Mar 2023
Grounding Complex Natural Language Commands for Temporal Tasks in Unseen Environments
Jason Liu
Ziyi Yang
Ifrah Idrees
Sam Liang
Benjamin Schornstein
Stefanie Tellex
Ankit Parag Shah
LM&Ro
22
40
0
22 Feb 2023
Generalizing LTL Instructions via Future Dependent Options
Duo Xu
Faramarz Fekri
OffRL
AI4CE
24
1
0
08 Dec 2022
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines
Andrew C. Li
Zizhao Chen
Pashootan Vaezipoor
Toryn Q. Klassen
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL
12
9
0
20 Nov 2022
Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences
L. Guan
Karthik Valmeekam
Subbarao Kambhampati
51
8
0
28 Oct 2022
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion
Utkarsh Soni
Nupur Thakur
S. Sreedharan
L. Guan
Mudit Verma
Matthew Marquez
Subbarao Kambhampati
29
6
0
27 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications
Mingyu Cai
Makai Mann
Zachary Serlin
Kevin J. Leahy
C. Vasile
32
12
0
03 Oct 2022
Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning
Haotong Zhang
Hao Wang
Zheng Kan
OffRL
21
12
0
27 Sep 2022
Reward Learning using Structural Motifs in Inverse Reinforcement Learning
Raeid Saqur
18
2
0
25 Sep 2022
Learning Task Automata for Reinforcement Learning using Hidden Markov Models
Alessandro Abate
Y. Almulla
James Fox
David Hyland
Michael Wooldridge
OffRL
20
5
0
25 Aug 2022
Calculus on MDPs: Potential Shaping as a Gradient
Erik Jenner
H. V. Hoof
Adam Gleave
19
4
0
20 Aug 2022
Scaling up ML-based Black-box Planning with Partial STRIPS Models
M. Greco
Álvaro Torralba
Jorge A. Baier
Héctor Palacios
OffRL
13
0
0
10 Jul 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
12
3
0
28 Jun 2022
Recursive Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
LRM
14
0
0
23 Jun 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
21
49
0
19 Jun 2022
Specification-Guided Learning of Nash Equilibria with High Social Welfare
Kishor Jothimurugan
Suguman Bansal
Osbert Bastani
Rajeev Alur
31
9
0
06 Jun 2022
Hierarchies of Reward Machines
Daniel Furelos-Blanco
Mark Law
Anders Jonsson
Krysia Broda
A. Russo
19
8
0
31 May 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning
Geraud Nangue Tasse
Devon Jarvis
Steven D. James
Benjamin Rosman
36
4
0
25 May 2022
Moral reinforcement learning using actual causation
Tue Herlau
11
0
0
17 May 2022
Accelerated Reinforcement Learning for Temporal Logic Control Objectives
Y. Kantaros
11
11
0
09 May 2022
A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines
Weichao Zhou
Wenchao Li
BDL
20
11
0
20 Apr 2022
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics
Christos K. Verginis
Cevahir Köprülü
Sandeep P. Chinchali
Ufuk Topcu
25
10
0
20 Apr 2022
A Framework for Following Temporal Logic Instructions with Unknown Causal Dependencies
Duo Xu
Faramarz Fekri
25
2
0
07 Apr 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances
Robby Costales
Shariq Iqbal
Fei Sha
18
5
0
23 Mar 2022
Hierarchical Reinforcement Learning with AI Planning Models
Junkyu Lee
Michael Katz
Don Joven Agravante
Miao Liu
Geraud Nangue Tasse
Tim Klinger
Shirin Sohrabi
18
1
0
01 Mar 2022
Knowledge-Integrated Informed AI for National Security
Anu Myne
Kevin J. Leahy
Ryan Soklaski
19
0
0
04 Feb 2022
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications
Mingyu Cai
Erfan Aasi
C. Belta
C. Vasile
27
24
0
28 Jan 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning
Rodrigo Toro Icarte
Ethan Waldie
Toryn Q. Klassen
Richard Valenzano
Margarita P. Castro
Sheila A. McIlraith
11
13
0
17 Dec 2021
Programmatic Reward Design by Example
Weichao Zhou
Wenchao Li
34
15
0
14 Dec 2021
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines
Xuejing Zheng
Chao Yu
C. L. P. Chen
Jianye Hao
H. Zhuo
CLL
OffRL
17
9
0
18 Nov 2021