Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.03950
Cited By
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
6 October 2020
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning"
50 / 111 papers shown
Title
Signal Temporal Logic-Guided Apprenticeship Learning
Aniruddh Gopinath Puranic
Jyotirmoy V. Deshmukh
S. Nikolaidis
38
1
0
09 Nov 2023
Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning
Suraj Singireddy
Andre Beckus
George K. Atia
Sumit Kumar Jha
Alvaro Velasquez
OffRL
22
0
0
29 Oct 2023
Behavior Alignment via Reward Function Optimization
Dhawal Gupta
Yash Chandak
Scott M. Jordan
Philip S. Thomas
Bruno Castro da Silva
15
10
0
29 Oct 2023
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents
Yash Shukla
Wenchang Gao
Vasanth Sarathy
Alvaro Velasquez
Robert Wright
Jivko Sinapov
22
9
0
14 Oct 2023
Verifiable Reinforcement Learning Systems via Compositionality
Cyrus Neary
Aryaman Singh Samyal
Christos K. Verginis
Murat Cubuktepe
Ufuk Topcu
OffRL
CoGe
17
0
0
09 Sep 2023
Omega-Regular Reward Machines
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
OffRL
11
1
0
14 Aug 2023
On the Convergence of Bounded Agents
David Abel
André Barreto
Hado van Hasselt
Benjamin Van Roy
Doina Precup
Satinder Singh
20
4
0
20 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
Guy Azran
Mohamad H. Danesh
Stefano V. Albrecht
Sarah Keren
AI4CE
31
1
0
11 Jul 2023
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning
E. Hurwitz
N. Peace
G. Cevora
OffRL
4
0
0
03 Jul 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback?
Akansha Kalra
Daniel S. Brown
16
0
0
22 Jun 2023
Designing Equilibria in Concurrent Games with Social Welfare and Temporal Logic Constraints
Julian Gutierrez
Muhammad Najib
Giuseppe Perelli
Michael Wooldridge
9
0
0
05 Jun 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning
Cevahir Köprülü
Ufuk Topcu
10
3
0
25 May 2023
Latest Trends in Artificial Intelligence Technology: A Scoping Review
Teemu Niskanen
T. Sipola
Olli Väänänen
17
1
0
08 May 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees
Daqian Shao
M. Kwiatkowska
OffRL
21
6
0
02 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
14
3
0
30 Apr 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Chao Yu
Xuejing Zheng
H. Zhuo
OffRL
LRM
55
7
0
24 Apr 2023
Learning Reward Machines in Cooperative Multi-Agent Tasks
Leo Ardon
Daniel Furelos-Blanco
A. Russo
15
5
0
24 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP
A. Falah
Shibashis Guha
Ashutosh Trivedi
9
0
0
16 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
Benedict Quartey
Ankit Shah
G. Konidaris
25
3
0
09 Mar 2023
Eventual Discounting Temporal Logic Counterfactual Experience Replay
Cameron Voloshin
Abhinav Verma
Yisong Yue
OffRL
24
11
0
03 Mar 2023
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications
Zikang Xiong
Daniel Lawson
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
11
0
0
02 Mar 2023
Grounding Complex Natural Language Commands for Temporal Tasks in Unseen Environments
Jason Liu
Ziyi Yang
Ifrah Idrees
Sam Liang
Benjamin Schornstein
Stefanie Tellex
Ankit Parag Shah
LM&Ro
22
40
0
22 Feb 2023
Generalizing LTL Instructions via Future Dependent Options
Duo Xu
Faramarz Fekri
OffRL
AI4CE
24
1
0
08 Dec 2022
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines
Andrew C. Li
Zizhao Chen
Pashootan Vaezipoor
Toryn Q. Klassen
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRL
12
9
0
20 Nov 2022
Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences
L. Guan
Karthik Valmeekam
Subbarao Kambhampati
51
8
0
28 Oct 2022
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion
Utkarsh Soni
Nupur Thakur
S. Sreedharan
L. Guan
Mudit Verma
Matthew Marquez
Subbarao Kambhampati
29
6
0
27 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications
Mingyu Cai
Makai Mann
Zachary Serlin
Kevin J. Leahy
C. Vasile
32
12
0
03 Oct 2022
Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning
Haotong Zhang
Hao Wang
Zheng Kan
OffRL
21
12
0
27 Sep 2022
Reward Learning using Structural Motifs in Inverse Reinforcement Learning
Raeid Saqur
18
2
0
25 Sep 2022
Learning Task Automata for Reinforcement Learning using Hidden Markov Models
Alessandro Abate
Y. Almulla
James Fox
David Hyland
Michael Wooldridge
OffRL
20
5
0
25 Aug 2022
Calculus on MDPs: Potential Shaping as a Gradient
Erik Jenner
H. V. Hoof
Adam Gleave
19
4
0
20 Aug 2022
Scaling up ML-based Black-box Planning with Partial STRIPS Models
M. Greco
Álvaro Torralba
Jorge A. Baier
Héctor Palacios
OffRL
13
0
0
10 Jul 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems
Joe Eappen
Suresh Jagannathan
12
3
0
28 Jun 2022
Recursive Reinforcement Learning
E. M. Hahn
Mateo Perez
S. Schewe
F. Somenzi
Ashutosh Trivedi
D. Wojtczak
LRM
14
0
0
23 Jun 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
21
49
0
19 Jun 2022
Specification-Guided Learning of Nash Equilibria with High Social Welfare
Kishor Jothimurugan
Suguman Bansal
Osbert Bastani
Rajeev Alur
31
9
0
06 Jun 2022
Hierarchies of Reward Machines
Daniel Furelos-Blanco
Mark Law
Anders Jonsson
Krysia Broda
A. Russo
19
8
0
31 May 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning
Geraud Nangue Tasse
Devon Jarvis
Steven D. James
Benjamin Rosman
36
4
0
25 May 2022
Moral reinforcement learning using actual causation
Tue Herlau
11
0
0
17 May 2022
Accelerated Reinforcement Learning for Temporal Logic Control Objectives
Y. Kantaros
11
11
0
09 May 2022
A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines
Weichao Zhou
Wenchao Li
BDL
20
11
0
20 Apr 2022
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics
Christos K. Verginis
Cevahir Köprülü
Sandeep P. Chinchali
Ufuk Topcu
25
10
0
20 Apr 2022
A Framework for Following Temporal Logic Instructions with Unknown Causal Dependencies
Duo Xu
Faramarz Fekri
25
2
0
07 Apr 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances
Robby Costales
Shariq Iqbal
Fei Sha
18
5
0
23 Mar 2022
Hierarchical Reinforcement Learning with AI Planning Models
Junkyu Lee
Michael Katz
Don Joven Agravante
Miao Liu
Geraud Nangue Tasse
Tim Klinger
Shirin Sohrabi
18
1
0
01 Mar 2022
Knowledge-Integrated Informed AI for National Security
Anu Myne
Kevin J. Leahy
Ryan Soklaski
19
0
0
04 Feb 2022
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications
Mingyu Cai
Erfan Aasi
C. Belta
C. Vasile
27
24
0
28 Jan 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning
Rodrigo Toro Icarte
Ethan Waldie
Toryn Q. Klassen
Richard Valenzano
Margarita P. Castro
Sheila A. McIlraith
11
13
0
17 Dec 2021
Programmatic Reward Design by Example
Weichao Zhou
Wenchao Li
34
15
0
14 Dec 2021
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines
Xuejing Zheng
Chao Yu
C. L. P. Chen
Jianye Hao
H. Zhuo
CLL
OffRL
17
9
0
18 Nov 2021
Previous
1
2
3
Next