Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

6 October 2020

Papers citing "Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning"

50 / 111 papers shown

Signal Temporal Logic-Guided Apprenticeship Learning Aniruddh Gopinath Puranic Jyotirmoy V. Deshmukh S. Nikolaidis 38 1 0 09 Nov 2023
Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning Suraj Singireddy Andre Beckus George K. Atia Sumit Kumar Jha Alvaro Velasquez OffRL 22 0 0 29 Oct 2023
Behavior Alignment via Reward Function Optimization Dhawal Gupta Yash Chandak Scott M. Jordan Philip S. Thomas Bruno Castro da Silva 15 10 0 29 Oct 2023
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents Yash Shukla Wenchang Gao Vasanth Sarathy Alvaro Velasquez Robert Wright Jivko Sinapov 22 9 0 14 Oct 2023
Verifiable Reinforcement Learning Systems via Compositionality Cyrus Neary Aryaman Singh Samyal Christos K. Verginis Murat Cubuktepe Ufuk Topcu OffRL CoGe 17 0 0 09 Sep 2023
Omega-Regular Reward Machines E. M. Hahn Mateo Perez S. Schewe F. Somenzi Ashutosh Trivedi D. Wojtczak OffRL 11 1 0 14 Aug 2023
On the Convergence of Bounded Agents David Abel André Barreto Hado van Hasselt Benjamin Van Roy Doina Precup Satinder Singh 20 4 0 20 Jul 2023
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning Guy Azran Mohamad H. Danesh Stefano V. Albrecht Sarah Keren AI4CE 31 1 0 11 Jul 2023
Achieving Stable Training of Reinforcement Learning Agents in Bimodal Environments through Batch Learning E. Hurwitz N. Peace G. Cevora OffRL 4 0 0 03 Jul 2023
Can Differentiable Decision Trees Enable Interpretable Reward Learning from Human Feedback? Akansha Kalra Daniel S. Brown 16 0 0 22 Jun 2023
Designing Equilibria in Concurrent Games with Social Welfare and Temporal Logic Constraints Julian Gutierrez Muhammad Najib Giuseppe Perelli Michael Wooldridge 9 0 0 05 Jun 2023
Reward-Machine-Guided, Self-Paced Reinforcement Learning Cevahir Köprülü Ufuk Topcu 10 3 0 25 May 2023
Latest Trends in Artificial Intelligence Technology: A Scoping Review Teemu Niskanen T. Sipola Olli Väänänen 17 1 0 08 May 2023
Sample Efficient Model-free Reinforcement Learning from LTL Specifications with Optimality Guarantees Daqian Shao M. Kwiatkowska OffRL 21 6 0 02 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward Zihan Zhou Animesh Garg OffRL 14 3 0 30 Apr 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey Chao Yu Xuejing Zheng H. Zhuo OffRL LRM 55 7 0 24 Apr 2023
Learning Reward Machines in Cooperative Multi-Agent Tasks Leo Ardon Daniel Furelos-Blanco A. Russo 15 5 0 24 Mar 2023
Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP A. Falah Shibashis Guha Ashutosh Trivedi 9 0 0 16 Mar 2023
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks Benedict Quartey Ankit Shah G. Konidaris 25 3 0 09 Mar 2023
Eventual Discounting Temporal Logic Counterfactual Experience Replay Cameron Voloshin Abhinav Verma Yisong Yue OffRL 24 11 0 03 Mar 2023
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications Zikang Xiong Daniel Lawson Joe Eappen A. H. Qureshi Suresh Jagannathan 11 0 0 02 Mar 2023
Grounding Complex Natural Language Commands for Temporal Tasks in Unseen Environments Jason Liu Ziyi Yang Ifrah Idrees Sam Liang Benjamin Schornstein Stefanie Tellex Ankit Parag Shah LM&Ro 22 40 0 22 Feb 2023
Generalizing LTL Instructions via Future Dependent Options Duo Xu Faramarz Fekri OffRL AI4CE 24 1 0 08 Dec 2022
Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines Andrew C. Li Zizhao Chen Pashootan Vaezipoor Toryn Q. Klassen Rodrigo Toro Icarte Sheila A. McIlraith OffRL 12 9 0 20 Nov 2022
Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences L. Guan Karthik Valmeekam Subbarao Kambhampati 51 8 0 28 Oct 2022
Towards customizable reinforcement learning agents: Enabling preference specification through online vocabulary expansion Utkarsh Soni Nupur Thakur S. Sreedharan L. Guan Mudit Verma Matthew Marquez Subbarao Kambhampati 29 6 0 27 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications Mingyu Cai Makai Mann Zachary Serlin Kevin J. Leahy C. Vasile 32 12 0 03 Oct 2022
Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning Haotong Zhang Hao Wang Zheng Kan OffRL 21 12 0 27 Sep 2022
Reward Learning using Structural Motifs in Inverse Reinforcement Learning Raeid Saqur 18 2 0 25 Sep 2022
Learning Task Automata for Reinforcement Learning using Hidden Markov Models Alessandro Abate Y. Almulla James Fox David Hyland Michael Wooldridge OffRL 20 5 0 25 Aug 2022
Calculus on MDPs: Potential Shaping as a Gradient Erik Jenner H. V. Hoof Adam Gleave 19 4 0 20 Aug 2022
Scaling up ML-based Black-box Planning with Partial STRIPS Models M. Greco Álvaro Torralba Jorge A. Baier Héctor Palacios OffRL 13 0 0 10 Jul 2022
DistSPECTRL: Distributing Specifications in Multi-Agent Reinforcement Learning Systems Joe Eappen Suresh Jagannathan 12 3 0 28 Jun 2022
Recursive Reinforcement Learning E. M. Hahn Mateo Perez S. Schewe F. Somenzi Ashutosh Trivedi D. Wojtczak LRM 14 0 0 23 Jun 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation Christoph Dann Yishay Mansour M. Mohri Ayush Sekhari Karthik Sridharan 21 49 0 19 Jun 2022
Specification-Guided Learning of Nash Equilibria with High Social Welfare Kishor Jothimurugan Suguman Bansal Osbert Bastani Rajeev Alur 31 9 0 06 Jun 2022
Hierarchies of Reward Machines Daniel Furelos-Blanco Mark Law Anders Jonsson Krysia Broda A. Russo 19 8 0 31 May 2022
Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning Geraud Nangue Tasse Devon Jarvis Steven D. James Benjamin Rosman 36 4 0 25 May 2022
Moral reinforcement learning using actual causation Tue Herlau 11 0 0 17 May 2022
Accelerated Reinforcement Learning for Temporal Logic Control Objectives Y. Kantaros 11 11 0 09 May 2022
A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines Weichao Zhou Wenchao Li BDL 20 11 0 20 Apr 2022
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics Christos K. Verginis Cevahir Köprülü Sandeep P. Chinchali Ufuk Topcu 25 10 0 20 Apr 2022
A Framework for Following Temporal Logic Instructions with Unknown Causal Dependencies Duo Xu Faramarz Fekri 25 2 0 07 Apr 2022
Possibility Before Utility: Learning And Using Hierarchical Affordances Robby Costales Shariq Iqbal Fei Sha 18 5 0 23 Mar 2022
Hierarchical Reinforcement Learning with AI Planning Models Junkyu Lee Michael Katz Don Joven Agravante Miao Liu Geraud Nangue Tasse Tim Klinger Shirin Sohrabi 18 1 0 01 Mar 2022
Knowledge-Integrated Informed AI for National Security Anu Myne Kevin J. Leahy Ryan Soklaski 19 0 0 04 Feb 2022
Overcoming Exploration: Deep Reinforcement Learning for Continuous Control in Cluttered Environments from Temporal Logic Specifications Mingyu Cai Erfan Aasi C. Belta C. Vasile 27 24 0 28 Jan 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning Rodrigo Toro Icarte Ethan Waldie Toryn Q. Klassen Richard Valenzano Margarita P. Castro Sheila A. McIlraith 11 13 0 17 Dec 2021
Programmatic Reward Design by Example Weichao Zhou Wenchao Li 34 15 0 14 Dec 2021
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines Xuejing Zheng Chao Yu C. L. P. Chen Jianye Hao H. Zhuo CLL OffRL 17 9 0 18 Nov 2021