ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.00120
  4. Cited By
Reward Machines for Deep RL in Noisy and Uncertain Environments
v1v2v3v4 (latest)

Reward Machines for Deep RL in Noisy and Uncertain Environments

31 May 2024
Andrew C. Li
Zizhao Chen
Toryn Q. Klassen
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
ArXiv (abs)PDFHTML

Papers citing "Reward Machines for Deep RL in Noisy and Uncertain Environments"

44 / 44 papers shown
Title
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
Mathias Jackermeier
Alessandro Abate
OffRL
153
2
0
06 Oct 2024
LTL-Constrained Policy Optimization with Cycle Experience Replay
LTL-Constrained Policy Optimization with Cycle Experience Replay
Ameesh Shah
Cameron Voloshin
Chenxi Yang
Abhinav Verma
Swarat Chaudhuri
Sanjit A. Seshia
145
1
0
17 Apr 2024
Assessing the Robustness of Intelligence-Driven Reinforcement Learning
Assessing the Robustness of Intelligence-Driven Reinforcement Learning
Lorenzo Nodari
Federico Cerutti
64
1
0
15 Nov 2023
Reinforcement Learning of Action and Query Policies with LTL
  Instructions under Uncertain Event Detector
Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector
Wataru Hatanaka
R. Yamashina
Takamitsu Matsubara
114
5
0
06 Sep 2023
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning
  Environments for Goal-Oriented Tasks
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Maxime Chevalier-Boisvert
Bolun Dai
Mark Towers
Rodrigo de Lazcano
Lucas Willems
Salem Lahlou
Suman Pal
Pablo Samuel Castro
Jordan Terry
VGen
126
212
0
24 Jun 2023
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Shalev Lifshitz
Keiran Paster
Harris Chan
Jimmy Ba
Sheila A. McIlraith
LM&Ro
139
76
0
01 Jun 2023
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
Chao Yu
Xuejing Zheng
H. Zhuo
OffRLLRM
134
8
0
24 Apr 2023
Vision-Language Models as Success Detectors
Vision-Language Models as Success Detectors
Yuqing Du
Ksenia Konyushkova
Misha Denil
A. Raju
Jessica Landon
Felix Hill
Nando de Freitas
Serkan Cabi
MLLMLRM
130
86
0
13 Mar 2023
Learning to Follow Instructions in Text-Based Games
Learning to Follow Instructions in Text-Based Games
Mathieu Tuli
Andrew C. Li
Pashootan Vaezipoor
Toryn Q. Klassen
Scott Sanner
Sheila A. McIlraith
79
13
0
08 Nov 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
89
11
0
11 Oct 2022
Exploiting Transformer in Sparse Reward Reinforcement Learning for
  Interpretable Temporal Logic Motion Planning
Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning
Haotong Zhang
Hao Wang
Zheng Kan
OffRL
87
12
0
27 Sep 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
159
304
0
23 Jun 2022
Hierarchies of Reward Machines
Hierarchies of Reward Machines
Daniel Furelos-Blanco
Mark Law
Anders Jonsson
Krysia Broda
A. Russo
78
9
0
31 May 2022
Joint Learning of Reward Machines and Policies in Environments with
  Partially Known Semantics
Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics
Christos K. Verginis
Cevahir Köprülü
Sandeep Chinchali
Ufuk Topcu
78
11
0
20 Apr 2022
Learning Reward Machines: A Study in Partially Observable Reinforcement
  Learning
Learning Reward Machines: A Study in Partially Observable Reinforcement Learning
Rodrigo Toro Icarte
Ethan Waldie
Toryn Q. Klassen
Richard Valenzano
Margarita P. Castro
Sheila A. McIlraith
30
14
0
17 Dec 2021
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward
  Machines
Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines
Xuejing Zheng
Chao Yu
Chong Chen
Jianye Hao
H. Zhuo
CLLOffRL
69
9
0
18 Nov 2021
In a Nutshell, the Human Asked for This: Latent Goals for Following
  Temporal Specifications
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications
Borja G. Leon
Murray Shanahan
Francesco Belardinelli
AI4CE
98
16
0
18 Oct 2021
Compositional Reinforcement Learning from Logical Specifications
Compositional Reinforcement Learning from Logical Specifications
Kishor Jothimurugan
Suguman Bansal
Osbert Bastani
Rajeev Alur
CoGe
135
81
0
25 Jun 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.1K
30,053
0
26 Feb 2021
LTL2Action: Generalizing LTL Instructions for Multi-Task RL
LTL2Action: Generalizing LTL Instructions for Multi-Task RL
Pashootan Vaezipoor
Andrew C. Li
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRLAI4CE
127
77
0
13 Feb 2021
Reinforcement Learning Based Temporal Logic Control with Maximum
  Probabilistic Satisfaction
Reinforcement Learning Based Temporal Logic Control with Maximum Probabilistic Satisfaction
Mingyu Cai
Shaoping Xiao
Baoluo Li
Zhiliang Li
Z. Kan
86
35
0
14 Oct 2020
Reward Machines: Exploiting Reward Function Structure in Reinforcement
  Learning
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
Rodrigo Toro Icarte
Toryn Q. Klassen
Richard Valenzano
Sheila A. McIlraith
OffRL
145
222
0
06 Oct 2020
A Composable Specification Language for Reinforcement Learning Tasks
A Composable Specification Language for Reinforcement Learning Tasks
Kishor Jothimurugan
Rajeev Alur
Osbert Bastani
85
88
0
21 Aug 2020
Reward Machines for Cooperative Multi-Agent Reinforcement Learning
Reward Machines for Cooperative Multi-Agent Reinforcement Learning
Cyrus Neary
Zhe Xu
Bo Wu
Ufuk Topcu
83
47
0
03 Jul 2020
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement
  Learning Tasks
Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks
Yuqian Jiang
Suda Bharadwaj
Bo Wu
Rishi Shah
Ufuk Topcu
Peter Stone
CLLOffRLLRM
58
43
0
03 Jul 2020
Active Finite Reward Automaton Inference and Reinforcement Learning
  Using Queries and Counterexamples
Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples
Zhe Xu
Bo Wu
Aditya Ojha
Daniel Neider
Ufuk Topcu
OffRL
89
30
0
28 Jun 2020
Encoding formulas as deep networks: Reinforcement learning for zero-shot
  execution of LTL formulas
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulas
Yen-Ling Kuo
Boris Katz
Andrei Barbu
78
41
0
01 Jun 2020
Learning Non-Markovian Reward Models in MDPs
Learning Non-Markovian Reward Models in MDPs
Gavin Rens
Jean-François Raskin
44
12
0
25 Jan 2020
Induction of Subgoal Automata for Reinforcement Learning
Induction of Subgoal Automata for Reinforcement Learning
Daniel Furelos-Blanco
Mark Law
A. Russo
Krysia Broda
Anders Jonsson
87
33
0
29 Nov 2019
Modular Deep Reinforcement Learning with Temporal Logic Specifications
Modular Deep Reinforcement Learning with Temporal Logic Specifications
Li-xin Yuan
Mohammadhosein Hasanbeig
Alessandro Abate
Daniel Kroening
OffRL
62
41
0
23 Sep 2019
Joint Inference of Reward Machines and Policies for Reinforcement
  Learning
Joint Inference of Reward Machines and Policies for Reinforcement Learning
Zhe Xu
I. Gavran
Yousef Ahmad
R. Majumdar
Daniel Neider
Ufuk Topcu
Bo Wu
84
94
0
12 Sep 2019
Transfer of Temporal Logic Formulas in Reinforcement Learning
Transfer of Temporal Logic Formulas in Reinforcement Learning
Zhe Xu
Ufuk Topcu
69
52
0
10 Sep 2019
Planning With Uncertain Specifications (PUnS)
Planning With Uncertain Specifications (PUnS)
Ankit J. Shah
Shen Li
J. Shah
81
25
0
07 Jun 2019
Reinforcement Learning with Probabilistic Guarantees for Autonomous
  Driving
Reinforcement Learning with Probabilistic Guarantees for Autonomous Driving
Maxime Bouton
J. Karlsson
A. Nakhaei
K. Fujimura
Mykel J. Kochenderfer
Jana Tumova
79
69
0
15 Apr 2019
Using Natural Language for Reward Shaping in Reinforcement Learning
Using Natural Language for Reward Shaping in Reinforcement Learning
Prasoon Goyal
S. Niekum
Raymond J. Mooney
LM&Ro
108
183
0
05 Mar 2019
Logically-Constrained Reinforcement Learning
Logically-Constrained Reinforcement Learning
Mohammadhosein Hasanbeig
Alessandro Abate
Daniel Kroening
95
83
0
24 Jan 2018
A Policy Search Method For Temporal Logic Specified Reinforcement
  Learning Tasks
A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks
Xiao Li
Yao Ma
C. Belta
75
59
0
27 Sep 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
690
19,343
0
20 Jul 2017
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement
  Learning
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
Junhyuk Oh
Satinder Singh
Honglak Lee
Pushmeet Kohli
OffRL
164
269
0
15 Jun 2017
Environment-Independent Task Specifications via GLTL
Environment-Independent Task Specifications via GLTL
Michael L. Littman
Ufuk Topcu
Jie Fu
Charles Isbell
Min Wen
J. MacGlashan
93
146
0
14 Apr 2017
Reinforcement Learning With Temporal Logic Rewards
Reinforcement Learning With Temporal Logic Rewards
Xiao Li
C. Vasile
C. Belta
91
219
0
11 Dec 2016
Modular Multitask Reinforcement Learning with Policy Sketches
Modular Multitask Reinforcement Learning with Policy Sketches
Jacob Andreas
Dan Klein
Sergey Levine
OffRL
215
463
0
06 Nov 2016
Deep Recurrent Q-Learning for Partially Observable MDPs
Deep Recurrent Q-Learning for Partially Observable MDPs
Matthew J. Hausknecht
Peter Stone
134
1,686
0
23 Jul 2015
LTL Control in Uncertain Environments with Probabilistic Satisfaction
  Guarantees
LTL Control in Uncertain Environments with Probabilistic Satisfaction Guarantees
X. Ding
Stephen L. Smith
C. Belta
Daniela Rus
94
90
0
06 Apr 2011
1