Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.09151
Cited By
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences
17 October 2022
Mudit Verma
Katherine Metcalf
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Symbol Guided Hindsight Priors for Reward Learning from Human Preferences"
7 / 7 papers shown
Title
Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion?
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
33
21
0
10 Jan 2024
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
30
61
0
02 Mar 2023
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai
Mingyu Shen
M. Garg
Ayush Kalani
Nakul Vaidya
...
Neeraj Varshney
Chitta Baral
Subbarao Kambhampati
Jivko Sinapov
matthias. scheutz
34
0
0
28 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
37
4
0
17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
30
2
0
17 Feb 2023
Data Driven Reward Initialization for Preference based Reinforcement Learning
Mudit Verma
Subbarao Kambhampati
32
1
0
17 Feb 2023
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations
S. Sreedharan
Utkarsh Soni
Mudit Verma
Siddharth Srivastava
S. Kambhampati
73
30
0
04 Feb 2020
1