ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.09151
  4. Cited By
Symbol Guided Hindsight Priors for Reward Learning from Human
  Preferences

Symbol Guided Hindsight Priors for Reward Learning from Human Preferences

17 October 2022
Mudit Verma
Katherine Metcalf
ArXivPDFHTML

Papers citing "Symbol Guided Hindsight Priors for Reward Learning from Human Preferences"

7 / 7 papers shown
Title
Theory of Mind abilities of Large Language Models in Human-Robot
  Interaction : An Illusion?
Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion?
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
33
21
0
10 Jan 2024
Preference Transformer: Modeling Human Preferences using Transformers
  for RL
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
30
61
0
02 Mar 2023
Methods and Mechanisms for Interactive Novelty Handling in Adversarial
  Environments
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai
Mingyu Shen
M. Garg
Ayush Kalani
Nakul Vaidya
...
Neeraj Varshney
Chitta Baral
Subbarao Kambhampati
Jivko Sinapov
matthias. scheutz
34
0
0
28 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based
  Reinforcement Learning
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning
Mudit Verma
Siddhant Bhambri
Subbarao Kambhampati
37
4
0
17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human
  Preferences
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
30
2
0
17 Feb 2023
Data Driven Reward Initialization for Preference based Reinforcement
  Learning
Data Driven Reward Initialization for Preference based Reinforcement Learning
Mudit Verma
Subbarao Kambhampati
32
1
0
17 Feb 2023
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for
  Sequential Decision-Making Problems with Inscrutable Representations
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations
S. Sreedharan
Utkarsh Soni
Mudit Verma
Siddharth Srivastava
S. Kambhampati
73
30
0
04 Feb 2020
1