Symbol Guided Hindsight Priors for Reward Learning from Human Preferences

17 October 2022

Papers citing "Symbol Guided Hindsight Priors for Reward Learning from Human Preferences"

7 / 7 papers shown

Title
Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion? Mudit Verma Siddhant Bhambri Subbarao Kambhampati 33 21 0 10 Jan 2024
Preference Transformer: Modeling Human Preferences using Transformers for RL Changyeon Kim Jongjin Park Jinwoo Shin Honglak Lee Pieter Abbeel Kimin Lee OffRL 30 61 0 02 Mar 2023
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments Tung Thai Mingyu Shen M. Garg Ayush Kalani Nakul Vaidya ... Neeraj Varshney Chitta Baral Subbarao Kambhampati Jivko Sinapov matthias. scheutz 34 0 0 28 Feb 2023
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning Mudit Verma Siddhant Bhambri Subbarao Kambhampati 37 4 0 17 Feb 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences Mudit Verma Subbarao Kambhampati 30 2 0 17 Feb 2023
Data Driven Reward Initialization for Preference based Reinforcement Learning Mudit Verma Subbarao Kambhampati 32 1 0 17 Feb 2023
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations S. Sreedharan Utkarsh Soni Mudit Verma Siddharth Srivastava S. Kambhampati 73 30 0 04 Feb 2020