Deep Bayesian Reward Learning from Preferences

10 December 2019
Daniel S. Brown
S. Niekum
    BDL
arXiv:1912.04472

Papers citing "Deep Bayesian Reward Learning from Preferences"

14 / 14 papers shown
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Xueru Wen
Jie Lou
Yunfan LU
Hongyu Lin
Xing Yu
Xinyu Lu
Xianpei Han
Debing Zhang
Le Sun
ALM
61
5
0
17 Feb 2025
Pareto-Optimal Learning from Preferences with Hidden Context
Ryan Boldi
Li Ding
Lee Spector
S. Niekum
70
6
0
21 Jun 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
44
0
0
24 Mar 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry V. Dylov
OffRL
51
3
0
20 Feb 2024
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Using Features at Multiple Temporal and Spatial Resolutions to Predict Human Behavior in Real Time
Li Zhang
Justin Lieffers
A. Pyarelal
30
1
0
12 Nov 2022
Learning from Imperfect Demonstrations via Adversarial Confidence Transfer
Zhangjie Cao
Zihan Wang
Dorsa Sadigh
AAML
32
7
0
07 Feb 2022
Modeling human intention inference in continuous 3D domains by inverse planning and body kinematics
Yingdong Qian
Marta Kryven
Tao Gao
Hanbyul Joo
J. Tenenbaum
25
1
0
02 Dec 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Preference-based Learning of Reward Function Features
Sydney M. Katz
Amir Maleki
Erdem Biyik
Mykel J. Kochenderfer
33
11
0
03 Mar 2021
Scalable Bayesian Inverse Reinforcement Learning
Alex J. Chan
M. Schaar
OffRL
BDL
21
66
0
12 Feb 2021
Bayesian Robust Optimization for Imitation Learning
Daniel S. Brown
S. Niekum
Marek Petrik
27
32
0
24 Jul 2020
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
312
0
06 Jun 2015