Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.04472
Cited By
Deep Bayesian Reward Learning from Preferences
10 December 2019
Daniel S. Brown
S. Niekum
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Bayesian Reward Learning from Preferences"
14 / 14 papers shown
Title
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Xueru Wen
Jie Lou
Yunfan LU
Hongyu Lin
Xing Yu
Xinyu Lu
Xianpei Han
Xianpei Han
Debing Zhang
Le Sun
ALM
61
5
0
17 Feb 2025
Pareto-Optimal Learning from Preferences with Hidden Context
Ryan Boldi
Li Ding
Lee Spector
S. Niekum
70
6
0
21 Jun 2024
IBCB: Efficient Inverse Batched Contextual Bandit for Behavioral Evolution History
Yi Xu
Weiran Shen
Xiao Zhang
Jun Xu
OffRL
44
0
0
24 Mar 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry V. Dylov
OffRL
51
3
0
20 Feb 2024
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Using Features at Multiple Temporal and Spatial Resolutions to Predict Human Behavior in Real Time
Li Zhang
Justin Lieffers
A. Pyarelal
30
1
0
12 Nov 2022
Learning from Imperfect Demonstrations via Adversarial Confidence Transfer
Zhangjie Cao
Zihan Wang
Dorsa Sadigh
AAML
32
7
0
07 Feb 2022
Modeling human intention inference in continuous 3D domains by inverse planning and body kinematics
Yingdong Qian
Marta Kryven
Tao Gao
Hanbyul Joo
J. Tenenbaum
25
1
0
02 Dec 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
40
93
0
04 Nov 2021
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Preference-based Learning of Reward Function Features
Sydney M. Katz
Amir Maleki
Erdem Biyik
Mykel J. Kochenderfer
33
11
0
03 Mar 2021
Scalable Bayesian Inverse Reinforcement Learning
Alex J. Chan
M. Schaar
OffRL
BDL
21
66
0
12 Feb 2021
Bayesian Robust Optimization for Imitation Learning
Daniel S. Brown
S. Niekum
Marek Petrik
27
32
0
24 Jul 2020
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
312
0
06 Jun 2015
1