2109.12750
Learning Multimodal Rewards from Rankings
27 September 2021
Vivek Myers
Erdem Biyik
Nima Anari
Dorsa Sadigh
OffRL
Papers citing "Learning Multimodal Rewards from Rankings"
34 / 34 papers shown
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
Le Qiu
Zelai Xu
Qixin Tan
Wenhao Tang
Chao-Hua Yu
Yu Wang
AAML
38
0
0
24 Mar 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
37
2
0
17 Jan 2025
Contrastive Learning from Exploratory Actions: Leveraging Natural Interactions for Preference Elicitation
N. Dennler
Stefanos Nikolaidis
Maja J. Matarić
141
0
0
03 Jan 2025
Improving User Experience in Preference-Based Optimization of Reward Functions for Assistive Robots
N. Dennler
Zhonghao Shi
Stefanos Nikolaidis
Maja J. Matarić
120
0
0
17 Nov 2024
Trajectory Improvement and Reward Learning from Comparative Language Feedback
Zhaojing Yang
Miru Jun
J. Tien
Stuart J. Russell
Anca Dragan
Erdem Bıyık
34
6
0
08 Oct 2024
Representation Alignment from Human Feedback for Cross-Embodiment Reward Learning from Mixed-Quality Demonstrations
Connor Mattson
Anurag Aribandi
Daniel S. Brown
38
0
0
10 Aug 2024
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
Heewoong Choi
Sangwon Jung
Hongjoon Ahn
Taesup Moon
OffRL
39
2
0
08 Aug 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
48
3
0
22 Jul 2024
Safe MPC Alignment with Human Directional Feedback
Zhixian Xie
Wenlong Zhang
Yi Ren
Zhaoran Wang
George J. Pappas
Wanxin Jin
47
0
0
05 Jul 2024
Teaching Language Models to Self-Improve by Learning from Language Feedback
Chi Hu
Yimin Hu
Hang Cao
Tong Xiao
Jingbo Zhu
LRM
VLM
35
4
0
11 Jun 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
43
2
0
30 Apr 2024
A Generalized Acquisition Function for Preference-based Reward Learning
Evan Ellis
Gaurav R. Ghosal
Stuart J. Russell
Anca Dragan
Erdem Biyik
34
1
0
09 Mar 2024
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang
Zhanyi Sun
Jesse Zhang
Zhou Xian
Erdem Biyik
David Held
Zackory M. Erickson
VLM
55
50
0
06 Feb 2024
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Minyoung Hwang
Luca Weihs
Chanwoo Park
Kimin Lee
Aniruddha Kembhavi
Kiana Ehsani
29
18
0
14 Dec 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
S. Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
24
66
0
11 Oct 2023
REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
Zheyuan Hu
Aaron Rovinsky
Jianlan Luo
Vikash Kumar
Abhishek Gupta
Sergey Levine
OffRL
22
9
0
06 Sep 2023
Active Inverse Learning in Stackelberg Trajectory Games
Yue Yu
Jacob Levy
Negar Mehr
David Fridovich-Keil
Ufuk Topcu
21
2
0
15 Aug 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
47
472
0
27 Jul 2023
Boosting Feedback Efficiency of Interactive Reinforcement Learning by Adaptive Learning from Scores
Shukai Liu
Chenming Wu
Ying Li
Liang Zhang
31
0
0
11 Jul 2023
Inverse Preference Learning: Preference-based RL without a Reward Function
Joey Hejna
Dorsa Sadigh
OffRL
24
48
0
24 May 2023
Diffusion Co-Policy for Synergistic Human-Robot Collaborative Tasks
Eley Ng
Ziang Liu
Monroe Kennedy
DiffM
26
22
0
20 May 2023
Curriculum-Based Imitation of Versatile Skills
M. Li
Onur Celik
P. Becker
Denis Blessing
Rudolf Lioutikov
Gerhard Neumann
24
4
0
11 Apr 2023
Active Reward Learning from Online Preferences
Vivek Myers
Erdem Biyik
Dorsa Sadigh
OffRL
29
12
0
27 Feb 2023
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
16
19
0
19 Dec 2022
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
24
91
0
06 Dec 2022
Eliciting Compatible Demonstrations for Multi-Human Imitation Learning
Kanishk Gandhi
Siddharth Karamcheti
Madeline Liao
Dorsa Sadigh
44
22
0
14 Oct 2022
Probabilistic Conformal Prediction Using Conditional Random Samples
Zhendong Wang
Ruijiang Gao
Mingzhang Yin
Mingyuan Zhou
David M. Blei
TPM
39
22
0
14 Jun 2022
Learning from Imperfect Demonstrations via Adversarial Confidence Transfer
Zhangjie Cao
Zihan Wang
Dorsa Sadigh
AAML
26
7
0
07 Feb 2022
MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning
M. Peschl
Arkady Zgonnikov
F. Oliehoek
Luciano Cavalcante Siebert
12
26
0
30 Dec 2021
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
44
33
0
01 Oct 2021
APReL: A Library for Active Preference-based Reward Learning Algorithms
Erdem Biyik
Aditi Talati
Dorsa Sadigh
20
35
0
16 Aug 2021
Active Preference-Based Gaussian Process Regression for Reward Learning
Erdem Biyik
Nicolas Huynh
Mykel J. Kochenderfer
Dorsa Sadigh
GP
25
103
0
06 May 2020
Preference-Based Learning for Exoskeleton Gait Optimization
Maegan Tucker
Ellen R. Novoseller
Claudia K. Kann
Yanan Sui
Yisong Yue
J. W. Burdick
Aaron D. Ames
74
91
0
26 Sep 2019
Early Detection of Combustion Instabilities using Deep Convolutional Selective Autoencoders on Hi-speed Flame Video
Chandrayee Basu
Qian Yang
M. Singhal
Anca Dragan
51
174
0
25 Mar 2016