2403.06267
Cited By
FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
10 March 2024
Hanfang Lyu
Yuanchen Bai
Xin Liang
Ujaan Das
Chuhan Shi
Leiliang Gong
Yingchi Li
Mingfei Sun
Ming Ge
Xiaojuan Ma
Papers citing "FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation"
20 / 20 papers shown
Title
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
110
513
0
27 Jul 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
76
59
0
29 May 2023
Perspectives on the Social Impacts of Reinforcement Learning with Human Feedback
Gabrielle K. Liu
OffRL
90
21
0
06 Mar 2023
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons
Banghua Zhu
Jiantao Jiao
Michael I. Jordan
OffRL
77
203
0
26 Jan 2023
Benchmarks and Algorithms for Offline Preference-Based Reward Learning
Daniel Shin
Anca Dragan
Daniel S. Brown
OffRL
58
56
0
03 Jan 2023
A User Interface for Sense-making of the Reasoning Process while Interacting with Robots
Chao Wang
Joerg Deigmoeller
86
3
0
15 Oct 2022
MyMove: Facilitating Older Adults to Collect In-Situ Activity Labels on a Smartwatch with Speech
Young-Ho Kim
Diana Chou
Bongshin Lee
M. Danilovich
Amanda Lazar
D. Conroy
Hernisa Kacorri
E. Choe
37
26
0
01 Apr 2022
OneLabeler: A Flexible System for Building Data Labeling Tools
Yu Zhang
Yun Wang
Haidong Zhang
Bin Zhu
Si Chen
Dongmei Zhang
127
31
0
27 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
806
12,893
0
04 Mar 2022
Learning Reward Functions from Scale Feedback
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
67
33
0
01 Oct 2021
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
284
503
0
06 Aug 2021
Here's What I've Learned: Asking Questions that Reveal Reward Learning
Soheil Habibian
Ananth Jonnavittula
Dylan P. Losey
30
21
0
02 Jul 2021
Annotation Curricula to Implicitly Train Non-Expert Annotators
Ji-Ung Lee
Jan-Christoph Klie
Iryna Gurevych
26
11
0
04 Jun 2021
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach
Huixin Zhan
Feng Tao
Yongcan Cao
58
26
0
15 Oct 2020
Feature Expansive Reward Learning: Rethinking Human Input
Andreea Bobu
Marius Wiggert
Claire Tomlin
Anca Dragan
133
45
0
23 Jun 2020
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning
Erdem Biyik
Malayandi Palan
Nicholas C. Landolfi
Dylan P. Losey
Dorsa Sadigh
39
116
0
10 Oct 2019
Semi-Automatic Labeling for Deep Learning in Robotics
Daniele De Gregorio
A. Tonioni
G. Palli
Luigi Di Stefano
32
24
0
05 Aug 2019
The VIA Annotation Software for Images, Audio and Video
Abhishek Dutta
Andrew Zisserman
60
863
0
24 Apr 2019
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel S. Brown
Wonjoon Goo
P. Nagarajan
S. Niekum
68
355
0
12 Apr 2019
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
134
3,296
0
12 Jun 2017