v1v2 (latest)

Crowd-PrefRL: Preference-Based Reward Learning from Crowds

17 January 2024

Papers citing "Crowd-PrefRL: Preference-Based Reward Learning from Crowds"

27 / 27 papers shown

Title
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning Mingkang Wu Devin White Vernon J. Lawhern Nicholas R. Waytowich Yongcan Cao OffRL 65 1 0 13 Jan 2025
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning S. Poddar Yanming Wan Hamish Ivison Abhishek Gupta Natasha Jaques 75 50 0 19 Aug 2024
Learning From Crowdsourced Noisy Labels: A Signal Processing Perspective Shahana Ibrahim Panagiotis A. Traganitis Xiao Fu G. Giannakis NoLa 64 1 0 09 Jul 2024
Group Robust Preference Optimization in Reward-free RLHF Shyam Sundhar Ramesh Yifan Hu Iason Chaimalas Viraj Mehta Pier Giuseppe Sessa Haitham Bou-Ammar Ilija Bogunovic 79 39 0 30 May 2024
Corruption Robust Offline Reinforcement Learning with Human Feedback Debmalya Mandal Andi Nika Parameswaran Kamalaruban Adish Singla Goran Radanović OffRL 85 11 0 09 Feb 2024
Scalable Interactive Machine Learning for Future Command and Control Anna Madison Ellen R. Novoseller Vinicius G. Goecks Benjamin T. Files Nicholas R. Waytowich Alfred Yu Vernon J. Lawhern Steven Thurman Christopher Kelshaw Kaleb McDowell 58 4 0 09 Feb 2024
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF Anand Siththaranjan Cassidy Laidlaw Dylan Hadfield-Menell 99 71 0 13 Dec 2023
Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao John Dang Aditya Grover 67 30 0 17 Oct 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback Stephen Casper Xander Davies Claudia Shi T. Gilbert Jérémy Scheurer ... Erdem Biyik Anca Dragan David M. Krueger Dorsa Sadigh Dylan Hadfield-Menell ALM OffRL 131 529 0 27 Jul 2023
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback M. Torné Max Balsells Zihan Wang Samedh Desai Tao Chen Pulkit Agrawal Abhishek Gupta 67 8 0 20 Jul 2023
Towards Measuring the Representation of Subjective Global Opinions in Language Models Esin Durmus Karina Nyugen Thomas I. Liao Nicholas Schiefer Amanda Askell ... Alex Tamkin Janel Thamkul Jared Kaplan Jack Clark Deep Ganguli 92 244 0 28 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Rafael Rafailov Archit Sharma E. Mitchell Stefano Ermon Christopher D. Manning Chelsea Finn ALM 389 4,139 0 29 May 2023
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 883 13,176 0 04 Mar 2022
Imitation Learning by Estimating Expertise of Demonstrators M. Beliaev Andy Shih Stefano Ermon Dorsa Sadigh Ramtin Pedarsani 82 49 0 02 Feb 2022
Batch Reinforcement Learning from Crowds Guoxi Zhang H. Kashima OffRL 81 5 0 08 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning Kimin Lee Laura M. Smith Anca Dragan Pieter Abbeel OffRL 96 100 0 04 Nov 2021
Learning Reward Functions from Scale Feedback Nils Wilde Erdem Biyik Dorsa Sadigh Stephen L. Smith 89 34 0 01 Oct 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron Courville Marc G. Bellemare OffRL 123 676 0 30 Aug 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Kimin Lee Laura M. Smith Pieter Abbeel OffRL 65 288 0 09 Jun 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems Sergey Levine Aviral Kumar George Tucker Justin Fu OffRL GP 566 2,044 0 04 May 2020
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations Daniel S. Brown Wonjoon Goo P. Nagarajan S. Niekum 78 358 0 12 Apr 2019
DeepMind Control Suite Yuval Tassa Yotam Doron Alistair Muldal Tom Erez Yazhe Li ... A. Abdolmaleki J. Merel Andrew Lefrancq Timothy Lillicrap Martin Riedmiller ELM LM&Ro BDL 148 1,143 0 02 Jan 2018
Proximal Policy Optimization Algorithms John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov OffRL 535 19,265 0 20 Jul 2017
Deep reinforcement learning from human preferences Paul Christiano Jan Leike Tom B. Brown Miljan Martic Shane Legg Dario Amodei 218 3,365 0 12 Jun 2017
Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets Karol Hausman Yevgen Chebotar S. Schaal Gaurav Sukhatme Joseph J. Lim GAN 69 150 0 30 May 2017
Ranking and combining multiple predictors without labeled data Fabio Parisi Francesco Strino B. Nadler Y. Kluger 152 135 0 13 Mar 2013
Bayesian multitask inverse reinforcement learning Christos Dimitrakakis Constantin Rothkopf BDL 82 107 0 18 Jun 2011