Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.06079
Cited By
TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations
9 May 2025
Shuaiyi Huang
Mara Levy
Anubhav Gupta
Daniel Ekpo
Ruijie Zheng
Abhinav Shrivastava
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations"
12 / 12 papers shown
Title
WayEx: Waypoint Exploration using a Single Demonstration
Mara Levy
Nirat Saini
Abhinav Shrivastava
90
1
0
22 Jul 2024
ARDuP: Active Region Video Diffusion for Universal Policies
Shuaiyi Huang
Mara Levy
Zhenyu Jiang
Anima Anandkumar
Yuke Zhu
Linxi Fan
De-An Huang
Abhinav Shrivastava
VGen
117
4
0
19 Jun 2024
RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences
Jie Cheng
Gang Xiong
Xingyuan Dai
Qinghai Miao
Yisheng Lv
Fei-Yue Wang
101
19
0
27 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
103
8
0
09 Feb 2024
LMPriors: Pre-Trained Language Models as Task-Specific Priors
Kristy Choi
Chris Cundy
Sanjari Srivastava
Stefano Ermon
BDL
109
43
0
22 Oct 2022
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
121
101
0
04 Nov 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
67
289
0
09 Jun 2021
Learning from Noisy Labels with Deep Neural Networks: A Survey
Hwanjun Song
Minseok Kim
Dongmin Park
Yooju Shin
Jae-Gil Lee
NoLa
130
1,001
0
16 Jul 2020
Does label smoothing mitigate label noise?
Michal Lukasik
Srinadh Bhojanapalli
A. Menon
Surinder Kumar
NoLa
193
351
0
05 Mar 2020
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
319
8,432
0
04 Jan 2018
Structured Attentions for Visual Question Answering
Chen Zhu
Yanpeng Zhao
Shuaiyi Huang
Kewei Tu
Yi-An Ma
FAtt
84
107
0
07 Aug 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
222
3,380
0
12 Jun 2017
1