arXiv: 2403.07708

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

12 March 2024

Wei Shen

Yang Liu

Papers citing "Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards"

14 / 14 papers shown

PC-Diffusion: Aligning Diffusion Models with Human Preferences via Preference Classifier

262

0

0

11 Nov 2025

Efficient Reinforcement Learning from Human Feedback via Bayesian Preference Inference

Valeria Capretti

Simone Formentin

298

3

0

06 Nov 2025

GCPO: When Contrast Fails, Go Gold

171

1

0

09 Oct 2025

Oracle-RLAIF: An Improved Fine-Tuning Framework for Multi-modal Video Models through Reinforcement Learning from Ranking Feedback

Christine Klymko

Shashank Kushwaha

Felipe Leno Da Silva

226

0

0

02 Oct 2025

A Survey on Progress in LLM Alignment from the Perspective of Reward Design

478

14

0

05 May 2025

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

...

OffRL AI4TS SyDa LRM VLM

644

35

0

23 Apr 2025

Energy-Based Reward Models for Robust Language Model Alignment

1.1K

3

0

17 Apr 2025

Reasoning without Regret

328

0

0

14 Apr 2025

Reward Shaping to Mitigate Reward Hacking in RLHF

736

58

0

26 Feb 2025

MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

International Conference on Learning Representations (ICLR), 2024

Sudipta Singha Roy

Maarten de Rijke

680

21

0

10 Oct 2024

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

International Conference on Learning Representations (ICLR), 2024

1.0K

9

0

03 Oct 2024

Reward-Robust RLHF in LLMs

Yuzi Yan

Jialian Li

Yiping Zhang

Jian Xie

Yu Wang

Dong Yan

Yuan Shen

489

21

0

18 Sep 2024

Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates

Mei Han

596

71

0

23 Aug 2024

Noise Contrastive Alignment of Language Models with Explicit Rewards

Jun Zhu

467

86

0

08 Feb 2024
