ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.02861
  4. Cited By
A Framework for Fine-Tuning LLMs using Heterogeneous Feedback

A Framework for Fine-Tuning LLMs using Heterogeneous Feedback

5 August 2024
Ryan Aponte
Ryan A. Rossi
Shunan Guo
Franck Dernoncourt
Tong Yu
Xiang Chen
Subrata Mitra
Nedim Lipka
    OffRL
ArXivPDFHTML

Papers citing "A Framework for Fine-Tuning LLMs using Heterogeneous Feedback"

2 / 2 papers shown
Title
Active Instruction Tuning: Improving Cross-Task Generalization by
  Training on Prompt Sensitive Tasks
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks
Po-Nien Kung
Fan Yin
Di Wu
Kai-Wei Chang
Nanyun Peng
77
40
0
01 Nov 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
319
11,953
0
04 Mar 2022
1