ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,370 papers shown
Title
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in
  Alignment
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Keming Lu
Bowen Yu
Fei Huang
Yang Fan
Runji Lin
Chang Zhou
MoMe
85
21
0
28 May 2024
The Evolution of Multimodal Model Architectures
The Evolution of Multimodal Model Architectures
S. Wadekar
Abhishek Chaurasia
Aman Chadha
Eugenio Culurciello
109
18
0
28 May 2024
Getting More Juice Out of the SFT Data: Reward Learning from Human
  Demonstration Improves SFT for LLM Alignment
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment
Jiaxiang Li
Siliang Zeng
Hoi-To Wai
Chenliang Li
Alfredo García
Mingyi Hong
133
18
0
28 May 2024
Improved Generation of Adversarial Examples Against Safety-aligned LLMs
Improved Generation of Adversarial Examples Against Safety-aligned LLMs
Qizhang Li
Yiwen Guo
Wangmeng Zuo
Hao Chen
AAMLSILM
87
7
0
28 May 2024
Personalized Steering of Large Language Models: Versatile Steering
  Vectors Through Bi-directional Preference Optimization
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
Yuanpu Cao
Tianrong Zhang
Bochuan Cao
Ziyi Yin
Lu Lin
Fenglong Ma
Jinghui Chen
LLMSV
92
33
0
28 May 2024
Detection-Correction Structure via General Language Model for
  Grammatical Error Correction
Detection-Correction Structure via General Language Model for Grammatical Error Correction
Wei Li
Houfeng Wang
102
6
0
28 May 2024
C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark
  for Large Language Models
C3^{3}3Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models
Jiahuan Cao
Yongxin Shi
Dezhi Peng
Yang Liu
Lianwen Jin
ELM
70
0
0
28 May 2024
MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting
MockLLM: A Multi-Agent Behavior Collaboration Framework for Online Job Seeking and Recruiting
Hongda Sun
Hongzhan Lin
Haiyu Yan
Chen Zhu
Yang Song
Xin Gao
82
8
0
28 May 2024
The Impossibility of Fair LLMs
The Impossibility of Fair LLMs
Jacy Reese Anthis
Kristian Lum
Michael Ekstrand
Avi Feller
Alexander D’Amour
FaML
130
14
0
28 May 2024
Cross-Modal Safety Alignment: Is textual unlearning all you need?
Cross-Modal Safety Alignment: Is textual unlearning all you need?
Trishna Chakraborty
Erfan Shayegani
Zikui Cai
Nael B. Abu-Ghazaleh
M. Salman Asif
Yue Dong
Amit K. Roy-Chowdhury
Chengyu Song
85
17
0
27 May 2024
Revision Matters: Generative Design Guided by Revision Edits
Revision Matters: Generative Design Guided by Revision Edits
Tao Li
Chin-Yi Cheng
Amber Xie
Gang Li
Yang Li
85
1
0
27 May 2024
SMR: State Memory Replay for Long Sequence Modeling
SMR: State Memory Replay for Long Sequence Modeling
Biqing Qi
Junqi Gao
Kaiyan Zhang
Dong Li
Jianxing Liu
Ligang Wu
Bowen Zhou
103
5
0
27 May 2024
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
Zixian Huang
Wenhao Zhu
Gong Cheng
Lei Li
Fei Yuan
LRM
93
14
0
27 May 2024
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Hyunseok Lee
Jihoon Tack
Jinwoo Shin
DeLMO
61
1
0
27 May 2024
Navigating the Safety Landscape: Measuring Risks in Finetuning Large
  Language Models
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models
Sheng-Hsuan Peng
Pin-Yu Chen
Matthew Hull
Duen Horng Chau
102
30
0
27 May 2024
Prompt Optimization with Human Feedback
Prompt Optimization with Human Feedback
Xiaoqiang Lin
Zhongxiang Dai
Arun Verma
See-Kiong Ng
Patrick Jaillet
K. H. Low
AAML
97
12
0
27 May 2024
Benchmarking General-Purpose In-Context Learning
Benchmarking General-Purpose In-Context Learning
Fan Wang
Chuan Lin
Yang Cao
Yu Kang
128
1
0
27 May 2024
Aligning LLMs through Multi-perspective User Preference Ranking-based
  Feedback for Programming Question Answering
Aligning LLMs through Multi-perspective User Preference Ranking-based Feedback for Programming Question Answering
Hongyu Yang
Liyang He
Min Hou
Shuanghong Shen
Rui Li
Jiahui Hou
Jianhui Ma
Junda Zhao
50
4
0
27 May 2024
LLM-Optic: Unveiling the Capabilities of Large Language Models for
  Universal Visual Grounding
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding
Haoyu Zhao
Wenhang Ge
Ying-Cong Chen
ObjDMLLMVLM
88
5
0
27 May 2024
Empowering Character-level Text Infilling by Eliminating Sub-Tokens
Empowering Character-level Text Infilling by Eliminating Sub-Tokens
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Hongsheng Li
AI4CE
72
1
0
27 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for
  Controllable Language Generation
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
64
2
0
27 May 2024
Position: Foundation Agents as the Paradigm Shift for Decision Making
Position: Foundation Agents as the Paradigm Shift for Decision Making
Xiaoqian Liu
Xingzhou Lou
Jianbin Jiao
Junge Zhang
OffRLLLMAG
105
7
0
27 May 2024
Recent advances in text embedding: A Comprehensive Review of
  Top-Performing Methods on the MTEB Benchmark
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
109
15
0
27 May 2024
Exploring the LLM Journey from Cognition to Expression with Linear
  Representations
Exploring the LLM Journey from Cognition to Expression with Linear Representations
Yuzi Yan
J. Li
Yipin Zhang
Dong Yan
77
2
0
27 May 2024
Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models
Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models
Chia-Yi Hsu
Yu-Lin Tsai
Chih-Hsun Lin
Pin-Yu Chen
Chia-Mu Yu
Chun-ying Huang
143
56
0
27 May 2024
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT
  Even in Low-Resource Settings
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Robert Wolfe
Isaac Slaughter
Bin Han
Bingbing Wen
Yiwei Yang
...
Bernease Herman
E. Brown
Zening Qu
Nicholas Weber
Bill Howe
107
8
0
27 May 2024
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models
Chankyu Lee
Rajarshi Roy
Mengyao Xu
Jonathan Raiman
Mohammad Shoeybi
Bryan Catanzaro
Ming-Yu Liu
RALM
308
205
0
27 May 2024
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Aojun Zhou
Junting Pan
Hongsheng Li
SyDa
125
7
0
27 May 2024
The AI-DEC: A Card-based Design Method for User-centered AI Explanations
The AI-DEC: A Card-based Design Method for User-centered AI Explanations
Christine P. Lee
M. Lee
Bilge Mutlu
HAI
79
6
0
26 May 2024
Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
Peiran Yao
Denilson Barbosa
ELM
87
7
0
26 May 2024
A Survey of Multimodal Large Language Model from A Data-centric
  Perspective
A Survey of Multimodal Large Language Model from A Data-centric Perspective
Tianyi Bai
Hao Liang
Binwang Wan
Yanran Xu
Xi Li
...
Ping Huang
Jiulong Shan
Conghui He
Binhang Yuan
Wentao Zhang
139
45
0
26 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs
  across Diverse Granularity
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
79
4
0
26 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
75
1
0
26 May 2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is
  Implicitly an Adversarial Regularizer
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer
Zhihan Liu
Miao Lu
Shenao Zhang
Boyi Liu
Hongyi Guo
Yingxiang Yang
Jose H. Blanchet
Zhaoran Wang
147
62
0
26 May 2024
M-RAG: Reinforcing Large Language Model Performance through
  Retrieval-Augmented Generation with Multiple Partitions
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions
Zheng Wang
Shu Xian Teo
Jieer Ouyang
Yongjun Xu
Wei Shi
RALMVLM
71
17
0
26 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective
  Reinforcement Learning
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
79
3
0
26 May 2024
Multi-Reference Preference Optimization for Large Language Models
Multi-Reference Preference Optimization for Large Language Models
Hung Le
Quan Tran
D. Nguyen
Kien Do
Saloni Mittal
Kelechi Ogueji
Svetha Venkatesh
89
1
0
26 May 2024
Rewarded Region Replay (R3) for Policy Learning with Discrete Action
  Space
Rewarded Region Replay (R3) for Policy Learning with Discrete Action Space
Bangzheng Li
Ningshan Ma
Zifan Wang
44
0
1
26 May 2024
RLSF: Fine-tuning LLMs via Symbolic Feedback
RLSF: Fine-tuning LLMs via Symbolic Feedback
Piyush Jha
Prithwish Jana
Pranavkrishna Suresh
Arnav Arora
Vijay Ganesh
LRM
111
4
0
26 May 2024
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
Max Liu
Chan-Hung Yu
Wei-Hsu Lee
Cheng-Wei Hung
Yen-Chun Chen
Shao-Hua Sun
145
5
0
26 May 2024
Comparative Analysis of Open-Source Language Models in Summarizing
  Medical Text Data
Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data
Yuhao Chen
Zhimu Wang
Bo Wen
F. Zulkernine
ELMLM&MAAI4MH
25
4
0
25 May 2024
Confidence Under the Hood: An Investigation into the
  Confidence-Probability Alignment in Large Language Models
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
Abhishek Kumar
Robert D Morabito
Sanzhar Umbet
Jad Kabbara
Ali Emami
110
13
0
25 May 2024
MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time
MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time
Jikun Kang
Xin Zhe Li
Xi Chen
Amirreza Kazemi
Qianyi Sun
...
Xu He
Quan He
Feng Wen
Jianye Hao
Jun Yao
LRMReLM
83
22
0
25 May 2024
InstructPatentGPT: Training patent language models to follow
  instructions with human feedback
InstructPatentGPT: Training patent language models to follow instructions with human feedback
Jieh-Sheng Lee
ALM
108
8
0
25 May 2024
Accelerating Inference of Retrieval-Augmented Generation via Sparse
  Context Selection
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection
Yun Zhu
Jia-Chen Gu
Caitlin Sikora
Ho Ko
Yinxiao Liu
...
Lei Shu
Liangchen Luo
Lei Meng
Bang Liu
Jindong Chen
RALM
99
19
0
25 May 2024
5W1H Extraction With Large Language Models
5W1H Extraction With Large Language Models
Yang Cao
Yangsong Lan
Feiyan Zhai
Piji Li
105
1
0
25 May 2024
Streaming Long Video Understanding with Large Language Models
Streaming Long Video Understanding with Large Language Models
Rui Qian
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Shuangrui Ding
Dahua Lin
Jiaqi Wang
VLM
137
49
0
25 May 2024
The Impact and Opportunities of Generative AI in Fact-Checking
The Impact and Opportunities of Generative AI in Fact-Checking
Robert Wolfe
Tanushree Mitra
81
16
0
24 May 2024
Scaling Laws for Discriminative Classification in Large Language Models
Scaling Laws for Discriminative Classification in Large Language Models
Dean Wyatte
Fatemeh Tahmasbi
Ming Li
Thomas Markovich
99
2
0
24 May 2024
Sparse maximal update parameterization: A holistic approach to sparse
  training dynamics
Sparse maximal update parameterization: A holistic approach to sparse training dynamics
Nolan Dey
Shane Bergsma
Joel Hestness
75
5
0
24 May 2024
Previous
123...737475...126127128
Next