ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,381 papers shown
Title
From Words to Numbers: Your Large Language Model Is Secretly A Capable
  Regressor When Given In-Context Examples
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples
Robert Vacareanu
Vlad-Andrei Negru
Vasile Suciu
Mihai Surdeanu
74
35
0
11 Apr 2024
Best Practices and Lessons Learned on Synthetic Data for Language Models
Best Practices and Lessons Learned on Synthetic Data for Language Models
Ruibo Liu
Jerry W. Wei
Fangyu Liu
Chenglei Si
Yanzhe Zhang
...
Steven Zheng
Daiyi Peng
Diyi Yang
Denny Zhou
Andrew M. Dai
SyDaEgoV
126
96
0
11 Apr 2024
High-Dimension Human Value Representation in Large Language Models
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
219
6
0
11 Apr 2024
"We Need Structured Output": Towards User-centered Constraints on Large
  Language Model Output
"We Need Structured Output": Towards User-centered Constraints on Large Language Model Output
Michael Xieyang Liu
Frederick Liu
Alexander J. Fiannaca
Terry Koo
Lucas Dixon
Michael Terry
Carrie J. Cai
145
34
0
10 Apr 2024
Reward Learning from Suboptimal Demonstrations with Applications in
  Surgical Electrocautery
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery
Zohre Karimi
Shing-Hei Ho
Bao Thach
Alan Kuntz
Daniel S. Brown
OffRL
60
7
0
10 Apr 2024
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on
  Graphs
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
Bowen Jin
Chulin Xie
Jiawei Zhang
Kashob Kumar Roy
Yu Zhang
...
Ruirui Li
Xianfeng Tang
Suhang Wang
Yu Meng
Jiawei Han
LRMRALM
126
52
0
10 Apr 2024
Improving Language Model Reasoning with Self-motivated Learning
Improving Language Model Reasoning with Self-motivated Learning
Yunlong Feng
Yang Xu
Libo Qin
Yasheng Wang
Wanxiang Che
LRMReLM
73
7
0
10 Apr 2024
LM Transparency Tool: Interactive Tool for Analyzing Transformer
  Language Models
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models
Igor Tufanov
Karen Hambardzumyan
Javier Ferrando
Elena Voita
KELM
103
8
0
10 Apr 2024
Accelerating Inference in Large Language Models with a Unified Layer
  Skipping Strategy
Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy
Yijin Liu
Fandong Meng
Jie Zhou
AI4CE
81
9
0
10 Apr 2024
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education
Murong Yue
Wijdane Mifdal
Yixuan Zhang
Jennifer Suh
Yixuan Zhang
Ziyu Yao
LLMAG
161
21
0
10 Apr 2024
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Rahul G. Krishnan
Vasilis Syrgkanis
128
1
0
10 Apr 2024
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Kehua Feng
Keyan Ding
Hongzhi Tan
Kede Ma
Zhihua Wang
...
Yuzhou Cheng
Ge Sun
Guozhou Zheng
Qiang Zhang
H. Chen
128
13
0
10 Apr 2024
FairPair: A Robust Evaluation of Biases in Language Models through
  Paired Perturbations
FairPair: A Robust Evaluation of Biases in Language Models through Paired Perturbations
Jane Dwivedi-Yu
Raaz Dwivedi
Timo Schick
67
2
0
09 Apr 2024
MORPHeus: a Multimodal One-armed Robot-assisted Peeling System with
  Human Users In-the-loop
MORPHeus: a Multimodal One-armed Robot-assisted Peeling System with Human Users In-the-loop
Ruolin Ye
Yifei Hu
Yuhan Bian
Bian
Luke Kulm
Tapomayukh Bhattacharjee
95
7
0
09 Apr 2024
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model
  Handling Resolutions from 336 Pixels to 4K HD
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Xingcheng Zhang
Jifeng Dai
Yuxin Qiao
Dahua Lin
Jiaqi Wang
VLMMLLM
116
127
0
09 Apr 2024
Autonomous Evaluation and Refinement of Digital Agents
Autonomous Evaluation and Refinement of Digital Agents
Jiayi Pan
Yichi Zhang
Nicholas Tomlin
Yifei Zhou
Sergey Levine
Alane Suhr
ELM
154
66
0
09 Apr 2024
Rethinking How to Evaluate Language Model Jailbreak
Rethinking How to Evaluate Language Model Jailbreak
Hongyu Cai
Arjun Arunasalam
Leo Y. Lin
Antonio Bianchi
Z. Berkay Celik
ALM
65
8
0
09 Apr 2024
SurveyAgent: A Conversational System for Personalized and Efficient
  Research Survey
SurveyAgent: A Conversational System for Personalized and Efficient Research Survey
Xintao Wang
Jiangjie Chen
Nianqi Li
Lida Chen
Xinfeng Yuan
Wei Shi
Xuyang Ge
Rui Xu
Yanghua Xiao
54
3
0
09 Apr 2024
AgentsCoDriver: Large Language Model Empowered Collaborative Driving
  with Lifelong Learning
AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning
Senkang Hu
Zhengru Fang
Zihan Fang
Yiqin Deng
Xianhao Chen
Yuguang Fang
125
35
0
09 Apr 2024
Understanding Cross-Lingual Alignment -- A Survey
Understanding Cross-Lingual Alignment -- A Survey
Katharina Hämmerl
Jindvrich Libovický
Alexander Fraser
85
14
0
09 Apr 2024
Cendol: Open Instruction-tuned Generative Large Language Models for
  Indonesian Languages
Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages
Samuel Cahyawijaya
Holy Lovenia
Fajri Koto
Rifki Afina Putri
Emmanuel Dave
...
Bryan Wilie
Genta Indra Winata
Alham Fikri Aji
Ayu Purwarianti
Pascale Fung
141
18
0
09 Apr 2024
Feel-Good Thompson Sampling for Contextual Dueling Bandits
Feel-Good Thompson Sampling for Contextual Dueling Bandits
Xuheng Li
Heyang Zhao
Quanquan Gu
119
14
0
09 Apr 2024
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
Guangchen Lan
Dong-Jun Han
Abolfazl Hashemi
Vaneet Aggarwal
Christopher G. Brinton
232
16
0
09 Apr 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation
  of Large Language Models
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
Zhuohao Yu
Chang Gao
Wenjin Yao
Yidong Wang
Zhengran Zeng
Wei Ye
Jindong Wang
Yue Zhang
Shikun Zhang
61
3
0
09 Apr 2024
AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM
  Experts
AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts
Shaona Ghosh
Prasoon Varshney
Erick Galinkin
Christopher Parisien
ELM
95
52
0
09 Apr 2024
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Parishad BehnamGhader
Vaibhav Adlakha
Marius Mosbach
Dzmitry Bahdanau
Nicolas Chapados
Siva Reddy
124
242
0
09 Apr 2024
Efficient Multi-Task Reinforcement Learning via Task-Specific Action
  Correction
Efficient Multi-Task Reinforcement Learning via Task-Specific Action Correction
Jinyuan Feng
Min Chen
Zhiqiang Pu
Tenghai Qiu
Jianqiang Yi
80
2
0
09 Apr 2024
The Hallucinations Leaderboard -- An Open Effort to Measure
  Hallucinations in Large Language Models
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
Giwon Hong
Aryo Pradipta Gema
Rohit Saxena
Xiaotang Du
Ping Nie
...
Laura Perez-Beltrachini
Max Ryabinin
Xuanli He
Clémentine Fourrier
Pasquale Minervini
LRMHILM
85
12
0
08 Apr 2024
Eraser: Jailbreaking Defense in Large Language Models via Unlearning
  Harmful Knowledge
Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge
Weikai Lu
Huiping Zhuang
Jianwei Wang
Zhengdong Lu
Zelin Chen
Huiping Zhuang
Cen Chen
MUAAMLKELM
86
30
0
08 Apr 2024
CodecLM: Aligning Language Models with Tailored Synthetic Data
CodecLM: Aligning Language Models with Tailored Synthetic Data
Zifeng Wang
Chun-Liang Li
Vincent Perot
Long T. Le
Jin Miao
Zizhao Zhang
Chen-Yu Lee
Tomas Pfister
SyDaALM
73
21
0
08 Apr 2024
Negative Preference Optimization: From Catastrophic Collapse to
  Effective Unlearning
Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning
Ruiqi Zhang
Licong Lin
Yu Bai
Song Mei
MU
141
193
0
08 Apr 2024
LTNER: Large Language Model Tagging for Named Entity Recognition with
  Contextualized Entity Marking
LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking
Faren Yan
Peng Yu
Xin Chen
68
6
0
08 Apr 2024
Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data
Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data
Tim Baumgärtner
Yang Gao
Dana Alon
Donald Metzler
AAML
99
23
0
08 Apr 2024
Towards Objectively Benchmarking Social Intelligence for Language Agents
  at Action Level
Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level
Chenxu Wang
Bin Dai
Huaping Liu
Baoyuan Wang
ALMLLMAGELM
67
7
0
08 Apr 2024
Interpreting Themes from Educational Stories
Interpreting Themes from Educational Stories
Yigeng Zhang
Fabio A. González
Thamar Solorio
112
1
0
08 Apr 2024
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models
Yutao Ouyang
Jinhan Li
Yunfei Li
Zhongyu Li
Chao Yu
Koushil Sreenath
Yi Wu
136
15
0
08 Apr 2024
Chart What I Say: Exploring Cross-Modality Prompt Alignment in
  AI-Assisted Chart Authoring
Chart What I Say: Exploring Cross-Modality Prompt Alignment in AI-Assisted Chart Authoring
Nazar Ponochevnyi
Anastasia Kuzminykh
80
1
0
07 Apr 2024
Facial Affective Behavior Analysis with Instruction Tuning
Facial Affective Behavior Analysis with Instruction Tuning
Yifan Li
Anh Dao
Wentao Bao
Zhen Tan
Tianlong Chen
Huan Liu
Yu Kong
CVBM
116
15
0
07 Apr 2024
X-VARS: Introducing Explainability in Football Refereeing with
  Multi-Modal Large Language Model
X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model
Jan Held
Hani Itani
A. Cioppa
Silvio Giancola
Guohao Li
Marc Van Droogenbroeck
84
17
0
07 Apr 2024
Towards Understanding the Influence of Reward Margin on Preference Model
  Performance
Towards Understanding the Influence of Reward Margin on Preference Model Performance
Bowen Qin
Duanyu Feng
Xi Yang
58
4
0
07 Apr 2024
Regularized Conditional Diffusion Model for Multi-Task Preference
  Alignment
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
Xudong Yu
Chenjia Bai
Haoran He
Changhong Wang
Xuelong Li
122
6
0
07 Apr 2024
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving
  Imitation Learning with LLMs
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
Yiqun Duan
Qiang Zhang
Renjing Xu
135
12
0
07 Apr 2024
Light the Night: A Multi-Condition Diffusion Framework for Unpaired
  Low-Light Enhancement in Autonomous Driving
Light the Night: A Multi-Condition Diffusion Framework for Unpaired Low-Light Enhancement in Autonomous Driving
Jinlong Li
Baolu Li
Zhengzhong Tu
Xinyu Liu
Qing Guo
Felix Juefei Xu
Runsheng Xu
Hongkai Yu
DiffM
128
26
0
07 Apr 2024
GenEARL: A Training-Free Generative Framework for Multimodal Event
  Argument Role Labeling
GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling
Hritik Bansal
Po-Nien Kung
P. Brantingham
Weisheng Wang
Miao Zheng
VLM
72
1
0
07 Apr 2024
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Weilin Cai
Juyong Jiang
Le Qin
Junwei Cui
Sunghun Kim
Jiayi Huang
185
10
0
07 Apr 2024
BARMPy: Bayesian Additive Regression Models Python Package
BARMPy: Bayesian Additive Regression Models Python Package
Danielle Van Boxel
BDLKELMGP
52
0
0
06 Apr 2024
PhyloLM : Inferring the Phylogeny of Large Language Models and
  Predicting their Performances in Benchmarks
PhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks
Nicolas Yax
Pierre-Yves Oudeyer
Stefano Palminteri
137
6
0
06 Apr 2024
Towards Analyzing and Understanding the Limitations of DPO: A
  Theoretical Perspective
Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective
Duanyu Feng
Bowen Qin
Chen Huang
Zheng Zhang
Wenqiang Lei
76
41
0
06 Apr 2024
Language Models as Critical Thinking Tools: A Case Study of Philosophers
Language Models as Critical Thinking Tools: A Case Study of Philosophers
Andre Ye
Jared Moore
Rose Novick
Amy X. Zhang
KELMELMLRMLLMAG
61
10
0
06 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
157
34
0
06 Apr 2024
Previous
123...828384...126127128
Next