arXiv:2203.02155
Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,395 papers shown
Navigating the OverKill in Large Language Models
Chenyu Shi
Xiao Wang
Qiming Ge
Songyang Gao
Xianjun Yang
Tao Gui
Qi Zhang
Xuanjing Huang
Xun Zhao
Dahua Lin
101
13
0
31 Jan 2024
Neighboring Perturbations of Knowledge Editing on Large Language Models
Jun-Yu Ma
Zhen-Hua Ling
Ningyu Zhang
Jia-Chen Gu
KELM
82
6
0
31 Jan 2024
SPECTRUM: Speaker-Enhanced Pre-Training for Long Dialogue Summarization
Sangwoo Cho
Kaiqiang Song
Chao Zhao
Xiaoyang Wang
Dong Yu
79
0
0
31 Jan 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
Rheeya Uppaal
Yixuan Li
Junjie Hu
159
6
0
31 Jan 2024
Weaver: Foundation Models for Creative Writing
Tiannan Wang
Jiamin Chen
Qingrui Jia
Shuai Wang
Ruoyu Fang
...
Xiaohua Xu
Ningyu Zhang
Huajun Chen
Yuchen Eleanor Jiang
Wangchunshu Zhou
99
20
0
30 Jan 2024
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
Andy Zhou
Bo Li
Haohan Wang
AAML
143
88
0
30 Jan 2024
Rethinking Interpretability in the Era of Large Language Models
Chandan Singh
J. Inala
Michel Galley
Rich Caruana
Jianfeng Gao
LRM, AI4CE
133
72
0
30 Jan 2024
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Ansar Aynetdinov
Alan Akbik
ALM
87
12
0
30 Jan 2024
Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate
Steffi Chern
Ethan Chern
Graham Neubig
Pengfei Liu
LLMAG, ALM, ELM
46
30
0
30 Jan 2024
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models
Wai-Chung Kwan
Xingshan Zeng
Yuxin Jiang
Yufei Wang
Liangyou Li
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
LRM, ELM
51
22
0
30 Jan 2024
Security and Privacy Challenges of Large Language Models: A Survey
B. Das
M. H. Amini
Yanzhao Wu
PILM, ELM
138
145
0
30 Jan 2024
Gradient-Based Language Model Red Teaming
Nevan Wichers
Carson E. Denison
Ahmad Beirami
78
33
0
30 Jan 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
93
9
0
30 Jan 2024
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
Shun Zhang
Zhenfang Chen
Sunli Chen
Yikang Shen
Zhiqing Sun
Chuang Gan
82
27
0
30 Jan 2024
Weak-to-Strong Jailbreaking on Large Language Models
Xuandong Zhao
Xianjun Yang
Tianyu Pang
Chao Du
Lei Li
Yu-Xiang Wang
William Y. Wang
145
62
0
30 Jan 2024
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks
Bolei Ma
Ercong Nie
Shuzhou Yuan
Helmut Schmid
Michael Farber
Frauke Kreuter
Hinrich Schütze
VLM
154
6
0
29 Jan 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM, MLLM
175
268
0
29 Jan 2024
Zero-shot Imitation Policy via Search in Demonstration Dataset
Federico Malato
Florian Leopold
Andrew Melnik
Ville Hautamaki
LM&Ro, OffRL
52
7
0
29 Jan 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Pratyush Maini
Skyler Seto
Richard He Bai
David Grangier
Yizhe Zhang
Navdeep Jaitly
SyDa
92
67
0
29 Jan 2024
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Banghua Zhu
Michael I. Jordan
Jiantao Jiao
84
33
0
29 Jan 2024
VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models
Yi Zhao
Yilin Zhang
Rong Xiang
Jing Li
Hillming Li
77
16
0
29 Jan 2024
KAUCUS: Knowledge Augmented User Simulators for Training Language Model Assistants
Kaustubh D. Dhole
84
3
0
29 Jan 2024
Corrective Retrieval Augmented Generation
Shi-Qi Yan
Jia-Chen Gu
Yun Zhu
Zhen-Hua Ling
RALM
250
89
0
29 Jan 2024
LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning
Yuqiang Sun
Daoyuan Wu
Yue Xue
Han Liu
Wei Ma
Lyuye Zhang
Miaolei Shi
Yingjiu Li
ELM
201
55
0
29 Jan 2024
PILOT: Legal Case Outcome Prediction with Case Law
Lang Cao
Zifeng Wang
Cao Xiao
Jimeng Sun
AILaw, ELM
72
8
0
28 Jan 2024
YODA: Teacher-Student Progressive Learning for Language Models
Jianqiao Lu
Wanjun Zhong
Yufei Wang
Zhijiang Guo
Qi Zhu
...
Baojun Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
LRM
106
7
0
28 Jan 2024
PRE: A Peer Review Based Large Language Model Evaluator
Zhumin Chu
Qingyao Ai
Yiteng Tu
Haitao Li
Yiqun Liu
LRM, ALM
112
21
0
28 Jan 2024
Diffusion-based Graph Generative Methods
Hongyang Chen
Can Xu
Lingyu Zheng
Qiang Zhang
Xuemin Lin
DiffM, MedIm
100
0
0
28 Jan 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
81
33
0
28 Jan 2024
Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning
Xiaofei Xu
Ke Deng
Michael Dann
Xiuzhen Zhang
42
6
0
28 Jan 2024
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Yiming Gao
Feiyu Liu
Liang Wang
Zhenjie Lian
Dehua Zheng
...
Jing Dai
Qiang Fu
Wei Yang
Lanxiao Huang
Wei Liu
82
1
0
28 Jan 2024
Quantifying Stereotypes in Language
Yang Liu
75
1
0
28 Jan 2024
DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure
J. Ye
Mengnan Du
Guiling Wang
LMTD
51
8
0
27 Jan 2024
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
Yuxin Liang
Zhuoyang Song
Hao Wang
Jiaxing Zhang
HILM
102
36
0
27 Jan 2024
An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios
Zongjie Li
Wenying Qiu
Pingchuan Ma
Yichen Li
You Li
Sijia He
Baozheng Jiang
Shuai Wang
Weixi Gu
122
2
0
27 Jan 2024
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models
Minbyul Jeong
Jiwoong Sohn
Mujeen Sung
Jaewoo Kang
120
34
0
27 Jan 2024
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning
Md Mushfiqur Rahman
Mohammad Sabik Irbaz
Kai North
Michelle S. Williams
Marcos Zampieri
Kevin Lybarger
86
1
0
26 Jan 2024
GeoDecoder: Empowering Multimodal Map Understanding
Feng Qi
Mian Dai
Zixian Zheng
Chao Wang
90
2
0
26 Jan 2024
Design Principles for Generative AI Applications
Justin D. Weisz
Jessica He
Michael J. Muller
Gabriela Hoefer
Rachel Miles
Werner Geyer
AI4CE
92
143
0
25 Jan 2024
Wordflow: Social Prompt Engineering for Large Language Models
Zijie J. Wang
Aishwarya Chakravarthy
David Munechika
Duen Horng Chau
80
14
0
25 Jan 2024
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Asaf Yehudai
Boaz Carmeli
Y. Mass
Ofir Arviv
Nathaniel Mills
Assaf Toledo
Eyal Shnarch
Leshem Choshen
67
25
0
25 Jan 2024
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement
Hana Kim
Kai Tzu-iunn Ong
Seoyeon Kim
Dongha Lee
Jinyoung Yeo
78
9
0
25 Jan 2024
Parameter-Efficient Conversational Recommender System as a Language Processing Task
Mathieu Ravaut
Hao Zhang
Lu Xu
Aixin Sun
Yong Liu
113
11
0
25 Jan 2024
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
Weihao Tan
Wentao Zhang
Shanqi Liu
Longtao Zheng
Xinrun Wang
Bo An
OffRL
109
22
0
25 Jan 2024
Mapping the Design Space of Teachable Social Media Feed Experiences
K. J. Kevin Feng
Xander Koo
Lawrence Tan
Amy Bruckman
David W. McDonald
Amy X. Zhang
110
15
0
25 Jan 2024
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Hongliang He
Wenlin Yao
Kaixin Ma
Wenhao Yu
Yong Dai
Hongming Zhang
Zhenzhong Lan
Dong Yu
LLMAG
191
151
0
25 Jan 2024
MULTIVERSE: Exposing Large Language Model Alignment Problems in Diverse Worlds
Xiaolong Jin
Zhuo Zhang
Xiangyu Zhang
39
4
0
25 Jan 2024
The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support
Inhwa Song
Sachin R. Pendse
Neha Kumar
Munmun De Choudhury
AI4MH
79
18
0
25 Jan 2024
DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
Xinghao Wang
Junliang He
Pengyu Wang
Yunhua Zhou
Tianxiang Sun
Xipeng Qiu
AI4TS
51
4
0
24 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL, LRM
174
217
0
24 Jan 2024