ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,392 papers shown
Title
Understanding Large-Language Model (LLM)-powered Human-Robot Interaction
Understanding Large-Language Model (LLM)-powered Human-Robot Interaction
Callie Y. Kim
Christine P. Lee
Bilge Mutlu
LM&Ro
95
82
0
06 Jan 2024
The Dawn After the Dark: An Empirical Study on Factuality Hallucination
  in Large Language Models
The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models
Junyi Li
Jie Chen
Ruiyang Ren
Xiaoxue Cheng
Wayne Xin Zhao
Jian-Yun Nie
Ji-Rong Wen
HILM
103
57
0
06 Jan 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and
  Efficient Autonomous Driving
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
89
36
0
06 Jan 2024
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language
  Models
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
Yaojia Lv
Haojie Pan
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
68
5
0
06 Jan 2024
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Lin Sun
Kai Zhang
Qingyuan Li
Renze Lou
89
14
0
05 Jan 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
DeepSeek-AI Xiao Bi
:
Xiao Bi
Deli Chen
Guanting Chen
...
Yao Zhao
Shangyan Zhou
Shunfeng Zhou
Qihao Zhu
Yuheng Zou
LRMALM
209
382
0
05 Jan 2024
Towards ASR Robust Spoken Language Understanding Through In-Context
  Learning With Word Confusion Networks
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Kevin Everson
Yile Gu
Huck Yang
Prashanth Gurunath Shivakumar
Guan-Ting Lin
...
Shalini Ghosh
Wael Hamza
Hung-yi Lee
Ariya Rastrow
A. Stolcke
70
6
0
05 Jan 2024
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Renjie Pi
Tianyang Han
Jianshu Zhang
Yueqi Xie
Boyao Wang
Qing Lian
Hanze Dong
Jipeng Zhang
Tong Zhang
AAML
111
71
0
05 Jan 2024
Object-Centric Instruction Augmentation for Robotic Manipulation
Object-Centric Instruction Augmentation for Robotic Manipulation
Junjie Wen
Yichen Zhu
Minjie Zhu
Jinming Li
Zhiyuan Xu
...
Yaxin Peng
Chaomin Shen
Dong Liu
Feifei Feng
Jian Tang
LM&Ro
116
17
0
05 Jan 2024
From LLM to Conversational Agent: A Memory Enhanced Architecture with
  Fine-Tuning of Large Language Models
From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
Na Liu
Liangyu Chen
Xiaoyu Tian
Wei Zou
Kaijiang Chen
Ming Cui
LLMAG
106
31
0
05 Jan 2024
AST-T5: Structure-Aware Pretraining for Code Generation and
  Understanding
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Linyuan Gong
Mostafa Elhoushi
Alvin Cheung
150
18
0
05 Jan 2024
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal
  Models with Multiple Image Inputs
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs
Daoan Zhang
Junming Yang
Hanjia Lyu
Zijian Jin
Yuan Yao
Mingkai Chen
Jiebo Luo
104
40
0
05 Jan 2024
Large Language Models for Social Networks: Applications, Challenges, and
  Solutions
Large Language Models for Social Networks: Applications, Challenges, and Solutions
Jingying Zeng
Richard Huang
Waleed Malik
Langxuan Yin
Bojan Babic
Danny Shacham
Xiao Yan
Jaewon Yang
Qi He
72
9
0
04 Jan 2024
LLaMA Pro: Progressive LLaMA with Block Expansion
LLaMA Pro: Progressive LLaMA with Block Expansion
Chengyue Wu
Yukang Gan
Yixiao Ge
Zeyu Lu
Jiahao Wang
Ye Feng
Ying Shan
Ping Luo
CLL
90
72
0
04 Jan 2024
ChartAssisstant: A Universal Chart Multimodal Language Model via
  Chart-to-Table Pre-training and Multitask Instruction Tuning
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning
Fanqing Meng
Wenqi Shao
Quanfeng Lu
Peng Gao
Kaipeng Zhang
Yu Qiao
Ping Luo
121
55
0
04 Jan 2024
DIALIGHT: Lightweight Multilingual Development and Evaluation of
  Task-Oriented Dialogue Systems with Large Language Models
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models
Songbo Hu
Xiaobin Wang
Moy Yuan
Anna Korhonen
Ivan Vulić
89
4
0
04 Jan 2024
Data-Centric Foundation Models in Computational Healthcare: A Survey
Data-Centric Foundation Models in Computational Healthcare: A Survey
Yunkun Zhang
Jin Gao
Zheling Tan
Lingfeng Zhou
Kexin Ding
Mu Zhou
Shaoting Zhang
Dequan Wang
AI4CE
113
25
0
04 Jan 2024
MobileAgent: enhancing mobile control via human-machine interaction and
  SOP integration
MobileAgent: enhancing mobile control via human-machine interaction and SOP integration
Tinghe Ding
LLMAGLM&Ro
124
12
0
04 Jan 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference
Understanding LLMs: A Comprehensive Overview from Training to Inference
Yi-Hsueh Liu
Haoyang He
Tianle Han
Xu-Yao Zhang
Mengyuan Liu
...
Xintao Hu
Tuo Zhang
Ning Qiang
Tianming Liu
Bao Ge
SyDa
166
80
0
04 Jan 2024
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO
  and Toxicity
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity
Andrew Lee
Xiaoyan Bai
Itamar Pres
Martin Wattenberg
Jonathan K. Kummerfeld
Rada Mihalcea
150
121
0
03 Jan 2024
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Uri Shaham
Jonathan Herzig
Roee Aharoni
Idan Szpektor
Reut Tsarfaty
Matan Eyal
LRM
133
52
0
03 Jan 2024
Physio: An LLM-Based Physiotherapy Advisor
Physio: An LLM-Based Physiotherapy Advisor
Rúben Almeida
Hugo Sousa
L. F. Cunha
Nuno Guimarães
Ricardo Campos
A. Jorge
LM&MA
54
1
0
03 Jan 2024
A Novel Paradigm for Neural Computation: X-Net with Learnable Neurons
  and Adaptable Structure
A Novel Paradigm for Neural Computation: X-Net with Learnable Neurons and Adaptable Structure
Yanjie Li
Weijun Li
Lina Yu
Min Wu
Jinyi Liu
...
Xin Ning
Yugui Zhang
Baoli Lu
Jian Xu
Shuang Li
85
0
0
03 Jan 2024
GPT-4V(ision) is a Generalist Web Agent, if Grounded
GPT-4V(ision) is a Generalist Web Agent, if Grounded
Boyuan Zheng
Boyu Gou
Jihyung Kil
Huan Sun
Yu-Chuan Su
MLLMVLMLLMAG
142
264
0
03 Jan 2024
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse
GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse
Hongzhan Lin
Ziyang Luo
Bo Wang
Ruichao Yang
Jing Ma
124
31
0
03 Jan 2024
Theoretical guarantees on the best-of-n alignment policy
Theoretical guarantees on the best-of-n alignment policy
Ahmad Beirami
Alekh Agarwal
Jonathan Berant
Alex DÁmour
Jacob Eisenstein
Chirag Nagpal
A. Suresh
129
61
0
03 Jan 2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language
  Models
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Zixiang Chen
Yihe Deng
Huizhuo Yuan
Kaixuan Ji
Quanquan Gu
SyDa
155
327
0
02 Jan 2024
LLM Harmony: Multi-Agent Communication for Problem Solving
LLM Harmony: Multi-Agent Communication for Problem Solving
Sumedh Rasal
LLMAG
66
24
0
02 Jan 2024
Taking the Next Step with Generative Artificial Intelligence: The
  Transformative Role of Multimodal Large Language Models in Science Education
Taking the Next Step with Generative Artificial Intelligence: The Transformative Role of Multimodal Large Language Models in Science Education
Arne Bewersdorff
Christian Hartmann
Marie Hornberger
Kathrin Seßler
Maria Bannert
Enkelejda Kasneci
Gjergji Kasneci
Xiaoming Zhai
Claudia Nerdel
127
37
0
01 Jan 2024
A Computational Framework for Behavioral Assessment of LLM Therapists
A Computational Framework for Behavioral Assessment of LLM Therapists
Yu Ying Chiu
Ashish Sharma
Inna Wanyin Lin
Tim Althoff
AI4MH
83
44
0
01 Jan 2024
Astraios: Parameter-Efficient Instruction Tuning Code Large Language
  Models
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Terry Yue Zhuo
A. Zebaze
Nitchakarn Suppattarachai
Leandro von Werra
H. D. Vries
Qian Liu
Niklas Muennighoff
ALM
101
18
0
01 Jan 2024
Temporal Validity Change Prediction
Temporal Validity Change Prediction
Georg Wenzel
Adam Jatowt
99
0
0
01 Jan 2024
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
Jinglong Luo
Yehong Zhang
Zhuo Zhang
Jiaqi Zhang
Xin Mu
Hui Wang
Yue Yu
Zenglin Xu
116
10
0
01 Jan 2024
DocLLM: A layout-aware generative language model for multimodal document
  understanding
DocLLM: A layout-aware generative language model for multimodal document understanding
Dongsheng Wang
Natraj Raman
Mathieu Sibue
Zhiqiang Ma
Petr Babkin
Simerjot Kaur
Yulong Pei
Armineh Nourbakhsh
Xiaomo Liu
VLM
102
62
0
31 Dec 2023
A Generalist FaceX via Learning Unified Facial Representation
A Generalist FaceX via Learning Unified Facial Representation
Yue Han
Jiangning Zhang
Junwei Zhu
Xiangtai Li
Yanhao Ge
Wei Li
Chengjie Wang
Yong Liu
Xiaoming Liu
Ying Tai
DiffM
104
13
0
31 Dec 2023
HSC-GPT: A Large Language Model for Human Settlements Construction
HSC-GPT: A Large Language Model for Human Settlements Construction
Ran Chen
Xueqi Yao
Xuhui Jiang
Zhengqi Han
Jingze Guo
...
Chumin Liu
Jing Zhao
Zeke Lian
Jingjing Zhang
Keke Li
53
1
0
31 Dec 2023
keqing: knowledge-based question answering is a nature chain-of-thought
  mentor of LLM
keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM
Chaojie Wang
Yishi Xu
Zhong Peng
Chenxi Zhang
Bo Chen
Xinrun Wang
Lei Feng
Bo An
139
19
0
31 Dec 2023
RAGTruth: A Hallucination Corpus for Developing Trustworthy
  Retrieval-Augmented Language Models
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models
Cheng Niu
Yuanhao Wu
Juno Zhu
Siliang Xu
Kashun Shum
Randy Zhong
Juntong Song
Tong Zhang
HILM
110
109
0
31 Dec 2023
KernelGPT: Enhanced Kernel Fuzzing via Large Language Models
KernelGPT: Enhanced Kernel Fuzzing via Large Language Models
Chenyuan Yang
Zijie Zhao
Lingming Zhang
112
17
0
31 Dec 2023
Boosting Large Language Model for Speech Synthesis: An Empirical Study
Boosting Large Language Model for Speech Synthesis: An Empirical Study
Hong-ping Hao
Long Zhou
Shujie Liu
Jinyu Li
Shujie Hu
Rui Wang
Furu Wei
125
19
0
30 Dec 2023
Uncertainty-Penalized Reinforcement Learning from Human Feedback with
  Diverse Reward LoRA Ensembles
Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles
Yuanzhao Zhai
Han Zhang
Yu Lei
Yue Yu
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
AI4CE
147
35
0
30 Dec 2023
The Problem of Alignment
The Problem of Alignment
Tsvetelina Hristova
Liam Magee
K. Soldatić
58
0
0
30 Dec 2023
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained
  Language Models for Question Answering over Knowledge Graph
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph
Jinhao Jiang
Kun Zhou
Wayne Xin Zhao
Yaliang Li
Ji-Rong Wen
LRM
77
27
0
30 Dec 2023
LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning
LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning
S. P. Sharan
Francesco Pittaluga
G. VijayKumarB.
Manmohan Chandraker
LRM
127
55
0
30 Dec 2023
ConfusionPrompt: Practical Private Inference for Online Large Language
  Models
ConfusionPrompt: Practical Private Inference for Online Large Language Models
Peihua Mai
Ran Yan
Rui Ye
Youjia Yang
Yinchuan Li
Yan Pang
73
2
0
30 Dec 2023
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Julien Piet
Maha Alrashed
Chawin Sitawarin
Sizhe Chen
Zeming Wei
Elizabeth Sun
Basel Alomair
David Wagner
AAMLSyDa
154
59
0
29 Dec 2023
Large Language Models for Generative Information Extraction: A Survey
Large Language Models for Generative Information Extraction: A Survey
Derong Xu
Wei-neng Chen
Wenjun Peng
Chao Zhang
Tong Xu
Xiangyu Zhao
Xian Wu
Yefeng Zheng
Yang Wang
Enhong Chen
156
175
0
29 Dec 2023
The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems:
  A Scoping Survey
The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey
Dhruv Dhamani
Mary Lou Maher
92
1
0
29 Dec 2023
Building Efficient Universal Classifiers with Natural Language Inference
Building Efficient Universal Classifiers with Natural Language Inference
Moritz Laurer
W. Atteveldt
Andreu Casas
Kasper Welbers
98
8
0
29 Dec 2023
Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of
  LLMs
Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs
Shaojie Zhu
Zhaobin Wang
Chengxiang Zhuo
Hui Lu
Bo Hu
Zang Li
LRM
47
0
0
29 Dec 2023
Previous
123...106107108...126127128
Next