ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.02155
  4. Cited By
Training language models to follow instructions with human feedback

Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
    OSLMALM
ArXiv (abs)PDFHTML

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,386 papers shown
Title
Decoding Continuous Character-based Language from Non-invasive Brain
  Recordings
Decoding Continuous Character-based Language from Non-invasive Brain Recordings
Cenyuan Zhang
Xiaoqing Zheng
Ruicheng Yin
Shujie Geng
Jianhan Xu
...
Changze Lv
Zixuan Ling
Xuanjing Huang
Miao Cao
Jianfeng Feng
105
0
0
17 Mar 2024
Scaling Data Diversity for Fine-Tuning Language Models in Human
  Alignment
Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Feifan Song
Bowen Yu
Hao Lang
Haiyang Yu
Fei Huang
Houfeng Wang
Yongbin Li
ALM
86
15
0
17 Mar 2024
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with
  Self-Reflexive Large Language Models
ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
Yuzhao Heng
Chun-Ying Deng
Yitong Li
Yue Yu
Yinghao Li
Rongzhi Zhang
Chao Zhang
86
6
0
17 Mar 2024
Reward Guided Latent Consistency Distillation
Reward Guided Latent Consistency Distillation
Jiachen Li
Weixi Feng
Wenhu Chen
William Y. Wang
EGVM
86
15
0
16 Mar 2024
Optimizing Language Augmentation for Multilingual Large Language Models:
  A Case Study on Korean
Optimizing Language Augmentation for Multilingual Large Language Models: A Case Study on Korean
Changsu Choi
Yongbin Jeong
Seoyoon Park
Inho Won
HyeonSeok Lim
...
Yiseul Lee
HyeJin Lee
Younggyun Hahm
Hansaem Kim
Kyungtae Lim
60
13
0
16 Mar 2024
A Comprehensive Study of Multimodal Large Language Models for Image
  Quality Assessment
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment
Tianhe Wu
Kede Ma
Jie Liang
Yujiu Yang
Lei Zhang
75
26
0
16 Mar 2024
VideoAgent: Long-form Video Understanding with Large Language Model as
  Agent
VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Xiaohan Wang
Yuhui Zhang
Orr Zohar
Serena Yeung-Levy
VLM
209
107
0
15 Mar 2024
Mitigating Dialogue Hallucination for Large Vision Language Models via
  Adversarial Instruction Tuning
Mitigating Dialogue Hallucination for Large Vision Language Models via Adversarial Instruction Tuning
Dongmin Park
Zhaofang Qian
Guangxing Han
Ser-Nam Lim
MLLM
84
0
0
15 Mar 2024
Uni-SMART: Universal Science Multimodal Analysis and Research
  Transformer
Uni-SMART: Universal Science Multimodal Analysis and Research Transformer
Hengxing Cai
Xiaochen Cai
Shuwen Yang
Jiankun Wang
Lin Yao
...
Mujie Lin
Yaqi Li
Yuqi Yin
Linfeng Zhang
Guolin Ke
OffRL
52
1
0
15 Mar 2024
RAFT: Adapting Language Model to Domain Specific RAG
RAFT: Adapting Language Model to Domain Specific RAG
Tianjun Zhang
Shishir G. Patil
Naman Jain
Sheng Shen
Matei A. Zaharia
Ion Stoica
Joseph E. Gonzalez
RALM
108
213
0
15 Mar 2024
CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a
  Cross-level Manner
CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner
Tingbing Yan
Wenzheng Zeng
Yang Xiao
Xingyu Tong
Bo Tan
Zhiwen Fang
Zhiguo Cao
Qiufeng Wang
84
7
0
15 Mar 2024
Knowledge Condensation and Reasoning for Knowledge-based VQA
Knowledge Condensation and Reasoning for Knowledge-based VQA
Dongze Hao
Jian Jia
Longteng Guo
Qunbo Wang
Te Yang
...
Yanhua Cheng
Bo Wang
Quan Chen
Han Li
Jing Liu
84
1
0
15 Mar 2024
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Yongquan He
Wenyuan Zhang
Xuancheng Huang
Peng Zhang
Lingxun Meng
Wei Lin
Wenyuan Zhang
Yifu Gao
CLLALM
156
5
0
15 Mar 2024
Scaling Behavior of Machine Translation with Large Language Models under
  Prompt Injection Attacks
Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks
Zhifan Sun
Antonio Valerio Miceli Barone
41
2
0
14 Mar 2024
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models
Yifan Li
Hangyu Guo
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
134
56
0
14 Mar 2024
Generalized Predictive Model for Autonomous Driving
Generalized Predictive Model for Autonomous Driving
Jiazhi Yang
Shenyuan Gao
Yihang Qiu
Li Chen
Tianyu Li
...
Ping Luo
Jun Zhang
Andreas Geiger
Yu Qiao
Hongyang Li
VGen
135
76
0
14 Mar 2024
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text
  Transformation
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Yunhao Gou
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
James T. Kwok
Yu Zhang
MLLM
125
52
0
14 Mar 2024
Clinical Reasoning over Tabular Data and Text with Bayesian Networks
Clinical Reasoning over Tabular Data and Text with Bayesian Networks
Paloma Rabaey
Johannes Deleu
Stefan Heytens
Thomas Demeester
69
5
0
14 Mar 2024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Zhiqing Sun
Longhui Yu
Yikang Shen
Weiyang Liu
Yiming Yang
Sean Welleck
Chuang Gan
93
69
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language
  Interface
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
103
13
0
14 Mar 2024
Dial-insight: Fine-tuning Large Language Models with High-Quality
  Domain-Specific Data Preventing Capability Collapse
Dial-insight: Fine-tuning Large Language Models with High-Quality Domain-Specific Data Preventing Capability Collapse
Jianwei Sun
Chaoyang Mei
Linlin Wei
Kaiyu Zheng
Na Liu
Ming Cui
Tianyi Li
ALM
95
4
0
14 Mar 2024
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine
  Knowledge
Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge
Yizhen Li
Shaohan Huang
Jiaxing Qi
Lei Quan
Dongran Han
Zhongzhi Luan
LM&MAAI4MH
48
5
0
14 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language
  Models
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLMMLLM
79
9
0
14 Mar 2024
CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language
  Models to Coding Preferences
CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding Preferences
Martin Weyssow
Aton Kamanda
H. Sahraoui
ALM
116
38
0
14 Mar 2024
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
Ahmed Masry
Mehrad Shahmohammadi
Md. Rizwan Parvez
Enamul Hoque
Shafiq Joty
107
37
0
14 Mar 2024
AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic
AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic
Emad A. Alghamdi
Reem I. Masoud
Deema Alnuhait
Afnan Y. Alomairi
Ahmed Ashraf
Mohamed Zaytoon
84
6
0
14 Mar 2024
Teaching Machines to Code: Smart Contract Translation with LLMs
Teaching Machines to Code: Smart Contract Translation with LLMs
Rabimba Karanjai
Lei Xu
Weidong Shi
69
6
0
13 Mar 2024
Strengthening Multimodal Large Language Model with Bootstrapped
  Preference Optimization
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Renjie Pi
Tianyang Han
Wei Xiong
Jipeng Zhang
Runtao Liu
Boyao Wang
Tong Zhang
MLLM
141
48
0
13 Mar 2024
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language
  Agents
SOTOPIA-πππ: Interactive Learning of Socially Intelligent Language Agents
Ruiyi Wang
Haofei Yu
W. Zhang
Zhengyang Qi
Maarten Sap
Graham Neubig
Yonatan Bisk
Hao Zhu
LLMAG
121
44
0
13 Mar 2024
Human Alignment of Large Language Models through Online Preference
  Optimisation
Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello
Daniel Guo
Rémi Munos
Mark Rowland
Yunhao Tang
...
Michal Valko
Tianqi Liu
Rishabh Joshi
Zeyu Zheng
Bilal Piot
110
67
0
13 Mar 2024
Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over
  Structured Environments
Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments
Sitao Cheng
Ziyuan Zhuang
Yong Xu
Fangkai Yang
Chaoyun Zhang
...
Ling Chen
Qingwei Lin
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
KELMLLMAGLRM
77
19
0
13 Mar 2024
Model Will Tell: Training Membership Inference for Diffusion Models
Model Will Tell: Training Membership Inference for Diffusion Models
Xiaomeng Fu
Xi Wang
Qiao Li
Jin Liu
Jiao Dai
Jizhong Han
103
5
0
13 Mar 2024
Specification Overfitting in Artificial Intelligence
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
235
1
0
13 Mar 2024
Tastle: Distract Large Language Models for Automatic Jailbreak Attack
Tastle: Distract Large Language Models for Automatic Jailbreak Attack
Zeguan Xiao
Yan Yang
Guanhua Chen
Yun-Nung Chen
AAML
90
27
0
13 Mar 2024
LLM-Assisted Light: Leveraging Large Language Model Capabilities for
  Human-Mimetic Traffic Signal Control in Complex Urban Environments
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments
Maonan Wang
Aoyu Pang
Yuheng Kan
Man-On Pun
Chung Shue Chen
Bo Huang
111
22
0
13 Mar 2024
Knowledge Conflicts for LLMs: A Survey
Knowledge Conflicts for LLMs: A Survey
Rongwu Xu
Zehan Qi
Zhijiang Guo
Cunxiang Wang
Hongru Wang
Yue Zhang
Wei Xu
313
122
0
13 Mar 2024
OverleafCopilot: Empowering Academic Writing in Overleaf with Large
  Language Models
OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language Models
Haomin Wen
Zhenjie Wei
Yan Lin
Jiyuan Wang
Yuxuan Liang
Huaiyu Wan
37
1
0
13 Mar 2024
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain
  Reinforcement Learning From AI Feedback
HRLAIF: Improvements in Helpfulness and Harmlessness in Open-domain Reinforcement Learning From AI Feedback
Ang Li
Qiugen Xiao
Peng Cao
Jian Tang
Yi Yuan
...
Weidong Guo
Yukang Gan
Jeffrey Xu Yu
D. Wang
Ying Shan
VLMALM
93
10
0
13 Mar 2024
AutoDev: Automated AI-Driven Development
AutoDev: Automated AI-Driven Development
Michele Tufano
Anisha Agarwal
Jinu Jang
Roshanak Zilouchian Moghaddam
Neel Sundaresan
80
21
0
13 Mar 2024
Gemma: Open Models Based on Gemini Research and Technology
Gemma: Open Models Based on Gemini Research and Technology
Gemma Team
Gemma Team Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
...
Armand Joulin
Noah Fiedel
Evan Senter
Alek Andreev
Kathleen Kenealy
VLMLLMAG
245
515
0
13 Mar 2024
Generative Pretrained Structured Transformers: Unsupervised Syntactic
  Language Models at Scale
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale
Xiang Hu
Pengyu Ji
Qingyang Zhu
Wei Wu
Kewei Tu
LRM
77
5
0
13 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly
  Specialized Language Models
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALMMoMeAI4CE
152
7
0
13 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
101
14
0
13 Mar 2024
Deep Submodular Peripteral Networks
Deep Submodular Peripteral Networks
Gantavya Bhatt
Arnav M. Das
Jeff Bilmes
87
1
0
13 Mar 2024
Large Language Models are Contrastive Reasoners
Large Language Models are Contrastive Reasoners
Liang Yao
ReLMELMLRM
110
3
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object
  Detection
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjDMLLMVLM
106
18
0
12 Mar 2024
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial
  Creation on Physical Tasks
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks
Yuexi Chen
Vlad I. Morariu
Anh Truong
Zhicheng Liu
DiffMVGen
72
5
0
12 Mar 2024
Do Agents Dream of Electric Sheep?: Improving Generalization in
  Reinforcement Learning through Generative Learning
Do Agents Dream of Electric Sheep?: Improving Generalization in Reinforcement Learning through Generative Learning
Giorgio Franceschelli
Mirco Musolesi
AI4CE
89
0
0
12 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLMVLM
112
22
0
12 Mar 2024
Rethinking Generative Large Language Model Evaluation for Semantic
  Comprehension
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
Fangyun Wei
Xi Chen
Linzi Luo
ELMALMLRM
63
8
0
12 Mar 2024
Previous
123...888990...126127128
Next