Training language models to follow instructions with human feedback

4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM, ALM

Papers citing "Training language models to follow instructions with human feedback"

50 / 6,390 papers shown
Title
Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals
Y. Gat
Nitay Calderon
Amir Feder
Alexander Chapanin
Amit Sharma
Roi Reichart
135
36
0
01 Oct 2023
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Amita Gajewar
Paul Vicol
G. Bansal
David J Fleet
128
177
0
29 Sep 2023
LoRA ensembles for large language model fine-tuning
Xi Wang
Laurence Aitchison
Maja Rudolph
UQCV
113
39
0
29 Sep 2023
Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation
Shih-Lun Wu
Xuankai Chang
Gordon Wichern
Jee-weon Jung
François G. Germain
Jonathan Le Roux
Shinji Watanabe
83
20
0
29 Sep 2023
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback
Wanpeng Zhang
Zongqing Lu
LLMAG
154
8
0
29 Sep 2023
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
375
1,924
0
28 Sep 2023
LawBench: Benchmarking Legal Knowledge of Large Language Models
Zhiwei Fei
Xiaoyu Shen
D. Zhu
Fengzhe Zhou
Zhuo Han
Songyang Zhang
Kai-xiang Chen
Zongwen Shen
Jidong Ge
ELM, AILaw
134
46
0
28 Sep 2023
Integrating LLM, EEG, and Eye-Tracking Biomarker Analysis for Word-Level Neural State Classification in Semantic Inference Reading Comprehension
Yuhong Zhang
Qin Li
Sujal Nahata
Tasnia Jamal
Shih-kuen Cheng
G. Cauwenberghs
Tzyy-Ping Jung
60
4
0
27 Sep 2023
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
Pan Zhang
Xiaoyi Wang
Bin Wang
Yuhang Cao
Chao Xu
...
Conghui He
Xingcheng Zhang
Yu Qiao
Da Lin
Jiaqi Wang
MLLM
198
241
0
26 Sep 2023
Supersonic: Learning to Generate Source Code Optimizations in C/C++
Zimin Chen
Sen Fang
Martin Monperrus
128
13
0
26 Sep 2023
MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models
Kailai Yang
Tianlin Zhang
Zi-Zhou Kuang
Qianqian Xie
Jimin Huang
Sophia Ananiadou
AI4MH
94
58
0
24 Sep 2023
Probing the Moral Development of Large Language Models through Defining Issues Test
Kumar Tanmay
Aditi Khandelwal
Utkarsh Agarwal
Monojit Choudhury
LRM
60
17
0
23 Sep 2023
Calibrating LLM-Based Evaluator
Yuxuan Liu
Tianchi Yang
Shaohan Huang
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
122
33
0
23 Sep 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions
Shasha Guo
Jing Zhang
Xirui Ke
Cuiping Li
Hong Chen
126
5
0
23 Sep 2023
Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception?
Xiaoxiao Sun
Nidham Gazagnadou
Vivek Sharma
Lingjuan Lyu
Hongdong Li
Liang Zheng
103
8
0
22 Sep 2023
Frustrated with Code Quality Issues? LLMs can Help!
Nalin Wadhwa
Jui Pradhan
Atharv Sonwane
Surya Prakash Sahu
Nagarajan Natarajan
Aditya Kanade
Suresh Parthasarathy
S. Rajamani
78
6
0
22 Sep 2023
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
Jianing Yang
Xuweiyi Chen
Shengyi Qian
Nikhil Madaan
Madhavan Iyengar
David Fouhey
Joyce Chai
LM&Ro, LLMAG
152
101
0
21 Sep 2023
Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models
Levon Haroutunian
Zhuang Li
Lucian Galescu
Philip R. Cohen
Raj Tumuluri
Gholamreza Haffari
LRM
89
1
0
21 Sep 2023
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
L. Yu
Weisen Jiang
Han Shi
Jincheng Yu
Zhengying Liu
Yu Zhang
James T. Kwok
Zheng Li
Adrian Weller
Weiyang Liu
OSLM, LRM
124
395
0
21 Sep 2023
Evaluating Large Language Models for Document-grounded Response Generation in Information-Seeking Dialogues
N. Braunschweiler
R. Doddipatla
Simon Keizer
Svetlana Stoyanchev
LM&MA
56
10
0
21 Sep 2023
The Wizard of Curiosities: Enriching Dialogues with Fun Facts
Frederico Vicente
Rafael Ferreira
David Semedo
João Magalhães
53
2
0
20 Sep 2023
XATU: A Fine-grained Instruction-based Benchmark for Explainable Text Updates
Haopeng Zhang
Hayate Iso
Sairam Gurajada
Nikita Bhutani
115
6
0
20 Sep 2023
GPT4AIGChip: Towards Next-Generation AI Accelerator Design Automation via Large Language Models
Yonggan Fu
Yongan Zhang
Zhongzhi Yu
Sixu Li
Zhifan Ye
Chaojian Li
Cheng Wan
Ying Lin
108
69
0
19 Sep 2023
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Samuel Cahyawijaya
Holy Lovenia
Fajri Koto
Dea Adhista
Emmanuel Dave
...
Genta Indra Winata
David Moeljadi
Alham Fikri Aji
Ayu Purwarianti
Pascale Fung
87
9
0
19 Sep 2023
PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems
Bryan Wilie
Yan Xu
Willy Chung
Samuel Cahyawijaya
Holy Lovenia
Pascale Fung
68
1
0
19 Sep 2023
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
Jiahao Yu
Xingwei Lin
Zheng Yu
Xinyu Xing
SILM
232
353
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELM, LRM
375
755
0
19 Sep 2023
Positive and Risky Message Assessment for Music Products
Yigeng Zhang
Mahsa Shafaei
Fabio Gonzalez
Thamar Solorio
68
3
0
18 Sep 2023
Instruction-Following Speech Recognition
Cheng-I Jeff Lai
Zhiyun Lu
Liangliang Cao
Ruoming Pang
AuLLM
80
6
0
18 Sep 2023
Pruning Large Language Models via Accuracy Predictor
Yupeng Ji
Yibo Cao
Jiu-si Liu
KELM
79
4
0
18 Sep 2023
Investigating Zero- and Few-shot Generalization in Fact Verification
Liangming Pan
Yunxiang Zhang
Min-Yen Kan
53
6
0
18 Sep 2023
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs
Juan Diego Rodriguez
Katrin Erk
Greg Durrett
94
4
0
16 Sep 2023
Reward Engineering for Generating Semi-structured Explanation
Paul Burgess
Wray Buntine
Ehsan Shareghi
LRM
65
0
0
15 Sep 2023
Investigating Answerability of LLMs for Long-Form Question Answering
Meghana Moorthy Bhat
Rui Meng
Ye Liu
Yingbo Zhou
Semih Yavuz
75
11
0
15 Sep 2023
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
Huayang Li
Siheng Li
Deng Cai
Longyue Wang
Lemao Liu
Taro Watanabe
Yujiu Yang
Shuming Shi
MLLM
145
18
0
14 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
126
81
0
13 Sep 2023
TrafficGPT: Viewing, Processing and Interacting with Traffic Foundation Models
Siyao Zhang
Daocheng Fu
Zhao Zhang
Bin Yu
Pinlong Cai
72
50
0
13 Sep 2023
Exploring Large Language Models for Ontology Alignment
Yuan He
Jiaoyan Chen
Hang Dong
Ian Horrocks
101
36
0
12 Sep 2023
Circuit Breaking: Removing Model Behaviors with Targeted Ablation
Maximilian Li
Xander Davies
Max Nadeau
KELM, MU
81
29
0
12 Sep 2023
Evaluating the Deductive Competence of Large Language Models
S. M. Seals
V. Shalin
ELM, ReLM, LRM
87
10
0
11 Sep 2023
Mitigating Word Bias in Zero-shot Prompt-based Classifiers
Adian Liusie
Potsawee Manakul
Mark Gales
59
7
0
10 Sep 2023
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models
Yangyi Chen
Karan Sikka
Michael Cogswell
Heng Ji
Ajay Divakaran
LRM
99
27
0
08 Sep 2023
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila
Changho Shin
Lin Cai
Frederic Sala
88
20
0
08 Sep 2023
OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs
Patrick Haller
Ansar Aynetdinov
Alan Akbik
81
26
0
07 Sep 2023
FLM-101B: An Open LLM and How to Train It with $100K Budget
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
155
22
0
07 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
E. Sciascio
ALM
132
30
0
07 Sep 2023
From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models
Masahiro Suzuki
Masanori Hirano
Hiroki Sakaji
102
6
0
07 Sep 2023
Framework-Based Qualitative Analysis of Free Responses of Large Language Models: Algorithmic Fidelity
A. Amirova
T. Fteropoulli
Nafiso Ahmed
Martin R. Cowie
Joel Z Leibo
89
11
0
06 Sep 2023
Persona-aware Generative Model for Code-mixed Language
Ayan Sengupta
Md. Shad Akhtar
Tanmoy Chakraborty
62
1
0
06 Sep 2023
Aligning Large Language Models for Clinical Tasks
Supun Manathunga
Isuru Hettigoda
LM&MA, ELM, AI4MH
92
11
0
06 Sep 2023