Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,380 papers shown
Title
Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
Donggyun Kim
Seongwoong Cho
Semin Kim
Chong Luo
Seunghoon Hong
VLM
88
3
0
29 Apr 2024
A Framework for Real-time Safeguarding the Text Generation of Large Language Model
Ximing Dong
Dayi Lin
Shaowei Wang
Ahmed E. Hassan
133
1
0
29 Apr 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
Han Zhong
Zikang Shan
Guhao Feng
Wei Xiong
Xinle Cheng
Li Zhao
Di He
Jiang Bian
Liwei Wang
155
72
0
29 Apr 2024
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLM
LRM
283
197
0
29 Apr 2024
Towards Incremental Learning in Large Language Models: A Critical Review
M. Jovanovic
Peter Voss
ELM
CLL
KELM
116
5
0
28 Apr 2024
PatentGPT: A Large Language Model for Intellectual Property
Zilong Bai
Ruiji Zhang
Linqing Chen
Qijun Cai
Yuan Zhong
...
Fu Bian
Xiaolong Gu
Lisha Zhang
Weilei Wang
Changyang Tu
105
5
0
28 Apr 2024
From Persona to Personalization: A Survey on Role-Playing Language Agents
Jiangjie Chen
Xintao Wang
Rui Xu
Siyu Yuan
Yikai Zhang
...
Caiyu Hu
Siye Wu
Scott Ren
Ziquan Fu
Yanghua Xiao
137
98
0
28 Apr 2024
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model
Zhengpeng Shi
Haoran Luo
LRM
ALM
93
2
0
28 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
135
16
0
28 Apr 2024
SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models
M. Kapadnis
Sohan Patnaik
Abhilash Nandy
Sourjyadip Ray
Pawan Goyal
Debdoot Sheet
VLM
75
5
0
27 Apr 2024
VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition
Junyi Bian
W. Zhai
Xiaodi Huang
Jiaxuan Zheng
Shanfeng Zhu
105
3
0
27 Apr 2024
Evaluation of Few-Shot Learning for Classification Tasks in the Polish Language
Tsimur Hadeliya
D. Kajtoch
117
1
0
27 Apr 2024
Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning
Dapeng Li
Hang Dong
Lu Wang
Bo Qiao
Si Qin
...
Dongmei Zhang
Qi Zhang
Zhiwei Xu
Bin Zhang
Guoliang Fan
90
5
0
27 Apr 2024
Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo
Stephen Zhao
Rob Brekelmans
Alireza Makhzani
Roger C. Grosse
89
41
0
26 Apr 2024
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
92
3
0
26 Apr 2024
Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM
Xuan Zhang
Wei Gao
LRM
KELM
97
10
0
26 Apr 2024
Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study
Yang Wu
Yao Wan
Hongyu Zhang
Yulei Sui
Wucai Wei
Wei Zhao
Guandong Xu
Hai Jin
58
25
0
26 Apr 2024
Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs
Valeriia Cherepanova
James Zou
AAML
102
6
0
26 Apr 2024
Near to Mid-term Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksandar Petrov
Bertie Vidgen
Christian Schroeder de Witt
Fabio Pizzati
...
Paul Röttger
Philip Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
117
8
0
25 Apr 2024
Continual Learning of Large Language Models: A Comprehensive Survey
Haizhou Shi
Zihao Xu
Hengyi Wang
Weiyi Qin
Wenyuan Wang
Yibin Wang
Zifeng Wang
Sayna Ebrahimi
Hao Wang
CLL
KELM
LRM
162
88
0
25 Apr 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao
Jonathan D. Chang
Wenhao Zhan
Owen Oertell
Gokul Swamy
Kianté Brantley
Thorsten Joachims
J. Andrew Bagnell
Jason D. Lee
Wen Sun
OffRL
85
41
0
25 Apr 2024
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan
Xinyi Yang
Derek F. Wong
Lidia S. Chao
Yue Zhang
128
10
0
25 Apr 2024
Tele-FLM Technical Report
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Chao Wang
...
Yequan Wang
Zhongjiang He
Zhongyuan Wang
Xuelong Li
Tiejun Huang
81
4
0
25 Apr 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning
Liang Zhang
Anwen Hu
Haiyang Xu
Mingshi Yan
Yichen Xu
Qin Jin
Ji Zhang
Fei Huang
109
17
0
25 Apr 2024
Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare
Emre Can Acikgoz
Osman Batur .Ince
Rayene Bench
Arda Anil Boz
.Ilker Kesen
Aykut Erdem
Erkut Erdem
LM&MA
79
10
0
25 Apr 2024
Building a Japanese Document-Level Relation Extraction Dataset Assisted by Cross-Lingual Transfer
Youmi Ma
An Wang
Naoaki Okazaki
83
0
0
25 Apr 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Pengnian Qi
Zhiyu Li
133
10
0
25 Apr 2024
Don't Say No: Jailbreaking LLM by Suppressing Refusal
Yukai Zhou
Jian Lou
Zhijie Huang
Zhan Qin
Yibei Yang
Wenjie Wang
AAML
116
19
0
25 Apr 2024
Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Jiaqing Yuan
Lin Pan
Chung-Wei Hang
Jiang Guo
Jiarong Jiang
Bonan Min
Patrick Ng
Zhiguo Wang
HILM
ELM
87
4
0
24 Apr 2024
Domain-Specific Improvement on Psychotherapy Chatbot Using Assistant
Cheng Kang
Daniel Novak
Kateřina Urbanová
Yuqing Cheng
Yong Hu
AI4MH
LM&MA
40
3
0
24 Apr 2024
Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
Timin Gao
Peixian Chen
Mengdan Zhang
Chaoyou Fu
Yunhang Shen
...
Shengchuan Zhang
Xiawu Zheng
Xing Sun
Liujuan Cao
Rongrong Ji
MLLM
LRM
123
22
0
24 Apr 2024
Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach
Linyu Liu
Yu Pan
Xiaocheng Li
Guanting Chen
105
39
0
24 Apr 2024
Assessing The Potential Of Mid-Sized Language Models For Clinical QA
Elliot Bolton
Betty Xiong
Vijaytha Muralidharan
J. Schamroth
Vivek Muralidharan
Christopher D. Manning
R. Daneshjou
AI4MH
ELM
LM&MA
44
4
0
24 Apr 2024
A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry
Yining Huang
Keke Tang
Meilian Chen
Boyuan Wang
ELM
LM&MA
72
15
0
24 Apr 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
105
6
0
24 Apr 2024
Return of EM: Entity-driven Answer Set Expansion for QA Evaluation
Dongryeol Lee
Minwoo Lee
Kyungmin Min
Joonsuk Park
Kyomin Jung
84
1
0
24 Apr 2024
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia
Rui Wang
Xu Liu
Mingyan Li
Tong Yu
Xiang Chen
Julian McAuley
Shuai Li
LRM
128
22
0
24 Apr 2024
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Mihir Parmar
Nisarg Patel
Neeraj Varshney
Mutsumi Nakamura
Man Luo
Santosh Mashetty
Arindam Mitra
Chitta Baral
LRM
ReLM
ELM
215
31
0
23 Apr 2024
IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents
Jean-Philippe Corbeil
90
3
0
23 Apr 2024
Interactive Analysis of LLMs using Meaningful Counterfactuals
Furui Cheng
Vilém Zouhar
Robin Shing Moon Chan
Daniel Fürst
Hendrik Strobelt
Mennatallah El-Assady
132
11
0
23 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
100
13
0
23 Apr 2024
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
Davide Caffagni
Federico Cocchi
Nicholas Moratelli
Sara Sarto
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
KELM
118
47
0
23 Apr 2024
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models
Wanrong Zhu
Jennifer Healey
Ruiyi Zhang
William Y. Wang
Tong Sun
3DV
51
2
0
23 Apr 2024
Aligning LLM Agents by Learning Latent Preference from User Edits
Ge Gao
Alexey Taymanov
Eduardo Salinas
Paul Mineiro
Dipendra Kumar Misra
LLMAG
94
31
0
23 Apr 2024
Re-Thinking Inverse Graphics With Large Language Models
Peter Kulits
Haiwen Feng
Weiyang Liu
Victoria Fernandez-Abrevaya
Michael J. Black
AI4CE
101
9
0
23 Apr 2024
Does Instruction Tuning Make LLMs More Consistent?
Constanza Fierro
Jiaang Li
Anders Sogaard
LRM
104
2
0
23 Apr 2024
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Xun Wu
Shaohan Huang
Furu Wei
85
10
0
23 Apr 2024
Pillars of Grammatical Error Correction: Comprehensive Inspection Of Contemporary Approaches In The Era of Large Language Models
Kostiantyn Omelianchuk
Andrii Liubonko
Oleksandr Skurzhanskyi
Artem Chernodub
Oleksandr Korniienko
Igor Samokhin
86
2
0
23 Apr 2024
FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering
Siqi Ping
Yuzhu Mao
Yang Liu
Xiao-Ping Zhang
Wenbo Ding
FedML
83
4
0
23 Apr 2024
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
OffRL
76
5
0
23 Apr 2024
Previous
1
2
3
...
79
80
81
...
126
127
128
Next