Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,398 papers shown
Title
Self-Generated Critiques Boost Reward Modeling for Language Models
Yue Yu
Zhengxing Chen
Aston Zhang
L Tan
Chenguang Zhu
...
Suchin Gururangan
Chao-Yue Zhang
Melanie Kambadur
Dhruv Mahajan
Rui Hou
LRM
ALM
212
27
0
25 Nov 2024
Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown
Lifu Tu
Rui Meng
Shafiq Joty
Yingbo Zhou
Semih Yavuz
HILM
132
1
0
24 Nov 2024
Ruppert-Polyak averaging for Stochastic Order Oracle
V. N. Smirnov
K. M. Kazistova
I. A. Sudakov
V. Leplat
A. V. Gasnikov
A. V. Lobanov
76
0
0
24 Nov 2024
Decoding Urban Industrial Complexity: Enhancing Knowledge-Driven Insights via IndustryScopeGPT
Siqi Wang
Chao Liang
Yunfan Gao
Yebin Liu
Jing Li
Haoyu Wang
AI4CE
145
3
0
24 Nov 2024
Development of Pre-Trained Transformer-based Models for the Nepali Language
Prajwal Thapa
Jinu Nyachhyon
Mridul Sharma
Bal Krishna Bal
123
1
0
24 Nov 2024
State-Space Large Audio Language Models
Saurabhchand Bhati
Yuan Gong
Leonid Karlinsky
Hilde Kuehne
Rogerio Feris
James Glass
156
1
0
24 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yize Zhang
Min Li
178
1
0
23 Nov 2024
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy
Te Yang
Jian Jia
Xiangyu Zhu
Weisong Zhao
Bo Wang
...
Shengyuan Liu
Quan Chen
Peng Jiang
Kun Gai
Zhen Lei
86
1
0
23 Nov 2024
From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
M. Finkelstein
Dan Deutsch
Parker Riley
Juraj Juraska
Geza Kovacs
Markus Freitag
128
0
0
23 Nov 2024
Locating the Leading Edge of Cultural Change
Sarah Griebel
Becca Cohen
Lucian Li
Jaihyun Park
Jiayu Liu
Jana Perkins
Ted Underwood
93
1
0
22 Nov 2024
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
Junzhe Chen
Tianshu Zhang
Shijie Huang
Yuwei Niu
Linfeng Zhang
Lijie Wen
Xuming Hu
MLLM
VLM
513
6
0
22 Nov 2024
On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
Elita Lobo
Chirag Agarwal
Himabindu Lakkaraju
LRM
176
10
0
22 Nov 2024
Learning from "Silly" Questions Improves Large Language Models, But Only Slightly
Tingyuan Zhu
Shudong Liu
Yidong Wang
Derek F. Wong
Han Yu
T. Shinozaki
Jindong Wang
ALM
LRM
100
0
0
21 Nov 2024
Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned Policies
Frédéric Berdoz
Roger Wattenhofer
133
0
0
21 Nov 2024
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Song Jiang
Da JU
Andrew Cohen
Sasha Mitts
Aaron Foss
Justine T Kao
Xian Li
Yuandong Tian
141
3
0
21 Nov 2024
AdaptAgent: Adapting Multimodal Web Agents with Few-Shot Learning from Human Demonstrations
Gaurav Verma
Rachneet Kaur
Nishan Srishankar
Zhen Zeng
T. Balch
Manuela Veloso
LLMAG
118
6
0
20 Nov 2024
Combining Autoregressive and Autoencoder Language Models for Text Classification
João Gonçalves
119
0
0
20 Nov 2024
I Can Tell What I am Doing: Toward Real-World Natural Language Grounding of Robot Experiences
Zihan Wang
Brian Liang
Varad Dhat
Zander Brumbaugh
Nick Walker
Ranjay Krishna
Maya Cakmak
112
5
0
20 Nov 2024
Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control
Yunkee Chae
Eunsik Shin
Hwang Suntae
Seungryeol Paik
Kyogu Lee
122
1
0
20 Nov 2024
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection
Gabriel Chua
Shing Yee Chan
Shaun Khoo
204
1
0
20 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
218
22
0
20 Nov 2024
MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices
Mohammadali Shakerdargah
Shan Lu
Chao Gao
Di Niu
176
0
0
20 Nov 2024
Engagement-Driven Content Generation with Large Language Models
Simone Mungari
Federico Cinus
Marco Minici
Francesco Bonchi
Giuseppe Manco
170
1
0
20 Nov 2024
Reward Modeling with Ordinal Feedback: Wisdom of the Crowd
Shang Liu
Yu Pan
Guanting Chen
Xiaocheng Li
127
3
0
19 Nov 2024
Generative Timelines for Instructed Visual Assembly
Alejandro Pardo
Jui-hsien Wang
Guohao Li
Josef Sivic
Bryan C. Russell
Fabian Caba Heilbron
VGen
110
0
0
19 Nov 2024
Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets
Ike Obi
Rohan Pant
Srishti Shekhar Agrawal
Maham Ghazanfar
Aaron Basiletti
91
2
0
18 Nov 2024
No-regret Exploration in Shuffle Private Reinforcement Learning
Shaojie Bai
Mohammad Sadegh Talebi
Chengcheng Zhao
Peng Cheng
Jiming Chen
OffRL
120
0
0
18 Nov 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
...
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
ALM
188
5
0
18 Nov 2024
The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Xikang Yang
Xuehai Tang
Jizhong Han
Songlin Hu
118
0
0
18 Nov 2024
PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback
Yun Peng
Akhilesh Deepak Gotmare
Michael R. Lyu
Caiming Xiong
Silvio Savarese
Doyen Sahoo
97
0
0
18 Nov 2024
TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation
Ranmin Wang
Limin Zhuang
Hongkun Chen
Boyan Xu
Ruichu Cai
70
0
0
18 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
91
8
0
18 Nov 2024
SEFD: Semantic-Enhanced Framework for Detecting LLM-Generated Text
Weiqing He
Bojian Hou
Tianqi Shang
Davoud Ataee Tarzanagh
Qi Long
Li Shen
DeLMO
133
1
0
17 Nov 2024
SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization
Hongrui Jia
Chaoya Jiang
Haiyang Xu
Wei Ye
Mengfan Dong
Ming Yan
Ji Zhang
Fei Huang
Shikun Zhang
MLLM
149
3
0
17 Nov 2024
Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
Zeping Yu
Sophia Ananiadou
495
2
0
17 Nov 2024
SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment
Quan Ze Chen
K. J. Kevin Feng
Chan Young Park
Amy X. Zhang
74
0
0
16 Nov 2024
I'm Spartacus, No, I'm Spartacus: Measuring and Understanding LLM Identity Confusion
Kun Li
Shichao Zhuang
Yue Zhang
Minghui Xu
Ruoxi Wang
Kaidi Xu
Xinwen Fu
Xiuzhen Cheng
151
0
0
16 Nov 2024
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines
Yixiang Chen
Xinyu Zhang
Jinran Wang
Xurong Xie
Nan Yan
Hui Chen
Lan Wang
AI4MH
70
3
0
16 Nov 2024
Does Prompt Formatting Have Any Impact on LLM Performance?
Jia He
Mukund Rungta
David Koleczek
Arshdeep Sekhon
Franklin X Wang
Sadid Hasan
LLMAG
LRM
115
59
0
15 Nov 2024
Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment
Andrew Konya
Aviv Ovadya
K. J. Kevin Feng
Quan Ze Chen
Lisa Schirch
Colin Irwin
Amy X. Zhang
ALM
105
2
0
15 Nov 2024
Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations
Jianfeng Chi
Ujjwal Karn
Hongyuan Zhan
Eric Michael Smith
Javier Rando
Yiming Zhang
Kate Plawiak
Zacharie Delpierre Coudert
Kartikeya Upasani
Mahesh Pasupuleti
MLLM
3DH
126
33
0
15 Nov 2024
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
Michael Aerni
Javier Rando
Edoardo Debenedetti
Nicholas Carlini
Daphne Ippolito
F. Tramèr
75
5
0
15 Nov 2024
Visual question answering based evaluation metrics for text-to-image generation
Mizuki Miyamoto
Ryugo Morita
Jinjia Zhou
EGVM
112
1
0
15 Nov 2024
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Yutao Hou
Yajing Luo
Zhiwen Ruan
Hongru Wang
Weifeng Ge
Yuxiao Chen
Guanhua Chen
ELM
84
0
0
15 Nov 2024
Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Md. Asif Haider
Ayesha Binte Mostofa
Sk. Sabit Bin Mosaddek
Anindya Iqbal
Toufique Ahmed
ALM
96
3
0
15 Nov 2024
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Weiyun Wang
Zhe Chen
Wenhai Wang
Yue Cao
Yangzhou Liu
...
Jinguo Zhu
X. Zhu
Lewei Lu
Yu Qiao
Jifeng Dai
LRM
165
93
1
15 Nov 2024
Efficient Alignment of Large Language Models via Data Sampling
Amrit Khera
Rajat Ghosh
Debojyoti Dutta
182
1
0
15 Nov 2024
PTR: Precision-Driven Tool Recommendation for Large Language Models
Hang Gao
Yongfeng Zhang
KELM
80
0
0
14 Nov 2024
A Practical Guide to Fine-tuning Language Models with Limited Data
Márton Szép
Daniel Rueckert
Rüdiger von Eisenhart-Rothe
Florian Hinterwimmer
SyDa
ALM
137
2
0
14 Nov 2024
DROJ: A Prompt-Driven Attack against Large Language Models
Leyang Hu
Boran Wang
57
0
0
14 Nov 2024
Previous
1
2
3
...
37
38
39
...
126
127
128
Next