Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,398 papers shown
Title
Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation
Leonardo Pina
Yongmin Li
VGen
DiffM
97
0
0
07 Dec 2024
Beyond the Binary: Capturing Diverse Preferences With Reward Regularization
Vishakh Padmakumar
Chuanyang Jin
Hannah Rose Kirk
He He
107
6
0
05 Dec 2024
Reinforcement Learning Enhanced LLMs: A Survey
Shuhe Wang
Shengyu Zhang
Jing Zhang
Runyi Hu
Xiaoya Li
Minlie Huang
Jiwei Li
Leilei Gan
G. Wang
Eduard H. Hovy
OffRL
270
16
0
05 Dec 2024
AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning
Yiwu Zhong
Zhuoming Liu
Yin Li
Liwei Wang
153
7
0
04 Dec 2024
Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models
XiuYu Zhang
Zening Luo
AI4MH
77
1
0
04 Dec 2024
Unifying KV Cache Compression for Large Language Models with LeanKV
Yanqi Zhang
Yuwei Hu
Runyuan Zhao
John C. S. Lui
Haibo Chen
MQ
290
7
0
04 Dec 2024
Curriculum-style Data Augmentation for LLM-based Metaphor Detection
Kaidi Jia
Yanxia Wu
Rongsheng Li
Rongsheng Li
103
0
0
04 Dec 2024
Video LLMs for Temporal Reasoning in Long Videos
Fawad Javed Fateh
Umer Ahmed
Hamza Khan
M. Zia
Quoc-Huy Tran
VLM
188
1
0
04 Dec 2024
Explainable CTR Prediction via LLM Reasoning
Xiaohan Yu
Li Zhang
C. L. Philip Chen
OffRL
LRM
127
1
0
03 Dec 2024
Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach
T. T. Wang
John Hughes
Henry Sleight
Rylan Schaeffer
Rajashree Agrawal
Fazl Barez
Mrinank Sharma
Jesse Mu
Nir Shavit
Ethan Perez
AAML
134
4
0
03 Dec 2024
Time-Reversal Provides Unsupervised Feedback to LLMs
Yerram Varun
Rahul Madhavan
Sravanti Addepalli
A. Suggala
Karthikeyan Shanmugam
Prateek Jain
LRM
SyDa
123
0
0
03 Dec 2024
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Meng Cao
Haoran Tang
Haoze Zhao
Hangyu Guo
Jing Liu
Ge Zhang
Ruyang Liu
Qiang Sun
Ian Reid
Xiaodan Liang
228
3
0
02 Dec 2024
R-Bot: An LLM-based Query Rewrite System
Zhaoyan Sun
Xuanhe Zhou
Guoliang Li
114
6
0
02 Dec 2024
Yi-Lightning Technical Report
01. AI
:
Alan Wake
Albert Wang
Bei Chen
...
Yuxuan Sha
Zhaodong Yan
Zhiyuan Liu
Zirui Zhang
Zonghong Dai
OSLM
229
4
0
02 Dec 2024
Automated Extraction of Acronym-Expansion Pairs from Scientific Papers
Izhar Ali
Million Haileyesus
Serhiy Hnatyshyn
Jan-Lucas Ott
Vasil Hnatyshin
165
1
0
02 Dec 2024
Detecting Memorization in Large Language Models
Eduardo Slonski
90
0
0
02 Dec 2024
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Chunlin Yu
Hanqing Wang
Ye Shi
Haoyang Luo
Sibei Yang
Jingyi Yu
Jingya Wang
LRM
LM&Ro
234
4
0
02 Dec 2024
If Eleanor Rigby Had Met ChatGPT: A Study on Loneliness in a Post-LLM World
Adrian de Wynter
112
1
0
02 Dec 2024
Linear Probe Penalties Reduce LLM Sycophancy
Henry Papadatos
Rachel Freedman
LLMSV
121
4
0
01 Dec 2024
Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence
Qianren Mao
Weifeng Jiang
Qingbin Liu
Chenghua Lin
Qian Li
Xianqing Wen
Jianxin Li
Jinhu Lu
115
0
0
01 Dec 2024
Large Language Models in Politics and Democracy: A Comprehensive Survey
Goshi Aoki
LM&MA
AILaw
174
1
0
01 Dec 2024
VideoSAVi: Self-Aligned Video Language Models without Human Supervision
Yogesh Kulkarni
Pooyan Fazli
VLM
233
2
0
01 Dec 2024
Benchmark Real-time Adaptation and Communication Capabilities of Embodied Agent in Collaborative Scenarios
Shipeng Liu
Boshen Zhang
Zhehui Huang
98
0
0
30 Nov 2024
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation
Huadong Tang
Youpeng Zhao
Y. Huang
Min Xu
Jun Wang
Qiang Wu
MLLM
VLM
139
0
0
30 Nov 2024
PlanCritic: Formal Planning with Human Feedback
Owen Burns
Dana Hughes
Katia Sycara
OffRL
103
0
0
30 Nov 2024
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding
Jiancan Wu
Yancheng Yuan
Jinda Lu
Kai Zhang
Alex Su
Xiang Wang
Xiangnan He
MU
KELM
189
8
0
30 Nov 2024
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
Jianping Jiang
Weiye Xiao
Zhengyu Lin
Han Zhang
Tianxiang Ren
Yang Gao
Zhiqian Lin
Zhongang Cai
Lei Yang
Ziwei Liu
152
3
0
29 Nov 2024
Knowledge Management for Automobile Failure Analysis Using Graph RAG
Yuta Ojima
Hiroki Sakaji
Tadashi Nakamura
Hiroaki Sakata
Kazuya Seki
Yuu Teshigawara
Masami Yamashita
Kazuhiro Aoyama
82
0
0
29 Nov 2024
Quantized Delta Weight Is Safety Keeper
Yule Liu
Zhen Sun
Xinlei He
Xinyi Huang
146
6
0
29 Nov 2024
PEFT-as-an-Attack! Jailbreaking Language Models during Federated Parameter-Efficient Fine-Tuning
Shenghui Li
Edith C.H. Ngai
Fanghua Ye
Thiemo Voigt
SILM
206
6
0
28 Nov 2024
Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising
A. Tavanaei
Kee Kiat Koo
Hayreddin Ceker
Shaobai Jiang
Qi Li
Julien Han
Karim Bouyarmane
102
1
0
28 Nov 2024
3D-WAG: Hierarchical Wavelet-Guided Autoregressive Generation for High-Fidelity 3D Shapes
Tejaswini Medi
Arianna Rampini
Pradyumna Reddy
P. Jayaraman
Margret Keuper
DiffM
166
0
0
28 Nov 2024
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
400
4
0
28 Nov 2024
The Performance of the LSTM-based Code Generated by Large Language Models (LLMs) in Forecasting Time Series Data
Saroj Gopali
Sima Siami-Namini
Faranak Abri
A. Namin
108
3
0
27 Nov 2024
Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students
Tiffany Zhu
Kexun Zhang
William Yang Wang
SyDa
ELM
AI4Ed
98
3
0
27 Nov 2024
Neutralizing Backdoors through Information Conflicts for Large Language Models
Chen Chen
Yuchen Sun
Xueluan Gong
Jiaxin Gao
K. Lam
KELM
AAML
167
0
0
27 Nov 2024
SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment
Jie Wang
Yichen Wang
Zhilin Zhang
Jianhao Zeng
Kaidi Wang
Zhiyang Chen
168
0
0
27 Nov 2024
DRS: Deep Question Reformulation With Structured Output
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Nanyun Peng
Kai-Wei Chang
KELM
223
0
0
27 Nov 2024
Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
Jiaxin Wen
Vivek Hebbar
Caleb Larson
Aryan Bhatt
Ansh Radhakrishnan
...
Shi Feng
He He
Ethan Perez
Buck Shlegeris
Akbir Khan
AAML
127
11
0
26 Nov 2024
H
3
H^3
H
3
Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Selim Furkan Tekin
Fatih Ilhan
Tiansheng Huang
Sihao Hu
Zachary Yahn
Ling Liu
MoMe
136
3
0
26 Nov 2024
Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Changgeon Ko
Jisu Shin
Hoyun Song
Jeongyeon Seo
Jong C. Park
107
0
0
26 Nov 2024
Safe to Serve: Aligning Instruction-Tuned Models for Safety and Helpfulness
Avinash Amballa
Durga Sandeep Saluru
Gayathri Akkinapalli
Abhishek Sureddy
Akshay Kumar Sureddy
ALM
118
0
0
26 Nov 2024
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
Shijian Deng
Wentian Zhao
Yu-Jhe Li
Kun Wan
Daniel Miranda
Ajinkya Kale
Yapeng Tian
LRM
175
6
0
26 Nov 2024
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Lei Li
Y. X. Wei
Zhihui Xie
Xuqing Yang
Yifan Song
...
Tianyu Liu
Sujian Li
Bill Yuchen Lin
Dianbo Sui
Qiang Liu
VLM
CoGe
200
32
0
26 Nov 2024
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELM
AILaw
408
112
0
25 Nov 2024
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
Zhen Huang
Haoyang Zou
Xuefeng Li
Yixiu Liu
Yuxiang Zheng
Ethan Chern
Shijie Xia
Yiwei Qin
Weizhe Yuan
Pengfei Liu
VLM
135
52
0
25 Nov 2024
Can AI grade your essays? A comparative analysis of large language models and teacher ratings in multidimensional essay scoring
Kathrin Seßler
Maurice Fürstenberg
B. Bühler
Enkelejda Kasneci
AI4Ed
ELM
110
4
0
25 Nov 2024
LLM Augmentations to support Analytical Reasoning over Multiple Documents
Raquib Bin Yousuf
Nicholas Defelice
Mandar Sharma
Shengzhe Xu
Naren Ramakrishnan
94
2
0
25 Nov 2024
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
Yongwei Chen
Yushi Lan
Shangchen Zhou
Tengfei Wang
Xingang Pan
275
6
0
25 Nov 2024
Interpreting Language Reward Models via Contrastive Explanations
Junqi Jiang
Tom Bewley
Saumitra Mishra
Freddy Lecue
Manuela Veloso
175
2
0
25 Nov 2024
Previous
1
2
3
...
36
37
38
...
126
127
128
Next