2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Papers citing
"Training language models to follow instructions with human feedback"
50 / 6,398 papers shown
Title
Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models
Somanshu Singla
Zhen Wang
Tianyang Liu
Abdullah Ashfaq
Zhiting Hu
Eric Xing
75
2
0
13 Nov 2024
NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation
Youzhi Liu
Fanglong Yao
Yuanchang Yue
Guangluan Xu
Xian Sun
Kun Fu
LM&Ro
109
3
0
13 Nov 2024
ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening
Hojun Jang
Y. Kim
3DH
86
0
0
13 Nov 2024
PyGen: A Collaborative Human-AI Approach to Python Package Creation
Saikat Barua
Mostafizur Rahman
Md Jafor Sadek
Rafiul Islam
Shehnaz Khaled
Md. Shohrab Hossain
138
2
0
13 Nov 2024
New Emerged Security and Privacy of Pre-trained Model: a Survey and Outlook
Meng Yang
Tianqing Zhu
Chi Liu
Wanlei Zhou
Shui Yu
Philip S. Yu
AAML
ELM
PILM
112
1
0
12 Nov 2024
A Survey on Adversarial Machine Learning for Code Data: Realistic Threats, Countermeasures, and Interpretations
Yulong Yang
Haoran Fan
Chenhao Lin
Qian Li
Zhengyu Zhao
Chao Shen
Xiaohong Guan
AAML
85
0
0
12 Nov 2024
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Khaoula Chehbouni
Jonathan Colaço-Carr
Yash More
Jackie CK Cheung
G. Farnadi
181
1
0
12 Nov 2024
SetLexSem Challenge: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models
Bardiya Akhbari
Manish Gawali
Nicholas A. Dronen
AAML
120
0
0
11 Nov 2024
SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs
Ruben Härle
Felix Friedrich
Manuel Brack
Bjorn Deiseroth
P. Schramowski
Kristian Kersting
80
2
0
11 Nov 2024
On Active Privacy Auditing in Supervised Fine-tuning for White-Box Language Models
Qian Sun
Hanpeng Wu
Xi Sheryl Zhang
79
0
0
11 Nov 2024
Evaluating Large Language Models on Financial Report Summarization: An Empirical Study
Xinqi Yang
Scott Zang
Yong Ren
Dingjie Peng
Zheng Wen
66
1
0
11 Nov 2024
HarmLevelBench: Evaluating Harm-Level Compliance and the Impact of Quantization on Model Alignment
Yannis Belkhiter
Giulio Zizzo
S. Maffeis
70
3
0
11 Nov 2024
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant
Yujia Zhou
Zheng Liu
Zhicheng Dou
AIFin
LRM
RALM
56
2
0
11 Nov 2024
Script-Strategy Aligned Generation: Aligning LLMs with Expert-Crafted Dialogue Scripts and Therapeutic Strategies for Psychotherapy
Xin Sun
Jan de Wit
Zhuying Li
Jiahuan Pei
Abdallah El Ali
Jos A. Bosch
115
2
0
11 Nov 2024
Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models
Yeming Wen
Swarat Chaudhuri
96
0
0
11 Nov 2024
Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models
Xiaojun Wu
Junxi Liu
Huanyi Su
Zhouchi Lin
Yiyan Qi
...
Fuwei Wang
Saizhuo Wang
Fengrui Hua
Jia Li
Jian Guo
109
2
0
09 Nov 2024
Detecting Reference Errors in Scientific Literature with Large Language Models
Tianmai M. Zhang
Neil F. Abernethy
33
0
0
09 Nov 2024
CoPrompter: User-Centric Evaluation of LLM Instruction Alignment for Improved Prompt Engineering
Ishika Joshi
Simra Shahid
Shreeya Venneti
Manushree Vasu
Yantao Zheng
Yunyao Li
Balaji Krishnamurthy
Gromit Yeuk-Yin Chan
103
4
0
09 Nov 2024
Towards Low-Resource Harmful Meme Detection with LMM Agents
Jianzhao Huang
Hongzhan Lin
Ziyan Liu
Ziyang Luo
Guang Chen
Jing Ma
80
6
0
08 Nov 2024
Fine-Grained Reward Optimization for Machine Translation using Error Severity Mappings
Miguel Moura Ramos
Tomás Almeida
Daniel Vareta
Filipe Azevedo
Sweta Agrawal
Patrick Fernandes
André F. T. Martins
135
4
0
08 Nov 2024
RT-Grasp: Reasoning Tuning Robotic Grasping via Multi-modal Large Language Model
Jinxuan Xu
Shiyu Jin
Yutian Lei
Yuqian Zhang
Liangjun Zhang
LRM
84
7
0
07 Nov 2024
LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG
Laifa Tao
Qixuan Huang
Xianjun Wu
Weiwei Zhang
Yunlong Wu
Bin Li
Chen Lu
Xingshuo Hai
91
0
0
07 Nov 2024
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Jierui Li
Hung Le
Yingbo Zhou
Caiming Xiong
Silvio Savarese
Doyen Sahoo
LLMAG
95
8
0
07 Nov 2024
Orbit: A Framework for Designing and Evaluating Multi-objective Rankers
Chenyang Yang
Tesi Xiao
Michael Shavlovsky
Christian Kastner
Tongshuang Wu
90
0
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
234
6
0
07 Nov 2024
DELIFT: Data Efficient Language model Instruction Fine Tuning
Ishika Agarwal
Krishnateja Killamsetty
Lucian Popa
Marina Danilevsky
ALM
VLM
149
4
0
07 Nov 2024
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
Shehan Munasinghe
Hanan Gani
Wenqi Zhu
Jiale Cao
Eric P. Xing
Fahad Shahbaz Khan
Salman Khan
MLLM
VGen
VLM
140
9
0
07 Nov 2024
Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Yuhang Liu
Xueyu Hu
Shengyu Zhang
Jingyuan Chen
Fan Wu
Leilei Gan
RALM
42
0
0
06 Nov 2024
Can Custom Models Learn In-Context? An Exploration of Hybrid Architecture Performance on In-Context Learning Tasks
Ryan Campbell
Nelson Lojo
Kesava Viswanadha
Christoffer Grondal Tryggestad
Derrick Han Sun
Sriteja Vijapurapu
August Rolfsen
Anant Sahai
72
0
0
06 Nov 2024
RAGulator: Lightweight Out-of-Context Detectors for Grounded Text Generation
Ian Poey
Jiajun Liu
Qishuai Zhong
Adrien Chenailler
115
0
0
06 Nov 2024
Lexicalization Is All You Need: Examining the Impact of Lexical Knowledge in a Compositional QALD System
David Maria Schmidt
Mohammad Fazleh Elahi
Philipp Cimiano
98
0
0
06 Nov 2024
From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning
Zhirui Deng
Zhicheng Dou
Yinlin Zhu
Ji-Rong Wen
Ruibin Xiong
Mang Wang
Xin Wu
97
9
0
06 Nov 2024
Policy Aggregation
Parand A. Alamdari
Soroush Ebadian
Ariel D. Procaccia
OffRL
76
5
0
06 Nov 2024
Semantic Navigation for AI-assisted Ideation
Thomas Sandholm
Sarah Dong
Sayandev Mukherjee
John Feland
Bernardo A. Huberman
62
0
0
06 Nov 2024
Diversity Helps Jailbreak Large Language Models
Weiliang Zhao
Daniel Ben-Levi
Wei Hao
Junfeng Yang
Chengzhi Mao
AAML
496
1
0
06 Nov 2024
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Ziliang Gan
Yu Lu
D. Zhang
Haohan Li
Che Liu
...
Haipang Wu
Chaoyou Fu
Z. Xu
Rongjunchen Zhang
Yong Dai
113
13
0
05 Nov 2024
Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs
Long-Fei Li
Peng Zhao
Zhi Zhou
81
1
0
05 Nov 2024
Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
Jason Vega
Junsheng Huang
Gaokai Zhang
Hangoo Kang
Minjia Zhang
Gagandeep Singh
76
1
0
05 Nov 2024
CE-CoLLM: Efficient and Adaptive Large Language Models Through Cloud-Edge Collaboration
Hongpeng Jin
Yanzhao Wu
166
5
0
05 Nov 2024
Grounding Natural Language to SQL Translation with Data-Based Self-Explanations
Yuankai Fan
Tonghui Ren
Can Huang
Zhenying He
Xinyu Wang
LRM
133
2
0
05 Nov 2024
On the Loss of Context-awareness in General Instruction Fine-tuning
Yihan Wang
Andrew Bai
Nanyun Peng
Cho-Jui Hsieh
384
2
0
05 Nov 2024
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
Hayeon Bang
Eunjin Choi
Megan Finch
Seungheon Doh
Seolhee Lee
G. Lee
Juhan Nam
75
0
0
04 Nov 2024
A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification
Sorouralsadat Fatemi
Yuheng Hu
Maryam Mousavi
110
4
0
04 Nov 2024
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Xingwu Sun
Yanfeng Chen
Yanwen Huang
Ruobing Xie
Jiaqi Zhu
...
Zhanhui Kang
Yong Yang
Yuhong Liu
Di Wang
Jie Jiang
MoE
ALM
ELM
172
34
0
04 Nov 2024
Improving Steering Vectors by Targeting Sparse Autoencoder Features
Sviatoslav Chalnev
Matthew Siu
Arthur Conmy
LLMSV
122
28
0
04 Nov 2024
Culinary Class Wars: Evaluating LLMs using ASH in Cuisine Transfer Task
Hoonick Lee
Mogan Gim
Donghyeon Park
Donghee Choi
Jaewoo Kang
59
0
0
04 Nov 2024
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Yuxin Xiao
Chaoqun Wan
Yonggang Zhang
Wenxiao Wang
Binbin Lin
Xiaofei He
Xu Shen
Jieping Ye
53
0
0
04 Nov 2024
Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback
Guan-Ting Lin
Prashanth Gurunath Shivakumar
Aditya Gourav
Yile Gu
Ankur Gandhe
Hung-yi Lee
I. Bulyko
132
9
0
04 Nov 2024
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM
Haw-Shiuan Chang
Nanyun Peng
Mohit Bansal
Anil Ramakrishna
Tagyoung Chung
92
4
0
03 Nov 2024
Sample-Efficient Alignment for LLMs
Zichen Liu
Changyu Chen
Chao Du
Wee Sun Lee
Min Lin
102
4
0
03 Nov 2024