2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Papers citing
"Training language models to follow instructions with human feedback"
50 / 7,311 papers shown
Title
Benchmarking zero-shot stance detection with FlanT5-XXL: Insights from training data, prompting, and decoding strategies into its near-SoTA performance
Rachith Aiyappa
Shruthi Senthilmani
Jisun An
Haewoon Kwak
Yong-Yeol Ahn
37
3
0
01 Mar 2024
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Lei Li
Yuqi Wang
Runxin Xu
Peiyi Wang
Xiachong Feng
Lingpeng Kong
Qi Liu
45
51
0
01 Mar 2024
Improving Socratic Question Generation using Data Augmentation and Preference Optimization
Nischal Ashok Kumar
Andrew Lan
43
8
0
01 Mar 2024
"Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models
Karina Halevy
Anna Sotnikova
Badr AlKhamissi
Syrielle Montariol
Antoine Bosselut
KELM
42
3
0
29 Feb 2024
FAC²E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition
Xiaoqiang Wang
Bang Liu
Lingfei Wu
42
0
0
29 Feb 2024
Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling
Gabriel Grand
Valerio Pepe
Jacob Andreas
Joshua B. Tenenbaum
ReLM
36
6
0
29 Feb 2024
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning
Kate Sanders
Nathaniel Weir
Benjamin Van Durme
LRM
41
11
0
29 Feb 2024
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
Chao Qian
Jie Zhang
Wei Yao
Dongrui Liu
Zhen-fei Yin
Yu Qiao
Yong Liu
Jing Shao
LLMSV
LRM
57
13
0
29 Feb 2024
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
Tsun-Hsuan Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
39
39
0
29 Feb 2024
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
Yifei Zhou
Andrea Zanette
Jiayi Pan
Sergey Levine
Aviral Kumar
65
51
0
29 Feb 2024
Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines
Lijia Ma
Xingchen Xu
Yong-Ming Tan
32
8
0
29 Feb 2024
Deep Learning for Cross-Domain Data Fusion in Urban Computing: Taxonomy, Advances, and Outlook
Xingchen Zou
Yibo Yan
Xixuan Hao
Yuehong Hu
Haomin Wen
...
Junbo Zhang
Yong Li
Tianrui Li
Yu Zheng
Keli Zhang
HAI
AI4TS
57
37
0
29 Feb 2024
Memory-Augmented Generative Adversarial Transformers
Stephan Raaijmakers
Roos Bakker
Anita Cremers
R. D. Kleijn
Tom Kouwenhoven
Tessa Verhoef
41
0
0
29 Feb 2024
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
Mihai Masala
Traian Rebedea
Horia Velicu
AILaw
43
2
0
29 Feb 2024
Large Language Models are Learnable Planners for Long-Term Recommendation
Wentao Shi
Xiangnan He
Yang Zhang
Chongming Gao
Xinyue Li
Jizhi Zhang
Qifan Wang
Fuli Feng
49
11
0
29 Feb 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
39
6
0
29 Feb 2024
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment
Yiju Guo
Ganqu Cui
Lifan Yuan
Ning Ding
Jiexin Wang
...
Ruobing Xie
Jie Zhou
Yankai Lin
Zhiyuan Liu
Maosong Sun
36
60
0
29 Feb 2024
Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study
Prottay Kumar Adhikary
Aseem Srivastava
Shivani Kumar
Salam Michael Singh
Puneet Manuja
Jini K. Gopinath
Vijay Krishnan
Swati Kedia
K. Deb
Tanmoy Chakraborty
AI4MH
45
8
0
29 Feb 2024
Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition
Boyu Chen
Siran Chen
Kunchang Li
Qinglin Xu
Yu Qiao
Yali Wang
34
3
0
29 Feb 2024
PopALM: Popularity-Aligned Language Models for Social Media Trendy Response Prediction
Erxin Yu
Jing Li
Chunpu Xu
35
3
0
29 Feb 2024
Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization
Aadirupa Saha
Pierre Gaillard
37
2
0
29 Feb 2024
Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*
Rodrigo Santos
Joao Silva
Luís Gomes
João Rodrigues
António Branco
46
10
0
29 Feb 2024
RORA: Robust Free-Text Rationale Evaluation
Zhengping Jiang
Yining Lu
Hanjie Chen
Daniel Khashabi
Benjamin Van Durme
Anqi Liu
53
1
0
28 Feb 2024
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability
Congying Xia
Chen Xing
Jiangshu Du
Xinyi Yang
Yihao Feng
Ran Xu
Wenpeng Yin
Caiming Xiong
ALM
35
42
0
28 Feb 2024
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Haoxiang Wang
Yong Lin
Wei Xiong
Rui Yang
Shizhe Diao
Shuang Qiu
Han Zhao
Tong Zhang
40
72
0
28 Feb 2024
Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification
Garima Chhikara
Anurag Sharma
Kripabandhu Ghosh
Abhijnan Chakraborty
44
14
0
28 Feb 2024
Is Crowdsourcing Breaking Your Bank? Cost-Effective Fine-Tuning of Pre-trained Language Models with Proximal Policy Optimization
Shuo Yang
Gjergji Kasneci
ALM
53
3
0
28 Feb 2024
A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Keith M. Davis
Ruisheng Cao
Su Zhu
Sheng Jiang
Hanchong Zhang
Lu Chen
Tuukka Ruotsalo
27
2
0
28 Feb 2024
Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information
Seungjong Sun
Eungu Lee
Dongyan Nan
Xiangying Zhao
Wonbyung Lee
Bernard J. Jansen
Jang Hyun Kim
64
17
0
28 Feb 2024
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction
Tong Liu
Yingjie Zhang
Zhe Zhao
Yinpeng Dong
Guozhu Meng
Kai Chen
AAML
56
48
0
28 Feb 2024
LoRA-SP: Streamlined Partial Parameter Adaptation for Resource-Efficient Fine-Tuning of Large Language Models
Yichao Wu
Yafei Xiang
Shuning Huo
Yulu Gong
Penghao Liang
33
7
0
28 Feb 2024
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Bin Cao
Jianhao Yuan
Yexin Liu
Jian Li
Shuyang Sun
Jing Liu
Bo Zhao
DiffM
43
7
0
28 Feb 2024
Hire a Linguist!: Learning Endangered Languages with In-Context Linguistic Descriptions
Kexun Zhang
Yee Man Choi
Zhenqiao Song
Taiqi He
Wenjie Wang
Lei Li
39
17
0
28 Feb 2024
TroubleLLM: Align to Red Team Expert
Zhuoer Xu
Jianping Zhang
Shiwen Cui
Changhua Meng
Weiqiang Wang
54
1
0
28 Feb 2024
Do Large Language Models Mirror Cognitive Language Processing?
Yuqi Ren
Renren Jin
Tongxuan Zhang
Deyi Xiong
53
4
0
28 Feb 2024
A Survey on Recent Advances in LLM-Based Multi-turn Dialogue Systems
Zihao Yi
Jiarui Ouyang
Yuwen Liu
Tianhao Liao
Zhe Xu
Ying Shen
LLMAG
LRM
67
60
0
28 Feb 2024
Collaborative decoding of critical tokens for boosting factuality of large language models
Lifeng Jin
Baolin Peng
Linfeng Song
Haitao Mi
Ye Tian
Dong Yu
HILM
35
6
0
28 Feb 2024
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training
Le Zhuo
Zewen Chi
Minghao Xu
Heyan Huang
Heqi Zheng
Conghui He
Xian-Ling Mao
Wentao Zhang
98
11
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
56
17
0
28 Feb 2024
Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions
Hanjie Chen
Zhouxiang Fang
Yash Singla
Mark Dredze
ELM
AI4MH
49
33
0
28 Feb 2024
Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging
Wei Zhang
Jian Yang
Anjie Le
Zhiyu Li
Shuangyong Song
Xianfu Cheng
Tieqiao Zheng
Shi Xu
69
14
0
28 Feb 2024
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey
Xi Fang
Weijie Xu
Fiona Anting Tan
Jiani Zhang
Ziqing Hu
Yanjun Qi
Scott Nickleach
Diego Socolinsky
Srinivasan H. Sengamedu
Christos Faloutsos
LMTD
ALM
59
66
0
27 Feb 2024
Prediction-Powered Ranking of Large Language Models
Ivi Chatzi
Eleni Straitouri
Suhas Thejaswi
Manuel Gomez Rodriguez
ALM
31
5
0
27 Feb 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
47
52
0
27 Feb 2024
Massive Activations in Large Language Models
Mingjie Sun
Xinlei Chen
J. Zico Kolter
Zhuang Liu
76
67
0
27 Feb 2024
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG
Ayana Niwa
Hayate Iso
36
4
0
27 Feb 2024
SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation
Shuangrui Ding
Zihan Liu
Xiao-wen Dong
Pan Zhang
Rui Qian
Conghui He
Dahua Lin
Jiaqi Wang
30
23
0
27 Feb 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
39
10
0
27 Feb 2024
TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space
Shaolei Zhang
Tian Yu
Yang Feng
HILM
KELM
42
40
0
27 Feb 2024
Ansible Lightspeed: A Code Generation Service for IT Automation
Priyam Sahoo
Saurabh Pujar
Ganesh Nalawade
Richard Gebhardt
Louis Mandel
Luca Buratti
13
0
0
27 Feb 2024