2203.02155
Cited By
Training language models to follow instructions with human feedback
4 March 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
Pamela Mishkin
Chong Zhang
Sandhini Agarwal
Katarina Slama
Alex Ray
John Schulman
Jacob Hilton
Fraser Kelton
Luke E. Miller
Maddie Simens
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
Papers citing
"Training language models to follow instructions with human feedback"
50 / 7,310 papers shown
Title
Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent
Xiaonan Nie
Yi Liu
Fangcheng Fu
Jinbao Xue
Dian Jiao
Xupeng Miao
Yangyu Tao
Bin Cui
MoE
36
17
0
06 Mar 2023
Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM
Rachel Bawden
François Yvon
VLM
LRM
27
60
0
03 Mar 2023
Ask and You Shall Receive (a Graph Drawing): Testing ChatGPT's Potential to Apply Graph Layout Algorithms
Sara Di Bartolomeo
Giorgio Severi
V. Schetinger
Cody Dunne
46
8
0
03 Mar 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
94
11
0
03 Mar 2023
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
41
62
0
02 Mar 2023
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents
Wenlong Huang
Fei Xia
Dhruv Shah
Danny Driess
Andy Zeng
...
Pete Florence
Igor Mordatch
Sergey Levine
Karol Hausman
Brian Ichter
LM&Ro
32
43
0
01 Mar 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Seonghyeon Ye
Hyeonbin Hwang
Sohee Yang
Hyeongu Yun
Yireun Kim
Minjoon Seo
LRM
32
34
0
28 Feb 2023
Large Language Models Are State-of-the-Art Evaluators of Translation Quality
Tom Kocmi
C. Federmann
ELM
59
341
0
28 Feb 2023
A Survey on Long Text Modeling with Transformers
Zican Dong
Tianyi Tang
Lunyi Li
Wayne Xin Zhao
VLM
26
54
0
28 Feb 2023
Goal Driven Discovery of Distributional Differences via Language Descriptions
Ruiqi Zhong
Peter Zhang
Steve Li
Jinwoo Ahn
Dan Klein
Jacob Steinhardt
49
49
0
28 Feb 2023
Reward Design with Language Models
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
LM&Ro
44
202
0
27 Feb 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
88
12,418
0
27 Feb 2023
Human-in-the-Loop Schema Induction
Tianyi Zhang
Isaac Tham
Zhaoyi Hou
J. Ren
Liyang Zhou
...
Heng Ji
Martha Palmer
S. Brown
Reece Suchocki
Chris Callison-Burch
24
16
0
25 Feb 2023
Spanish Built Factual Freectianary (Spanish-BFF): the first AI-generated free dictionary
Miguel Ortega-Martín
Óscar García-Sierra
Alfonso Ardoiz
J. C. Armenteros
Jorge Álvarez
Adrián Alonso
44
2
0
24 Feb 2023
CARE: Collaborative AI-Assisted Reading Environment
Dennis Zyska
Nils Dycke
Jan Buchmann
Ilia Kuznetsov
Iryna Gurevych
19
6
0
24 Feb 2023
Factual Consistency Oriented Speech Recognition
Naoyuki Kanda
Takuya Yoshioka
Yang Liu
43
0
0
24 Feb 2023
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning
Vittoria Dentella
Fritz Guenther
Elliot Murphy
G. Marcus
Evelina Leivada
ELM
40
27
0
23 Feb 2023
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Kai Greshake
Sahar Abdelnabi
Shailesh Mishra
C. Endres
Thorsten Holz
Mario Fritz
SILM
63
443
0
23 Feb 2023
Sentence Simplification via Large Language Models
Yutao Feng
Jipeng Qiang
Yun Li
Yunhao Yuan
Yi Zhu
28
17
0
23 Feb 2023
Conversational Text-to-SQL: An Odyssey into State-of-the-Art and Challenges Ahead
S. Parthasarathi
Lu Zeng
Dilek Z. Hakkani-Tür
52
2
0
21 Feb 2023
ChatGPT: Jack of all trades, master of none
Jan Kocoń
Igor Cichecki
Oliwier Kaszyca
Mateusz Kochanek
Dominika Szydło
...
Maciej Piasecki
Lukasz Radliński
Konrad Wojtasik
Stanislaw Woźniak
Przemyslaw Kazienko
AI4MH
42
528
0
21 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
66
238
0
19 Feb 2023
Machine Love
Joel Lehman
28
5
0
18 Feb 2023
Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Albert Lu
Hongxin Zhang
Yanzhe Zhang
Xuezhi Wang
Diyi Yang
LRM
35
28
0
17 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
36
15
0
17 Feb 2023
Auditing large language models: a three-layered approach
Jakob Mokander
Jonas Schuett
Hannah Rose Kirk
Luciano Floridi
AILaw
MLAU
50
196
0
16 Feb 2023
Tuning computer vision models with task rewards
André Susano Pinto
Alexander Kolesnikov
Yuge Shi
Lucas Beyer
Xiaohua Zhai
VLM
32
40
0
16 Feb 2023
When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning
Siliang Zeng
Chenliang Li
Alfredo García
Min-Fong Hong
OffRL
34
13
0
15 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
25
175
0
13 Feb 2023
Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Daniel Kang
Xuechen Li
Ion Stoica
Carlos Guestrin
Matei A. Zaharia
Tatsunori Hashimoto
AAML
30
238
0
11 Feb 2023
Synthesizing Human Gaze Feedback for Improved NLP Performance
Varun Khurana
Yaman Kumar Singla
Nora Hollenstein
R. Kumar
Balaji Krishnamurthy
18
15
0
11 Feb 2023
The Re-Label Method For Data-Centric Machine Learning
Tonglei Guo
NoLa
39
2
0
09 Feb 2023
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models
Changan Niu
Chuanyi Li
Vincent Ng
Bin Luo
ELM
ALM
38
9
0
08 Feb 2023
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models
Hossein Hajipour
Keno Hassler
Thorsten Holz
Lea Schonherr
Mario Fritz
ELM
40
20
0
08 Feb 2023
Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness
Felix Friedrich
Manuel Brack
Lukas Struppek
Dominik Hintersdorf
P. Schramowski
Sasha Luccioni
Kristian Kersting
38
120
0
07 Feb 2023
ChatGPT and Software Testing Education: Promises & Perils
Sajed Jalil
Suzzana Rafi
Thomas D. Latoza
Kevin Moran
Wing Lam
ELM
36
173
0
07 Feb 2023
Data Selection for Language Models via Importance Resampling
Sang Michael Xie
Shibani Santurkar
Tengyu Ma
Percy Liang
46
173
0
06 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
45
163
0
06 Feb 2023
The Gradient of Generative AI Release: Methods and Considerations
Irene Solaiman
36
98
0
05 Feb 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
48
118
0
05 Feb 2023
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
K. Choromanski
Shanda Li
Valerii Likhosherstov
Kumar Avinava Dubey
Shengjie Luo
Di He
Yiming Yang
Tamás Sarlós
Thomas Weingarten
Adrian Weller
39
8
0
03 Feb 2023
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Zhixuan Liang
Yao Mu
Mingyu Ding
Fei Ni
Masayoshi Tomizuka
Ping Luo
80
101
0
03 Feb 2023
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
Zihao Wang
Shaofei Cai
Guanzhou Chen
Guy Van den Broeck
Xiaojian Ma
Yitao Liang
LM&Ro
LLMAG
60
318
0
03 Feb 2023
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment
Hao Liu
Wilson Yan
Pieter Abbeel
34
25
0
02 Feb 2023
Inform the uninformed: Improving Online Informed Consent Reading with an AI-Powered Chatbot
Ziang Xiao
T. Li
Karrie Karahalios
Hari Sundaram
36
34
0
02 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
50
10
0
01 Feb 2023
Grading Conversational Responses Of Chatbots
Grant Rosario
David Noever
AI4MH
43
4
0
01 Feb 2023
Large Language Models Can Be Easily Distracted by Irrelevant Context
Freda Shi
Xinyun Chen
Kanishka Misra
Nathan Scales
David Dohan
Ed H. Chi
Nathanael Scharli
Denny Zhou
ReLM
RALM
LRM
38
541
0
31 Jan 2023
Mathematical Capabilities of ChatGPT
Simon Frieder
Luca Pinchetti
Alexis Chevalier
Ryan-Rhys Griffiths
Tommaso Salvatori
Thomas Lukasiewicz
P. Petersen
Julius Berner
ELM
AI4MH
46
404
0
31 Jan 2023
Benchmarking Large Language Models for News Summarization
Tianyi Zhang
Faisal Ladhak
Esin Durmus
Percy Liang
Kathleen McKeown
Tatsunori B. Hashimoto
ELM
43
487
0
31 Jan 2023