2005.14165
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,456 papers shown
Title
What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary
Ori Ram
L. Bezalel
Adi Zicher
Yonatan Belinkov
Jonathan Berant
Amir Globerson
107
37
0
20 Dec 2022
Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering
Zhiyong Wu
Yaoxiang Wang
Jiacheng Ye
Lingpeng Kong
134
141
0
20 Dec 2022
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Hang Pu
Y. Lan
Chao Shen
95
41
0
20 Dec 2022
HINT: Hypernetwork Instruction Tuning for Efficient Zero- & Few-Shot Generalisation
Hamish Ivison
Akshita Bhagia
Yizhong Wang
Hannaneh Hajishirzi
Matthew E. Peters
146
20
0
20 Dec 2022
Identifying and Manipulating the Personality Traits of Language Models
Graham Caron
Shashank Srivastava
91
39
0
20 Dec 2022
Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study
Di Wu
Wasi Uddin Ahmad
Kai-Wei Chang
86
18
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
119
2
0
20 Dec 2022
Pay Attention to Your Tone: Introducing a New Dataset for Polite Language Rewrite
Xun Wang
Tao Ge
Allen Mao
Yuki Li
Furu Wei
Si-Qing Chen
95
5
0
20 Dec 2022
Human-Guided Fair Classification for Natural Language Processing
Florian E. Dorner
Momchil Peychev
Nikola Konstantinov
Naman Goel
Elliott Ash
Martin Vechev
FaML
80
4
0
20 Dec 2022
A Survey on Pretrained Language Models for Neural Code Intelligence
Yichen Xu
Yanqiao Zhu
52
17
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLM
ELM
LRM
138
351
0
20 Dec 2022
Language Modeling with Latent Situations
Belinda Z. Li
Maxwell Nye
Jacob Andreas
LRM
98
7
0
20 Dec 2022
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
82
72
0
20 Dec 2022
(QA)^2: Question Answering with Questionable Assumptions
Najoung Kim
Phu Mon Htut
Sam Bowman
Jackson Petty
111
39
0
20 Dec 2022
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
Boshi Wang
Sewon Min
Xiang Deng
Jiaming Shen
You Wu
Luke Zettlemoyer
Huan Sun
LRM
ReLM
122
252
0
20 Dec 2022
Are Deep Neural Networks SMARTer than Second Graders?
A. Cherian
Kuan-Chuan Peng
Suhas Lohit
Kevin A. Smith
J. Tenenbaum
AAML
LRM
ReLM
112
31
0
20 Dec 2022
On Improving Summarization Factual Consistency from Natural Language Feedback
Yixin Liu
Budhaditya Deb
Milagro Teruel
Aaron L Halfaker
Dragomir R. Radev
Ahmed Hassan Awadallah
HILM
62
38
0
20 Dec 2022
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency
Mingye Xu
Mutian Xu
Tong He
Wanli Ouyang
Yali Wang
Xiaoguang Han
Yu Qiao
79
10
0
20 Dec 2022
Future Sight: Dynamic Story Generation with Large Pretrained Language Models
Brian D. Zimmerman
Gaurav Sahu
Olga Vechtomova
47
0
0
20 Dec 2022
Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks
Kaiser Sun
Peng Qi
Yuhao Zhang
Lan Liu
William Yang Wang
Zhiheng Huang
80
9
0
19 Dec 2022
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
85
14
0
19 Dec 2022
Improved Long-Form Spoken Language Translation with Large Language Models
Arya D. McCarthy
Haotong Zhang
Shankar Kumar
Felix Stahlberg
Axel H. Ng
73
2
0
19 Dec 2022
A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models
Karin de Langis
Dongyeop Kang
103
1
0
19 Dec 2022
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations
Xinxi Lyu
Sewon Min
Iz Beltagy
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
75
68
0
19 Dec 2022
Synthetic Pre-Training Tasks for Neural Machine Translation
Zexue He
Graeme W. Blackwood
Yikang Shen
Julian McAuley
Rogerio Feris
54
4
0
19 Dec 2022
Training Trajectories of Language Models Across Scales
Mengzhou Xia
Mikel Artetxe
Chunting Zhou
Xi Lin
Ramakanth Pasunuru
Danqi Chen
Luke Zettlemoyer
Ves Stoyanov
AIFin
LRM
98
64
0
19 Dec 2022
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
175
2,440
0
19 Dec 2022
Evaluating Human-Language Model Interaction
Mina Lee
Megha Srivastava
Amelia Hardy
John Thickstun
Esin Durmus
...
Hancheng Cao
Tony Lee
Rishi Bommasani
Michael S. Bernstein
Percy Liang
LM&MA
ALM
108
102
0
19 Dec 2022
DSI++: Updating Transformer Memory with New Documents
Sanket Vaibhav Mehta
Jai Gupta
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
J. Rao
Marc Najork
Emma Strubell
Donald Metzler
CLL
103
46
0
19 Dec 2022
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
Yu Gu
Xiang Deng
Yu-Chuan Su
LLMAG
123
58
0
19 Dec 2022
A Retrieve-and-Read Framework for Knowledge Graph Link Prediction
Vardaan Pahuja
Boshi Wang
Hugo Latapie
Jayanth Srinivasa
Yu-Chuan Su
76
13
0
19 Dec 2022
On Event Individuation for Document-Level Information Extraction
William Gantt
Reno Kriz
Yunmo Chen
Siddharth Vashishtha
Aaron Steven White
69
2
0
19 Dec 2022
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
ALM
167
374
0
19 Dec 2022
Multilingual Sequence-to-Sequence Models for Hebrew NLP
Matan Eyal
Hila Noga
Roee Aharoni
Idan Szpektor
Reut Tsarfaty
47
4
0
19 Dec 2022
StyleFlow: Disentangle Latent Representations via Normalizing Flow for Unsupervised Text Style Transfer
Kangchen Zhu
Zhiliang Tian
Ruifeng Luo
Xiaoguang Mao
OOD
105
3
0
19 Dec 2022
Visconde: Multi-document QA with GPT-3 and Neural Reranking
Jayr Pereira
R. Fidalgo
R. Lotufo
Rodrigo Nogueira
BDL
RALM
78
33
0
19 Dec 2022
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages
Ercong Nie
Sheng Liang
Helmut Schmid
Hinrich Schütze
VLM
RALM
LRM
114
22
0
19 Dec 2022
Optimizing Prompts for Text-to-Image Generation
Y. Hao
Zewen Chi
Li Dong
Furu Wei
125
152
0
19 Dec 2022
Explanation Regeneration via Information Bottleneck
Qintong Li
Zhiyong Wu
Lingpeng Kong
Wei Bi
93
4
0
19 Dec 2022
Reasoning with Language Model Prompting: A Survey
Shuofei Qiao
Yixin Ou
Ningyu Zhang
Xiang Chen
Yunzhi Yao
Shumin Deng
Chuanqi Tan
Fei Huang
Huajun Chen
ReLM
ELM
LRM
232
327
0
19 Dec 2022
Latent Diffusion for Language Generation
Justin Lovelace
Varsha Kishore
Chao-gang Wan
Eliot Shekhtman
Kilian Q. Weinberger
DiffM
132
82
0
19 Dec 2022
Medical Knowledge Graph QA for Drug-Drug Interaction Prediction based on Multi-hop Machine Reading Comprehension
Peng Gao
Feng Gao
Jiancheng Ni
Yu Wang
Fei Wang
62
3
0
19 Dec 2022
AI Art in Architecture
J. Ploennigs
Markus Berger
DiffM
81
72
0
19 Dec 2022
Review of security techniques for memristor computing systems
Minhui Zou
Nan Du
Shahar Kvatinsky
AAML
26
7
0
19 Dec 2022
E-NER -- An Annotated Named Entity Recognition Corpus of Legal Text
Ting Wai Terence Au
Ingemar J. Cox
Vasileios Lampos
AILaw
72
28
0
19 Dec 2022
MIGA: A Unified Multi-task Generation Framework for Conversational Text-to-SQL
Yingwen Fu
Wenjie Ou
Zhou Yu
Yue Lin
75
7
0
19 Dec 2022
PromptBoosting: Black-Box Text Classification with Ten Forward Passes
Bairu Hou
J. O'Connor
Jacob Andreas
Shiyu Chang
Yang Zhang
VLM
57
44
0
19 Dec 2022
Discovering Language Model Behaviors with Model-Written Evaluations
Ethan Perez
Sam Ringer
Kamilė Lukošiūtė
Karina Nguyen
Edwin Chen
...
Danny Hernandez
Deep Ganguli
Evan Hubinger
Nicholas Schiefer
Jared Kaplan
ALM
102
407
0
19 Dec 2022
Natural Language to Code Generation in Interactive Data Science Notebooks
Pengcheng Yin
Wen-Ding Li
Kefan Xiao
Abhishek Rao
Yeming Wen
...
Paige Bailey
Michele Catasta
Henryk Michalewski
Oleksandr Polozov
Charles Sutton
88
66
0
19 Dec 2022
ColoristaNet for Photorealistic Video Style Transfer
Xiaowen Qiu
Ruize Xu
Boan He
Yingtao Zhang
Wenqiang Zhang
Weifeng Ge
59
0
0
19 Dec 2022