2005.14165
Cited By
v1
v2
v3
v4 (latest)
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,456 papers shown
Title
Understanding Postpartum Parents' Experiences via Two Digital Platforms
X. Yao
M. Mikhelson
Megan Micheletti
Eunsol Choi
S. C. Watkins
Edison Thomaz
Kaya de
31
7
0
22 Dec 2022
Improving Automated Program Repair with Domain Adaptation
Armin Zirak
Hadi Hemmati
68
11
0
21 Dec 2022
What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis
Xiang Deng
Vasilisa Bashlovkina
Feng Han
Simon Baumgartner
Michael Bendersky
73
46
0
21 Dec 2022
Towards Neural Variational Monte Carlo That Scales Linearly with System Size
Or Sharir
G. Chan
Anima Anandkumar
61
4
0
21 Dec 2022
Language models are better than humans at next-token prediction
Buck Shlegeris
Fabien Roger
Lawrence Chan
Euan McLean
ELM
LRM
81
12
0
21 Dec 2022
Critic-Guided Decoding for Controlled Text Generation
Minbeom Kim
Hwanhee Lee
Kang Min Yoo
Joonsuk Park
Hwaran Lee
Kyomin Jung
115
36
0
21 Dec 2022
SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
M Saiful Bari
Aston Zhang
Shuai Zheng
Xingjian Shi
Yi Zhu
Shafiq Joty
Mu Li
RALM
VLM
VPVLM
LRM
97
5
0
21 Dec 2022
Language Models as Inductive Reasoners
Zonglin Yang
Li Dong
Xinya Du
Hao Cheng
Min Zhang
Xiaodong Liu
Jianfeng Gao
Furu Wei
ReLM
LRM
96
37
0
21 Dec 2022
From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models
Jiaxian Guo
Junnan Li
Dongxu Li
A. M. H. Tiong
Boyang Albert Li
Dacheng Tao
Steven C. H. Hoi
VLM
MLLM
83
118
0
21 Dec 2022
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models
Dheeraj Mekala
Jason Wolfe
Subhro Roy
100
9
0
21 Dec 2022
SERENGETI: Massively Multilingual Language Models for Africa
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Alcides Alcoba Inciarte
76
33
0
21 Dec 2022
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
Zhiyang Xu
Ying Shen
Lifu Huang
MLLM
141
120
0
21 Dec 2022
ImPaKT: A Dataset for Open-Schema Knowledge Base Construction
Luke Vilnis
Zachary Kenneth Fisher
Bhargav Kanagal
Patrick C. Murray
Sumit Sanghai
79
3
0
21 Dec 2022
A Mutation-based Text Generation for Adversarial Machine Learning Applications
Jesus Guerrero
G. Liang
I. Alsmadi
DeLMO
MedIm
71
1
0
21 Dec 2022
ORCA: A Challenging Benchmark for Arabic Language Understanding
AbdelRahim Elmadany
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
ELM
109
45
0
21 Dec 2022
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
AbdelRahim Elmadany
Alcides Alcoba Inciarte
Md. Tawkat Islam Khondaker
77
8
0
21 Dec 2022
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
Hao Sun
Zhexin Zhang
Fei Mi
Yasheng Wang
Wen Liu
Jianwei Cui
Bin Wang
Qun Liu
Minlie Huang
92
21
0
21 Dec 2022
Zero-shot Triplet Extraction by Template Infilling
Bosung Kim
Hayate Iso
Nikita Bhutani
Estevam R. Hruschka
Ndapandula Nakashole
Tom Mitchell
ViT
58
10
0
21 Dec 2022
Generation-Augmented Query Expansion For Code Retrieval
Dong Li
Yelong Shen
Ruoming Jin
Yi Mao
Kuan-Chieh Wang
Weizhu Chen
RALM
69
8
0
20 Dec 2022
Ontologically Faithful Generation of Non-Player Character Dialogues
Nathaniel Weir
Ryan Thomas
Randolph D'Amore
Kellie Hill
Benjamin Van Durme
Harsh Jhamtani
75
7
0
20 Dec 2022
On-the-fly Denoising for Data Augmentation in Natural Language Understanding
Tianqing Fang
Wenxuan Zhou
Fangyu Liu
Hongming Zhang
Yangqiu Song
Muhao Chen
106
1
0
20 Dec 2022
Unleashing the Power of Visual Prompting At the Pixel Level
Junyang Wu
Xianhang Li
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
VPVLM
VLM
97
32
0
20 Dec 2022
A Length-Extrapolatable Transformer
Yutao Sun
Li Dong
Barun Patra
Shuming Ma
Shaohan Huang
Alon Benhaim
Vishrav Chaudhary
Xia Song
Furu Wei
115
124
0
20 Dec 2022
Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good movie, and a good prompt too?
Weijia Shi
Xiaochuang Han
Hila Gonen
Ari Holtzman
Yulia Tsvetkov
Luke Zettlemoyer
111
44
0
20 Dec 2022
Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Martha Lewis
Nihal V. Nayak
Peilin Yu
Qinan Yu
Jack Merullo
Stephen H. Bach
Ellie Pavlick
VLM
OCL
CoGe
134
68
0
20 Dec 2022
A Survey of Deep Learning for Mathematical Reasoning
Pan Lu
Liang Qiu
Wenhao Yu
Sean Welleck
Kai-Wei Chang
ReLM
LRM
133
150
0
20 Dec 2022
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
96
31
0
20 Dec 2022
Trustworthy Social Bias Measurement
Rishi Bommasani
Percy Liang
76
11
0
20 Dec 2022
Open Domain Multi-document Summarization: A Comprehensive Study of Model Brittleness under Retrieval
John Giorgi
Luca Soldaini
Bo Wang
Gary D. Bader
Kyle Lo
Lucy Lu Wang
Arman Cohan
95
19
0
20 Dec 2022
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
Suwon Shon
Siddhant Arora
Chyi-Jiunn Lin
Ankita Pasad
Felix Wu
Roshan S. Sharma
Wei Wu
Hung-yi Lee
Karen Livescu
Shinji Watanabe
ELM
80
33
0
20 Dec 2022
Task Ambiguity in Humans and Language Models
Alex Tamkin
Kunal Handa
Ava Shrestha
Noah D. Goodman
UQLM
122
23
0
20 Dec 2022
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
Alex Troy Mallen
Akari Asai
Victor Zhong
Rajarshi Das
Daniel Khashabi
Hannaneh Hajishirzi
RALM
HILM
KELM
140
611
0
20 Dec 2022
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
KELM
RALM
LRM
168
476
0
20 Dec 2022
Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild?
Sang-Woo Lee
Sungdong Kim
Donghyeon Ko
Dong-hyun Ham
Youngki Hong
...
Wangkyo Jung
Kyunghyun Cho
Donghyun Kwak
H. Noh
W. Park
103
2
0
20 Dec 2022
A Measure-Theoretic Characterization of Tight Language Models
Li Du
Lucas Torroba Hennigen
Tiago Pimentel
Clara Meister
Jason Eisner
Ryan Cotterell
100
32
0
20 Dec 2022
Precise Zero-Shot Dense Retrieval without Relevance Labels
Luyu Gao
Xueguang Ma
Jimmy J. Lin
Jamie Callan
RALM
103
342
0
20 Dec 2022
LAMBADA: Backward Chaining for Automated Reasoning in Natural Language
Seyed Mehran Kazemi
Najoung Kim
Deepti Bhatia
Xinyuan Xu
Deepak Ramachandran
LRM
106
81
0
20 Dec 2022
Execution-Based Evaluation for Open-Domain Code Generation
Zhiruo Wang
Shuyan Zhou
Daniel Fried
Graham Neubig
ELM
114
84
0
20 Dec 2022
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
103
26
0
20 Dec 2022
Evaluation for Change
Rishi Bommasani
ELM
64
0
0
20 Dec 2022
Little Red Riding Hood Goes Around the Globe: Crosslingual Story Planning and Generation with Large Language Models
E. Razumovskaia
Joshua Maynez
Annie Louis
Mirella Lapata
Shashi Narayan
LRM
58
5
0
20 Dec 2022
Generic Temporal Reasoning with Differential Analysis and Explanation
Yu Feng
Ben Zhou
Haoyu Wang
H. Jin
Dan Roth
OOD
124
21
0
20 Dec 2022
Controllable Text Generation with Language Constraints
Howard Chen
Huihan Li
Danqi Chen
Karthik Narasimhan
67
16
0
20 Dec 2022
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
Hyunwoo J. Kim
Jack Hessel
Liwei Jiang
Peter West
Ximing Lu
...
Ronan Le Bras
Malihe Alikhani
Gunhee Kim
Maarten Sap
Yejin Choi
HILM
132
171
0
20 Dec 2022
Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models
Jingjing Xu
Qingxiu Dong
Hongyi Liu
Lei Li
ALM
LRM
69
1
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
95
250
0
20 Dec 2022
Perplexed by Quality: A Perplexity-based Method for Adult and Harmful Content Detection in Multilingual Heterogeneous Web Data
Timm Jansen
Yangling Tong
V. Zevallos
Pedro Ortiz Suarez
74
20
0
20 Dec 2022
Geographic and Geopolitical Biases of Language Models
Fahim Faisal
Antonios Anastasopoulos
94
21
0
20 Dec 2022
Towards Reasoning in Large Language Models: A Survey
Jie Huang
Kevin Chen-Chuan Chang
LM&MA
ELM
LRM
190
645
0
20 Dec 2022
Contrastive Learning Reduces Hallucination in Conversations
Weiwei Sun
Zhengliang Shi
Shen Gao
Fajie Yuan
Maarten de Rijke
Zhaochun Ren
109
67
0
20 Dec 2022