2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing "Language Models are Few-Shot Learners"
50 / 12,482 papers shown
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
193
727
0
05 Jan 2023
Corrupted by Algorithms? How AI-generated and Human-written Advice Shape (Dis)honesty
Margarita Leib
N. Köbis
Rainer Michael Rilke
Marloes H. J. Hagens
Bernd Irlenbusch
38
30
0
05 Jan 2023
Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Lorena Piedras
Lucas Rosenblatt
Julia Wilkins
79
10
0
05 Jan 2023
Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning
Wenting Zhao
Ibrahim Abdelaziz
Julian T Dolby
Kavitha Srinivas
M. Helali
Essam Mansour
59
0
0
05 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
124
67
0
04 Jan 2023
MessageNet: Message Classification using Natural Language Processing and Meta-data
Adar Kahana
Oren Elisha
26
0
0
04 Jan 2023
A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding
D. Cajueiro
A. G. Nery
Igor Tavares
Maísa Kely de Melo
Silvia A. dos Reis
Weigang Li
V. R. R. Celestino
86
15
0
04 Jan 2023
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng
Ziyang Chen
Andrew Owens
85
78
0
04 Jan 2023
UniHD at TSAR-2022 Shared Task: Is Compute All We Need for Lexical Simplification?
Dennis Aumiller
Michael Gertz
53
22
0
04 Jan 2023
Text sampling strategies for predicting missing bibliographic links
F. V. Krasnova
I. S. Smaznevicha
E. Baskakova
81
1
0
04 Jan 2023
On the Convergence of Stochastic Gradient Descent in Low-precision Number Formats
M. Cacciola
A. Frangioni
M. Asgharian
Alireza Ghaffari
V. Nia
86
4
0
04 Jan 2023
Infomaxformer: Maximum Entropy Transformer for Long Time-Series Forecasting Problem
Peiwang Tang
Xianchao Zhang
AI4TS
116
6
0
04 Jan 2023
Audio-Visual Efficient Conformer for Robust Speech Recognition
Maxime Burchi
Radu Timofte
VLM
78
35
0
04 Jan 2023
A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge
Haodi Ma
D. Wang
101
8
0
03 Jan 2023
Language Models are Drummers: Drum Composition with Natural Language Pre-Training
Li Zhang
Chris Callison-Burch
86
5
0
03 Jan 2023
On the causality-preservation capabilities of generative modelling
Yves-Cédric Bauwelinckx
Jan Dhaene
Tim Verdonck
Milan van den Heuvel
CML
AI4CE
98
1
0
03 Jan 2023
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
124
0
0
03 Jan 2023
Data Valuation Without Training of a Model
Nohyun Ki
Hoyong Choi
Hye Won Chung
TDI
82
33
0
03 Jan 2023
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven H. Wang
Antoine Scardigli
Leonard Tang
Wei Chen
D.M. Levkin
Anya Chen
Spencer Ball
Thomas Woodside
Oliver Zhang
Dan Hendrycks
AILaw
ELM
72
22
0
02 Jan 2023
CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection
Jie Liu
Yixiao Zhang
Jieneng Chen
Junfei Xiao
Yongyi Lu
Bennett A. Landman
Yixuan Yuan
Alan Yuille
Yucheng Tang
Zongwei Zhou
VLM
MedIm
145
211
0
02 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
278
560
0
02 Jan 2023
Transformer Based Geocoding
Yuval Solaz
Vitaly Shalumov
30
0
0
02 Jan 2023
DMOps: Data Management Operation and Recipes
E. Choi
Chanjun Park
81
7
0
02 Jan 2023
Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting
Benjamin Wilson
William Qi
Tanmay Agarwal
John Lambert
Jagjeet Singh
...
Andrew Hartnett
J. K. Pontes
Deva Ramanan
Peter Carr
James Hays
3DPC
AI4TS
138
647
0
02 Jan 2023
CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation
Ge Zhang
Yizhi Li
Yaoyao Wu
Linyuan Zhang
Chenghua Lin
Jiayi Geng
Shi Wang
Jie Fu
96
14
0
01 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
Lu Yang
Wenhe Jia
Shane Li
Q. Song
ViT
143
20
0
01 Jan 2023
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu
Chenyan Jia
Ge Zhang
Ziyu Zhuang
Tony X. Liu
Soroush Vosoughi
189
36
0
01 Jan 2023
Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He
Hongming Zhang
Dan Roth
KELM
LRM
242
169
0
31 Dec 2022
Unpacking the "Black Box" of AI in Education
Nabeel Gillani
R. Eynon
Catherine Chiabaut
Kelsey Finkel
76
59
0
31 Dec 2022
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLM
AIMat
157
547
0
31 Dec 2022
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
174
53
0
31 Dec 2022
Inconsistencies in Masked Language Models
Tom Young
Yunan Chen
Yang You
74
2
0
30 Dec 2022
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports
Katharina Jeblick
B. Schachtner
Jakob Dexl
Andreas Mittermeier
Anna Theresa Stüber
...
Tobias Weber
Philipp Wesp
B. Sabel
J. Ricke
Michael Ingrisch
LM&MA
MedIm
176
403
0
30 Dec 2022
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Yufeng Zhang
Boyi Liu
Qi Cai
Lingxiao Wang
Zhaoran Wang
128
13
0
30 Dec 2022
DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading
Hasan Md Tusfiqur
D. M. Nguyen
M. T. N. Truong
T. A. Nguyen
Binh Duc Nguyen
...
H. Profitlich
Ngoc T. T. Than
Ngan Le
P. Xie
Daniel Sonntag
MedIm
83
8
0
30 Dec 2022
MAUVE Scores for Generative Models: Theory and Practice
Krishna Pillutla
Lang Liu
John Thickstun
Sean Welleck
Swabha Swayamdipta
Rowan Zellers
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
EGVM
123
23
0
30 Dec 2022
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
93
8
0
30 Dec 2022
Improving Visual Representation Learning through Perceptual Understanding
Samyakh Tukra
Frederick Hoffman
Ken Chatfield
86
5
0
30 Dec 2022
Examining Political Rhetoric with Epistemic Stance Detection
Ankita Gupta
Su Lin Blodgett
Justin H. Gross
Brendan O'Connor
56
0
0
29 Dec 2022
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
45
0
29 Dec 2022
GPT Takes the Bar Exam
M. Bommarito
Daniel Martin Katz
ELM
81
156
0
29 Dec 2022
Eliminating Meta Optimization Through Self-Referential Meta Learning
Louis Kirsch
Jürgen Schmidhuber
58
7
0
29 Dec 2022
Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models
Mischa Dombrowski
Hadrien Reynaud
Matthew Baugh
Bernhard Kainz
DiffM
91
3
0
29 Dec 2022
3D Masked Modelling Advances Lesion Classification in Axial T2w Prostate MRI
Alvaro Fernandez-Quilez
C. Andersen
T. Eftestøl
S. R. Kjosavik
K. Oppedal
25
2
0
29 Dec 2022
Maximizing Use-Case Specificity through Precision Model Tuning
Pranjal Awasthi
David Recio-Mitter
Yosuke Kyle Sugi
LM&MA
27
1
0
29 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
117
91
0
28 Dec 2022
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab
Keshav Santhanam
Xiang Lisa Li
David Leo Wright Hall
Percy Liang
Christopher Potts
Matei A. Zaharia
RALM
KELM
114
269
0
28 Dec 2022
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Daniel Y. Fu
Tri Dao
Khaled Kamal Saab
A. Thomas
Atri Rudra
Christopher Ré
159
404
0
28 Dec 2022
Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain
Chengzhi Zhang
Yi Xiang
Wenke Hao
Zhicheng Li
Yuchen Qian
Yuzhuo Wang
48
11
0
28 Dec 2022
Knowledge-Guided Data-Centric AI in Healthcare: Progress, Shortcomings, and Future Directions
Edward Y. Chang
MedIm
52
7
0
27 Dec 2022