2005.14165
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 11,595 papers shown
The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems
Caleb Ziems
Jane A. Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
30
92
0
06 Apr 2022
Inducing Positive Perspectives with Text Reframing
Caleb Ziems
Minzhi Li
Anthony Zhang
Diyi Yang
DiffM
31
36
0
06 Apr 2022
DAGAM: Data Augmentation with Generation And Modification
Byeong-Cheol Jo
Tak-Sung Heo
Yeongjoon Park
Yongmin Yoo
Won-Ik Cho
Kyungsun Kim
VLM
25
2
0
06 Apr 2022
Data-Driven Approach for Log Instruction Quality Assessment
Jasmin Bogatinovski
S. Nedelkoski
Alexander Acker
Jorge Cardoso
O. Kao
19
4
0
06 Apr 2022
Can language models learn from explanations in context?
Andrew Kyle Lampinen
Ishita Dasgupta
Stephanie C. Y. Chan
Kory Matthewson
Michael Henry Tessler
Antonia Creswell
James L. McClelland
Jane X. Wang
Felix Hill
LRM
ReLM
61
286
0
05 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
148
6,035
0
05 Apr 2022
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Gaurav Sahu
Pau Rodríguez López
I. Laradji
Parmida Atighehchian
David Vazquez
Dzmitry Bahdanau
31
61
0
05 Apr 2022
Autoregressive 3D Shape Generation via Canonical Mapping
A. Cheng
Xueting Li
Sifei Liu
Min Sun
Ming Yang
3DPC
47
39
0
05 Apr 2022
High-Quality Pluralistic Image Completion via Code Shared VQGAN
Chuanxia Zheng
Guoxian Song
Tat-Jen Cham
Jianfei Cai
Dinh Q. Phung
Linjie Luo
VLM
38
10
0
05 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
51
102
0
04 Apr 2022
On scientific understanding with artificial intelligence
Mario Krenn
R. Pollice
S. Guo
Matteo Aldeghi
Alba Cervera-Lierta
...
Florian Hase
A. Jinich
AkshatKumar Nigam
Zhenpeng Yao
Alán Aspuru-Guzik
40
186
0
04 Apr 2022
Monte Carlo Physarum Machine: Characteristics of Pattern Formation in Continuous Stochastic Transport Networks
Oskar Elek
J. Burchett
J. Prochaska
A. Forbes
8
9
0
04 Apr 2022
PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models
Rabeeh Karimi Mahabadi
Luke Zettlemoyer
James Henderson
Marzieh Saeidi
Lambert Mathias
Ves Stoyanov
Majid Yazdani
VLM
34
70
0
03 Apr 2022
Improving Vision Transformers by Revisiting High-frequency Components
Jiawang Bai
Liuliang Yuan
Shutao Xia
Shuicheng Yan
Zhifeng Li
Wen Liu
ViT
16
90
0
03 Apr 2022
Inverse is Better! Fast and Accurate Prompt for Few-shot Slot Tagging
Yutai Hou
Cheng Chen
Xianzhen Luo
Bo-wen Li
Wanxiang Che
BDL
24
21
0
02 Apr 2022
QuadraLib: A Performant Quadratic Neural Network Library for Architecture Optimization and Design Exploration
Zirui Xu
Fuxun Yu
Jinjun Xiong
Xiang Chen
36
23
0
01 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
49
574
0
01 Apr 2022
Exploring Visual Prompts for Adapting Large-Scale Models
Hyojin Bahng
Ali Jahanian
S. Sankaranarayanan
Phillip Isola
VLM
VPVLM
LRM
25
256
0
31 Mar 2022
Do Vision-Language Pretrained Models Learn Composable Primitive Concepts?
Tian Yun
Usha Bhalla
Ellie Pavlick
Chen Sun
ReLM
CoGe
VLM
LRM
31
24
0
31 Mar 2022
On the probability-quality paradox in language generation
Clara Meister
Gian Wiher
Tiago Pimentel
Ryan Cotterell
36
14
0
31 Mar 2022
PanGu-Bot: Efficient Generative Dialogue Pre-training from Pre-trained Language Model
Fei Mi
Yitong Li
Yulong Zeng
Jingyan Zhou
Yasheng Wang
Chuanfei Xu
Lifeng Shang
Xin Jiang
Shiqi Zhao
Qun Liu
ALM
45
18
0
31 Mar 2022
Generative Pre-Trained Transformers for Biologically Inspired Design
Qihao Zhu
Xinyu Zhang
Jianxi Luo
AI4CE
45
3
0
31 Mar 2022
Leveraging pre-trained language models for conversational information seeking from text
Patrizio Bellan
M. Dragoni
Chiara Ghidini
30
6
0
31 Mar 2022
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
Alan Baade
Puyuan Peng
David Harwath
25
95
0
30 Mar 2022
Scientometric Review of Artificial Intelligence for Operations & Maintenance of Wind Turbines: The Past, Present and Future
Joyjit Chatterjee
Nina Dethlefs
34
83
0
30 Mar 2022
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers
Ziyue Xiang
Paolo Bestagini
Stefano Tubaro
Edward J. Delp
36
10
0
30 Mar 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
28
26
0
30 Mar 2022
WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
Heting Gao
Junrui Ni
Kaizhi Qian
Yang Zhang
Shiyu Chang
M. Hasegawa-Johnson
VLM
22
31
0
29 Mar 2022
LinkBERT: Pretraining Language Models with Document Links
Michihiro Yasunaga
J. Leskovec
Percy Liang
KELM
29
353
0
29 Mar 2022
Evaluating Prompts Across Multiple Choice Tasks In a Zero-Shot Setting
Gabriel Orlanski
LRM
29
2
0
29 Mar 2022
Training Compute-Optimal Large Language Models
Jordan Hoffmann
Sebastian Borgeaud
A. Mensch
Elena Buchatskaya
Trevor Cai
...
Karen Simonyan
Erich Elsen
Jack W. Rae
Oriol Vinyals
Laurent Sifre
AI4TS
69
1,856
0
29 Mar 2022
mc-BEiT: Multi-choice Discretization for Image BERT Pre-training
Xiaotong Li
Yixiao Ge
Kun Yi
Zixuan Hu
Ying Shan
Ling-yu Duan
37
38
0
29 Mar 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Manuel Kolmet
Qunjie Zhou
Aljosa Osep
Laura Leal-Taixe
27
24
0
28 Mar 2022
Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Mor Geva
Avi Caciularu
Ke Wang
Yoav Goldberg
KELM
74
338
0
28 Mar 2022
Few-Shot Learning with Siamese Networks and Label Tuning
Thomas Müller
Guillermo Pérez-Torró
Marc Franco-Salvador
VLM
28
38
0
28 Mar 2022
Generative Design Ideation: A Natural Language Generation Approach
Qihao Zhu
Jianxi Luo
AI4CE
35
19
0
28 Mar 2022
ANNA: Enhanced Language Representation for Question Answering
Changwook Jun
Hansol Jang
Myoseop Sim
Hyun Kim
Jooyoung Choi
Kyungkoo Min
Kyunghoon Bae
31
6
0
28 Mar 2022
Large-scale Bilingual Language-Image Contrastive Learning
ByungSoo Ko
Geonmo Gu
VLM
32
14
0
28 Mar 2022
Diagonal State Spaces are as Effective as Structured State Spaces
Ankit Gupta
Albert Gu
Jonathan Berant
59
293
0
27 Mar 2022
A Survey on Aspect-Based Sentiment Classification
Gianni Brauwers
Flavius Frasincar
LLMAG
39
110
0
27 Mar 2022
RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
Z. Geng
Luming Liang
Tianyu Ding
Ilya Zharkov
31
69
0
27 Mar 2022
A World-Self Model Towards Understanding Intelligence
Yutao Yue
32
2
0
25 Mar 2022
Gransformer: Transformer-based Graph Generation
Ahmad Khajenezhad
Seyed Ali Osia
Mahmood Karimian
H. Beigy
27
2
0
25 Mar 2022
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis
Erik Nijkamp
Bo Pang
Hiroaki Hayashi
Lifu Tu
Haiquan Wang
Yingbo Zhou
Silvio Savarese
Caiming Xiong
ELM
90
982
0
25 Mar 2022
Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers
A. Bucker
Luis F. C. Figueredo
Sami Haddadin
Ashish Kapoor
Shuang Ma
Rogerio Bonatti
LM&Ro
45
49
0
25 Mar 2022
Linking Emergent and Natural Languages via Corpus Transfer
Shunyu Yao
Mo Yu
Yang Zhang
Karthik Narasimhan
J. Tenenbaum
Chuang Gan
27
15
0
24 Mar 2022
Token Dropping for Efficient BERT Pretraining
Le Hou
Richard Yuanzhe Pang
Dinesh Manocha
Yuexin Wu
Xinying Song
Xiaodan Song
Denny Zhou
22
43
0
24 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
19
513
0
24 Mar 2022
Extended critical regimes of deep neural networks
Chengqing Qu
Asem Wardak
P. Gong
AI4CE
24
1
0
24 Mar 2022
Bioformers: Embedding Transformers for Ultra-Low Power sEMG-based Gesture Recognition
Luca Bompani
Francesco Bianco Morghet
Moritz Scherer
Simone Benatti
Luca Benini
Enrico Macii
M. Poncino
Daniele Jahier Pagliari
14
16
0
24 Mar 2022