Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 11,640 papers shown
Title
Miutsu: NTU's TaskBot for the Alexa Prize
Yen-Ting Lin
Hui-Chi Kuo
Zesheng Xu
Ssu Chiu
Chieh-Chih Hung
Yi-Cheng Chen
Chao-Wei Huang
Yun-Nung Chen
25
2
0
16 May 2022
Adaptive Prompt Learning-based Few-Shot Sentiment Analysis
Pengfei Zhang
Tingting Chai
Yongdong Xu
VLM
24
13
0
15 May 2022
ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts
Sonia K. Murthy
Kyle Lo
Daniel King
Chandra Bhagavatula
Bailey Kuehl
Sophie Johnson
Jon Borchardt
Daniel S. Weld
Tom Hope
Doug Downey
40
12
0
14 May 2022
Generating Literal and Implied Subquestions to Fact-check Complex Claims
Jifan Chen
Aniruddh Sriram
Eunsol Choi
Greg Durrett
HILM
36
60
0
14 May 2022
Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Philippe Laban
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
43
5
0
13 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
59
345
0
13 May 2022
The Creativity of Text-to-Image Generation
J. Oppenlaender
30
192
0
13 May 2022
Detailed Balanced Chemical Reaction Networks as Generalized Boltzmann Machines
William G. Poole
T. Ouldridge
Manoj Gopalkrishnan
E. Winfree
15
6
0
12 May 2022
Overparameterization Improves StyleGAN Inversion
Yohan Poirier-Ginter
Alexandre Lessard
Ryan Smith
Jean-François Lalonde
48
4
0
12 May 2022
The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Zixin Wen
Yuanzhi Li
SSL
37
34
0
12 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
119
793
0
12 May 2022
Dynamic Prefix-Tuning for Generative Template-based Event Extraction
Xiao Liu
Heyan Huang
Ge Shi
Bo Wang
31
100
0
12 May 2022
Towards Answering Open-ended Ethical Quandary Questions
Yejin Bang
Nayeon Lee
Tiezheng Yu
Leila Khalatbari
Yan Xu
...
Romain Barraud
Elham J. Barezi
Andrea Madotto
Hayden Kee
Pascale Fung
ELM
40
6
0
12 May 2022
Minimal Neural Network Models for Permutation Invariant Agents
J. Pedersen
S. Risi
51
3
0
12 May 2022
Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks
Katherine M. Collins
Catherine Wong
Jiahai Feng
Megan Wei
J. Tenenbaum
LRM
33
58
0
11 May 2022
Clinical Prompt Learning with Frozen Language Models
Niall Taylor
Yi Zhang
Dan W Joyce
A. Nevado-Holgado
Andrey Kormilitzin
VLM
LM&MA
16
31
0
11 May 2022
Reducing Activation Recomputation in Large Transformer Models
V. Korthikanti
Jared Casper
Sangkug Lym
Lawrence C. McAfee
M. Andersch
M. Shoeybi
Bryan Catanzaro
AI4CE
44
257
0
10 May 2022
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
74
298
0
10 May 2022
Extracting Latent Steering Vectors from Pretrained Language Models
Nishant Subramani
Nivedita Suresh
Matthew E. Peters
LLMSV
36
82
0
10 May 2022
A High Throughput Generative Vector Autoregression Model for Stochastic Synapses
T. Hennen
A. Elías
J. Nodin
G. Molas
R. Waser
D. J. Wouters
D. Bedau
19
4
0
10 May 2022
Learning to Answer Visual Questions from Web Videos
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
42
33
0
10 May 2022
Symphony Generation with Permutation Invariant Language Model
Jiafeng Liu
Yuanliang Dong
Zehua Cheng
Xinran Zhang
Xiaobing Li
Feng Yu
Maosong Sun
21
40
0
10 May 2022
Few-shot Mining of Naturally Occurring Inputs and Outputs
Mandar Joshi
Terra Blevins
M. Lewis
Daniel S. Weld
Luke Zettlemoyer
33
1
0
09 May 2022
ConvMAE: Masked Convolution Meets Masked Autoencoders
Peng Gao
Teli Ma
Hongsheng Li
Ziyi Lin
Jifeng Dai
Yu Qiao
ViT
19
122
0
08 May 2022
Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence
Myeongjun Jang
Frank Mtumbuka
Thomas Lukasiewicz
36
9
0
08 May 2022
Odor Descriptor Understanding through Prompting
Laura Sisson
13
1
0
07 May 2022
When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it
Sebastian Schuster
Tal Linzen
18
25
0
06 May 2022
Beyond backpropagation: bilevel optimization through implicit differentiation and equilibrium propagation
Nicolas Zucchet
João Sacramento
36
15
0
06 May 2022
KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering
Rongxiang Weng
Chengyu Wang
Minghui Qiu
Qiuhui Shi
Hongbin Wang
Jun Huang
Ming Gao
RALM
39
16
0
06 May 2022
Explaining the Effectiveness of Multi-Task Learning for Efficient Knowledge Extraction from Spine MRI Reports
Arijit Sehanobish
M. Sandora
Nabila Abraham
Jayashri Pawar
Danielle Torres
Anasuya Das
M. Becker
Richard Herzog
Benjamin Odry
Ron Vianu
26
3
0
06 May 2022
Communication-Efficient Adaptive Federated Learning
Yujia Wang
Lu Lin
Jinghui Chen
FedML
27
71
0
05 May 2022
Language Models Can See: Plugging Visual Controls in Text Generation
Yixuan Su
Tian Lan
Yahui Liu
Fangyu Liu
Dani Yogatama
Yan Wang
Lingpeng Kong
Nigel Collier
VLM
MLLM
62
97
0
05 May 2022
Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021
Julia Kiseleva
Ziming Li
Mohammad Aliannejadi
Shrestha Mohanty
Maartje ter Hoeve
...
I. Churin
Putra Manggala
Kata Naszádi
Michiel van der Meer
Taewoon Kim
LLMAG
33
30
0
05 May 2022
PREME: Preference-based Meeting Exploration through an Interactive Questionnaire
Negar Arabzadeh
Ali Ahmadvand
Julia Kiseleva
Yang Liu
Ahmed Hassan Awadallah
Ming Zhong
Milad Shokouhi
28
4
0
05 May 2022
Relation Extraction as Open-book Examination: Retrieval-enhanced Prompt Tuning
Xiang Chen
Lei Li
Ningyu Zhang
Chuanqi Tan
Fei Huang
Luo Si
Huajun Chen
RALM
VLM
24
36
0
04 May 2022
Language Models in the Loop: Incorporating Prompting into Weak Supervision
Ryan Smith
Jason Alan Fries
Braden Hancock
Stephen H. Bach
58
53
0
04 May 2022
Compositional Task-Oriented Parsing as Abstractive Question Answering
Wenting Zhao
Konstantine Arkoudas
Weiqiong Sun
Claire Cardie
31
13
0
04 May 2022
A Computational Inflection for Scientific Discovery
Tom Hope
Doug Downey
Oren Etzioni
Daniel S. Weld
Eric Horvitz
AI4CE
36
32
0
04 May 2022
Sequencer: Deep LSTM for Image Classification
Yuki Tatsunami
Masato Taki
VLM
ViT
31
78
0
04 May 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLM
CLIP
OffRL
85
1,263
0
04 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters
Xue Bin Peng
Yunrong Guo
L. Halper
Sergey Levine
Sanja Fidler
30
15
0
04 May 2022
All You May Need for VQA are Image Captions
Soravit Changpinyo
Doron Kukliansky
Idan Szpektor
Xi Chen
Nan Ding
Radu Soricut
32
70
0
04 May 2022
Zero-shot Sonnet Generation with Discourse-level Planning and Aesthetics Features
Yufei Tian
Nanyun Peng
19
27
0
03 May 2022
Improving In-Context Few-Shot Learning via Self-Supervised Training
Mingda Chen
Jingfei Du
Ramakanth Pasunuru
Todor Mihaylov
Srini Iyer
Ves Stoyanov
Zornitsa Kozareva
SSL
AI4MH
42
64
0
03 May 2022
Adversarial Training for High-Stakes Reliability
Daniel M. Ziegler
Seraphina Nix
Lawrence Chan
Tim Bauman
Peter Schmidt-Nielsen
...
Noa Nabeshima
Benjamin Weinstein-Raun
D. Haas
Buck Shlegeris
Nate Thomas
AAML
38
59
0
03 May 2022
Learning to Transfer Prompts for Text Generation
Junyi Li
Tianyi Tang
J. Nie
Ji-Rong Wen
Wayne Xin Zhao
24
40
0
03 May 2022
Finding patterns in Knowledge Attribution for Transformers
Jeevesh Juneja
Ritu Agarwal
KELM
19
1
0
03 May 2022
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
127
3,527
0
02 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin Adewumi
F. Liwicki
Marcus Liwicki
34
15
0
02 May 2022
Logiformer: A Two-Branch Graph Transformer Network for Interpretable Logical Reasoning
Fangzhi Xu
Jun Liu
Qika Lin
Yudai Pan
Lingling Zhang
29
24
0
02 May 2022
Previous
1
2
3
...
201
202
203
...
231
232
233
Next