ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,256 papers shown
Title
Towards a Universal Continuous Knowledge Base
Towards a Universal Continuous Knowledge Base
Gang Chen
Maosong Sun
Yang Liu
55
3
0
25 Dec 2020
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and Perspectives
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLLOffRL
142
324
0
25 Dec 2020
I like fish, especially dolphins: Addressing Contradictions in Dialogue
  Modeling
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
Yixin Nie
Mary Williamson
Joey Tianyi Zhou
Douwe Kiela
Jason Weston
102
84
0
24 Dec 2020
QUACKIE: A NLP Classification Task With Ground Truth Explanations
QUACKIE: A NLP Classification Task With Ground Truth Explanations
Yves Rychener
X. Renard
Djamé Seddah
P. Frossard
Marcin Detyniecki
34
3
0
24 Dec 2020
Automated Lay Language Summarization of Biomedical Scientific Reviews
Automated Lay Language Summarization of Biomedical Scientific Reviews
Yue Guo
Weijian Qiu
Yizhong Wang
T. Cohen
71
78
0
23 Dec 2020
A Survey on Visual Transformer
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
229
2,275
0
23 Dec 2020
A Maturity Assessment Framework for Conversational AI Development
  Platforms
A Maturity Assessment Framework for Conversational AI Development Platforms
Johan Aronsson
Philip Lu
Daniel Struber
Thorsten Berger
28
12
0
22 Dec 2020
Few-Shot Text Generation with Pattern-Exploiting Training
Few-Shot Text Generation with Pattern-Exploiting Training
Timo Schick
Hinrich Schütze
106
148
0
22 Dec 2020
myGym: Modular Toolkit for Visuomotor Robotic Tasks
myGym: Modular Toolkit for Visuomotor Robotic Tasks
M. Vavrecka
Nikita Sokovnin
Megi Mejdrechova
G. Sejnova
Marek Otáhal
28
6
0
21 Dec 2020
A Distributional Approach to Controlled Text Generation
A Distributional Approach to Controlled Text Generation
Muhammad Khalifa
Hady ElSahar
Marc Dymetman
167
119
0
21 Dec 2020
Building LEGO Using Deep Generative Models of Graphs
Building LEGO Using Deep Generative Models of Graphs
Rylee Thompson
Elahe Ghalebi
Terrance Devries
Graham W. Taylor
GANAI4CE
76
20
0
21 Dec 2020
Universal Policies for Software-Defined MDPs
Universal Policies for Software-Defined MDPs
Daniel Selsam
Jesse Michael Han
L. D. Moura
Patrice Godefroid
47
2
0
21 Dec 2020
Sub-Linear Memory: How to Make Performers SLiM
Sub-Linear Memory: How to Make Performers SLiM
Valerii Likhosherstov
K. Choromanski
Jared Davis
Xingyou Song
Adrian Weller
66
19
0
21 Dec 2020
Fusing CNNs and statistical indicators to improve image classification
Fusing CNNs and statistical indicators to improve image classification
Javier Huertas-Tato
Alejandro Martín
Julián Fierrez
David Camacho
66
37
0
20 Dec 2020
LieTransformer: Equivariant self-attention for Lie Groups
LieTransformer: Equivariant self-attention for Lie Groups
M. Hutchinson
Charline Le Lan
Sheheryar Zaidi
Emilien Dupont
Yee Whye Teh
Hyunjik Kim
123
111
0
20 Dec 2020
Quantum Optical Convolutional Neural Network: A Novel Image Recognition
  Framework for Quantum Computing
Quantum Optical Convolutional Neural Network: A Novel Image Recognition Framework for Quantum Computing
Rishab Parthasarathy
Rohan T. Bhowmik
41
27
0
19 Dec 2020
Toward Transformer-Based Object Detection
Toward Transformer-Based Object Detection
Josh Beal
Eric Kim
Eric Tzeng
Dong Huk Park
Andrew Zhai
Dmitry Kislyuk
ViT
97
215
0
17 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
135
3,015
0
17 Dec 2020
Autotelic Agents with Intrinsically Motivated Goal-Conditioned
  Reinforcement Learning: a Short Survey
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey
Cédric Colas
Tristan Karch
Olivier Sigaud
Pierre-Yves Oudeyer
154
95
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
82
628
0
17 Dec 2020
A Generalization of Transformer Networks to Graphs
A Generalization of Transformer Networks to Graphs
Vijay Prakash Dwivedi
Xavier Bresson
AI4CE
115
763
0
17 Dec 2020
Few-shot Sequence Learning with Transformers
Few-shot Sequence Learning with Transformers
Lajanugen Logeswaran
Ann Lee
Myle Ott
Honglak Lee
MarcÁurelio Ranzato
Arthur Szlam
ViT
64
12
0
17 Dec 2020
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene
  Contexts
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
Ji Hou
Benjamin Graham
Matthias Nießner
Saining Xie
3DPC
132
270
0
16 Dec 2020
Open Problems in Cooperative AI
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
116
203
0
15 Dec 2020
Attention over learned object embeddings enables complex visual
  reasoning
Attention over learned object embeddings enables complex visual reasoning
David Ding
Felix Hill
Adam Santoro
Malcolm Reynolds
M. Botvinick
OCL
110
71
0
15 Dec 2020
Nested Named Entity Recognition with Partially-Observed TreeCRFs
Nested Named Entity Recognition with Partially-Observed TreeCRFs
Yao Fu
Chuanqi Tan
Mosha Chen
Songfang Huang
Fei Huang
150
52
0
15 Dec 2020
Gegelati: Lightweight Artificial Intelligence through Generic and
  Evolvable Tangled Program Graphs
Gegelati: Lightweight Artificial Intelligence through Generic and Evolvable Tangled Program Graphs
K. Desnos
Nicolas Sourbier
Pierre-Yves Raumer
Olivier Gesny
Maxime Pelcat
33
12
0
15 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional
  Task
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Scharli
76
17
0
15 Dec 2020
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Jiayi Zhang
Zhi Cui
Xiaoqiang Xia
Yalong Guo
Yanran Li
Chen Wei
Jianwei Cui
71
18
0
15 Dec 2020
Extracting Training Data from Large Language Models
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
Basel Alomair
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAUSILM
548
1,963
0
14 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
92
406
0
14 Dec 2020
Communication-Efficient Federated Learning with Compensated
  Overlap-FedAvg
Communication-Efficient Federated Learning with Compensated Overlap-FedAvg
Yuhao Zhou
Qing Ye
Jiancheng Lv
FedML
61
127
0
12 Dec 2020
When is Memorization of Irrelevant Training Data Necessary for
  High-Accuracy Learning?
When is Memorization of Irrelevant Training Data Necessary for High-Accuracy Learning?
Gavin Brown
Mark Bun
Vitaly Feldman
Adam D. Smith
Kunal Talwar
314
102
0
11 Dec 2020
Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct
  Feedback Alignment
Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment
Julien Launay
Iacopo Poli
Kilian Muller
Gustave Pariente
I. Carron
L. Daudet
Florent Krzakala
S. Gigan
MoE
75
18
0
11 Dec 2020
ParsiNLU: A Suite of Language Understanding Challenges for Persian
ParsiNLU: A Suite of Language Understanding Challenges for Persian
Daniel Khashabi
Arman Cohan
Siamak Shakeri
Pedram Hosseini
Pouya Pezeshkpour
...
Niloofar Safi Samghabadi
Mahsa Shafaei
Saber Sheybani
Ali Tazarv
Yadollah Yaghoobzadeh
62
44
0
11 Dec 2020
Towards Neural Programming Interfaces
Towards Neural Programming Interfaces
Zachary Brown
Nathaniel R. Robinson
David Wingate
Nancy Fulda
AI4CE
116
5
0
10 Dec 2020
Imitating Interactive Intelligence
Imitating Interactive Intelligence
Josh Abramson
Arun Ahuja
Iain Barr
Arthur Brussee
Federico Carnevale
...
Greg Wayne
Duncan Williams
Nathaniel Wong
Chen Yan
Rui Zhu
LM&Ro
89
71
0
10 Dec 2020
Topological Planning with Transformers for Vision-and-Language
  Navigation
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Nathan Tsoi
Silvio Savarese
LM&Ro
95
101
0
09 Dec 2020
Positional Encoding as Spatial Inductive Bias in GANs
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu
Xintao Wang
Kai-xiang Chen
Bolei Zhou
Chen Change Loy
GAN
97
90
0
09 Dec 2020
On the Binding Problem in Artificial Neural Networks
On the Binding Problem in Artificial Neural Networks
Klaus Greff
Sjoerd van Steenkiste
Jürgen Schmidhuber
OCL
308
267
0
09 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot
  Navigation in Dynamic Human Environments
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
77
51
0
08 Dec 2020
CTRLsum: Towards Generic Controllable Text Summarization
CTRLsum: Towards Generic Controllable Text Summarization
Junxian He
Wojciech Kry'sciñski
Bryan McCann
Nazneen Rajani
Caiming Xiong
281
142
0
08 Dec 2020
Parallel Training of Deep Networks with Local Updates
Parallel Training of Deep Networks with Local Updates
Michael Laskin
Luke Metz
Seth Nabarrao
Mark Saroufim
Badreddine Noune
Carlo Luschi
Jascha Narain Sohl-Dickstein
Pieter Abbeel
FedML
117
27
0
07 Dec 2020
Foundations for Near-Term Quantum Natural Language Processing
Foundations for Near-Term Quantum Natural Language Processing
B. Coecke
G. Felice
K. Meichanetzidis
Alexis Toumi
119
87
0
07 Dec 2020
When Do Curricula Work?
When Do Curricula Work?
Xiaoxia Wu
Ethan Dyer
Behnam Neyshabur
92
118
0
05 Dec 2020
Unleashing the Tiger: Inference Attacks on Split Learning
Unleashing the Tiger: Inference Attacks on Split Learning
Dario Pasquini
G. Ateniese
M. Bernaschi
FedML
114
152
0
04 Dec 2020
RPT: Relational Pre-trained Transformer Is Almost All You Need towards
  Democratizing Data Preparation
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation
Nan Tang
Ju Fan
Fangyi Li
Jianhong Tu
Xiaoyong Du
Guoliang Li
Samuel Madden
M. Ouzzani
93
76
0
04 Dec 2020
Classification and reconstruction of optical quantum states with deep
  neural networks
Classification and reconstruction of optical quantum states with deep neural networks
Shahnawaz Ahmed
C. Muñoz
Franco Nori
A. F. Kockum
88
64
0
03 Dec 2020
Towards an AI assistant for power grid operators
Towards an AI assistant for power grid operators
Antoine Marot
Alexandre Rozier
Matthieu Dussartre
Laure Crochepierre
Benjamin Donnot
AI4CE
59
10
0
03 Dec 2020
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
K. Mangalam
Yang An
Harshayu Girase
Jitendra Malik
54
280
0
02 Dec 2020
Previous
123...238239240...244245246
Next