ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXivPDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 11,513 papers shown
Title
Pchatbot: A Large-Scale Dataset for Personalized Chatbot
Pchatbot: A Large-Scale Dataset for Personalized Chatbot
Hongjin Qian
Xiaohe Li
Hanxun Zhong
Yu Guo
Yueyuan Ma
Yutao Zhu
Zhanliang Liu
Zhanliang Liu
Ji-Rong Wen
41
43
0
28 Sep 2020
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense
  Reasoning
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
Ye Liu
Yao Wan
Lifang He
Hao Peng
Philip S. Yu
32
188
0
26 Sep 2020
Machine Knowledge: Creation and Curation of Comprehensive Knowledge
  Bases
Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases
Gerhard Weikum
Luna Dong
Simon Razniewski
Fabian M. Suchanek
34
125
0
24 Sep 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language
  Models
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
46
1,132
0
24 Sep 2020
Controlling Style in Generated Dialogue
Controlling Style in Generated Dialogue
Eric Michael Smith
Diana Gonzalez-Rico
Emily Dinan
Y-Lan Boureau
AI4CE
36
50
0
22 Sep 2020
VirtualFlow: Decoupling Deep Learning Models from the Underlying
  Hardware
VirtualFlow: Decoupling Deep Learning Models from the Underlying Hardware
Andrew Or
Haoyu Zhang
M. Freedman
17
9
0
20 Sep 2020
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning
  in NLP Using Fewer Parameters & Less Data
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan Pilault
Amine Elhattami
C. Pal
CLL
MoE
30
89
0
19 Sep 2020
Pruning Neural Networks at Initialization: Why are We Missing the Mark?
Pruning Neural Networks at Initialization: Why are We Missing the Mark?
Jonathan Frankle
Gintare Karolina Dziugaite
Daniel M. Roy
Michael Carbin
14
238
0
18 Sep 2020
Generation-Augmented Retrieval for Open-domain Question Answering
Generation-Augmented Retrieval for Open-domain Question Answering
Yuning Mao
Pengcheng He
Xiaodong Liu
Yelong Shen
Jianfeng Gao
Jiawei Han
Weizhu Chen
RALM
42
238
0
17 Sep 2020
Self-Supervised Meta-Learning for Few-Shot Natural Language
  Classification Tasks
Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Tsendsuren Munkhdalai
Andrew McCallum
SSL
VLM
33
87
0
17 Sep 2020
Review: Deep Learning in Electron Microscopy
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
38
79
0
17 Sep 2020
Efficient Transformer-based Large Scale Language Representations using
  Hardware-friendly Block Structured Pruning
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing Li
Zhenglun Kong
Tianyun Zhang
Ji Li
Zechao Li
Hang Liu
Caiwen Ding
VLM
32
64
0
17 Sep 2020
Generative Language-Grounded Policy in Vision-and-Language Navigation
  with Bayes' Rule
Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule
Shuhei Kurita
Kyunghyun Cho
LM&Ro
17
23
0
16 Sep 2020
Automated Source Code Generation and Auto-completion Using Deep
  Learning: Comparing and Discussing Current Language-Model-Related Approaches
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-Benito
Sanjay Vishwakarma
Francisco Martín-Fernández
Ismael Faro Ibm Quantum
22
30
0
16 Sep 2020
Evaluating representations by the complexity of learning low-loss
  predictors
Evaluating representations by the complexity of learning low-loss predictors
William F. Whitney
M. Song
David Brandfonbrener
Jaan Altosaar
Kyunghyun Cho
25
23
0
15 Sep 2020
Augmented Natural Language for Generative Sequence Labeling
Augmented Natural Language for Generative Sequence Labeling
Ben Athiwaratkun
Cicero Nogueira dos Santos
Jason Krone
Bing Xiang
VLM
19
61
0
15 Sep 2020
Critical Thinking for Language Models
Critical Thinking for Language Models
Gregor Betz
Christian Voigt
Kyle Richardson
SyDa
ReLM
LRM
AI4CE
26
35
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
51
956
0
15 Sep 2020
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre
P. Trempe
Amal Zouaq
Sarath Chandar
25
43
0
15 Sep 2020
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffie
Alex Newhouse
20
149
0
15 Sep 2020
Efficient Transformers: A Survey
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
114
1,103
0
14 Sep 2020
GeDi: Generative Discriminator Guided Sequence Generation
GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause
Akhilesh Deepak Gotmare
Bryan McCann
N. Keskar
Chenyu You
R. Socher
Nazneen Rajani
56
391
0
14 Sep 2020
The Hardware Lottery
The Hardware Lottery
Sara Hooker
27
203
0
14 Sep 2020
Abstract Neural Networks
Abstract Neural Networks
Matthew Sotoudeh
Aditya V. Thakur
8
19
0
11 Sep 2020
The Intriguing Relation Between Counterfactual Explanations and
  Adversarial Examples
The Intriguing Relation Between Counterfactual Explanations and Adversarial Examples
Timo Freiesleben
GAN
43
62
0
11 Sep 2020
SoK: Certified Robustness for Deep Neural Networks
SoK: Certified Robustness for Deep Neural Networks
Linyi Li
Tao Xie
Bo-wen Li
AAML
33
128
0
09 Sep 2020
Adversarial Watermarking Transformer: Towards Tracing Text Provenance
  with Data Hiding
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar Abdelnabi
Mario Fritz
WaLM
28
44
0
07 Sep 2020
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of
  claims using transformer-based models
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan Williams
Paul Rodrigues
Valerie Novak
42
42
0
05 Sep 2020
Learning to summarize from human feedback
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
56
1,994
0
02 Sep 2020
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise
  Sparsity
Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity
Cong Guo
B. Hsueh
Jingwen Leng
Yuxian Qiu
Yue Guan
Zehuan Wang
Xiaoying Jia
Xipeng Li
Minyi Guo
Yuhao Zhu
35
83
0
29 Aug 2020
learn2learn: A Library for Meta-Learning Research
learn2learn: A Library for Meta-Learning Research
Sébastien M. R. Arnold
Praateek Mahajan
Debajyoti Datta
Ian Bunner
Konstantinos Saitas Zarkias
34
95
0
27 Aug 2020
A Survey of Evaluation Metrics Used for NLG Systems
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
33
230
0
27 Aug 2020
Periodic Stochastic Gradient Descent with Momentum for Decentralized
  Training
Periodic Stochastic Gradient Descent with Momentum for Decentralized Training
Hongchang Gao
Heng-Chiao Huang
23
25
0
24 Aug 2020
Tearing Down the Memory Wall
Tearing Down the Memory Wall
Zaid Qureshi
Vikram Sharma Mailthody
S. Min
I-Hsin Chung
Jinjun Xiong
Wen-mei W. Hwu
GNN
31
9
0
24 Aug 2020
Very Deep Transformers for Neural Machine Translation
Very Deep Transformers for Neural Machine Translation
Xiaodong Liu
Kevin Duh
Liyuan Liu
Jianfeng Gao
19
102
0
18 Aug 2020
PopMAG: Pop Music Accompaniment Generation
PopMAG: Pop Music Accompaniment Generation
Yi Ren
Jinzheng He
Xu Tan
Tao Qin
Zhou Zhao
Tie-Yan Liu
36
115
0
18 Aug 2020
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
Zihan Liu
Zhaojiang Lin
Pascale Fung
43
58
0
14 Aug 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Fei Wu
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
38
209
0
13 Aug 2020
Communication-Efficient and Distributed Learning Over Wireless Networks:
  Principles and Applications
Communication-Efficient and Distributed Learning Over Wireless Networks: Principles and Applications
Jihong Park
S. Samarakoon
Anis Elgabli
Joongheon Kim
M. Bennis
Seong-Lyun Kim
Mérouane Debbah
39
161
0
06 Aug 2020
Aligning AI With Shared Human Values
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
Jingkai Li
D. Song
Jacob Steinhardt
63
519
0
05 Aug 2020
Word meaning in minds and machines
Word meaning in minds and machines
Brenden M. Lake
G. Murphy
NAI
15
117
0
04 Aug 2020
PowerGossip: Practical Low-Rank Communication Compression in
  Decentralized Deep Learning
PowerGossip: Practical Low-Rank Communication Compression in Decentralized Deep Learning
Thijs Vogels
Sai Praneeth Karimireddy
Martin Jaggi
FedML
11
54
0
04 Aug 2020
Representation Learning with Video Deep InfoMax
Representation Learning with Video Deep InfoMax
R. Devon Hjelm
Philip Bachman
SSL
MDE
26
28
0
27 Jul 2020
Bollyrics: Automatic Lyrics Generator for Romanised Hindi
Bollyrics: Automatic Lyrics Generator for Romanised Hindi
N. Jain
A. Chauhan
Atharva Chewale
Ojas Mithbavkar
Ujjaval Shah
M. Singh
25
1
0
25 Jul 2020
Contextualizing Enhances Gradient Based Meta Learning
Contextualizing Enhances Gradient Based Meta Learning
Evan Vogelbaum
Rumen Dangovski
L. Jing
Marin Soljacic
34
3
0
17 Jul 2020
Automated Detection and Forecasting of COVID-19 using Deep Learning
  Techniques: A Review
Automated Detection and Forecasting of COVID-19 using Deep Learning Techniques: A Review
A. Shoeibi
Marjane Khodatars
M. Jafari
Navid Ghassemi
Delaram Sadeghi
...
Z. Sani
F. Khozeimeh
S. Nahavandi
U. Acharya
Juan M Gorriz
51
178
0
16 Jul 2020
Deep Learning in Protein Structural Modeling and Design
Deep Learning in Protein Structural Modeling and Design
Wenhao Gao
S. Mahajan
Jeremias Sulam
Jeffrey J. Gray
31
159
0
16 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive
  Representations
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
41
310
0
12 Jul 2020
Leveraging Passage Retrieval with Generative Models for Open Domain
  Question Answering
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
Gautier Izacard
Edouard Grave
RALM
67
1,111
0
02 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
64
82
0
02 Jul 2020
Previous
123...228229230231
Next