ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXivPDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 11,078 papers shown
Title
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
Kunhao Zheng
Jesse Michael Han
Stanislas Polu
AIMat
35
152
0
31 Aug 2021
Quantized Convolutional Neural Networks Through the Lens of Partial
  Differential Equations
Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations
Ido Ben-Yair
Gil Ben Shalom
Moshe Eliasof
Eran Treister
MQ
33
5
0
31 Aug 2021
It's not Rocket Science : Interpreting Figurative Language in Narratives
It's not Rocket Science : Interpreting Figurative Language in Narratives
Tuhin Chakrabarty
Yejin Choi
Vered Shwartz
22
55
0
31 Aug 2021
Sentence Bottleneck Autoencoders from Transformer Language Models
Sentence Bottleneck Autoencoders from Transformer Language Models
Ivan Montero
Nikolaos Pappas
Noah A. Smith
AI4CE
22
28
0
31 Aug 2021
APS: Active Pretraining with Successor Features
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
47
119
0
31 Aug 2021
LightNER: A Lightweight Tuning Paradigm for Low-resource NER via
  Pluggable Prompting
LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting
Xiang Chen
Lei Li
Shumin Deng
Chuanqi Tan
Changliang Xu
Fei Huang
Luo Si
Huajun Chen
Ningyu Zhang
VLM
36
65
0
31 Aug 2021
The five Is: Key principles for interpretable and safe conversational AI
The five Is: Key principles for interpretable and safe conversational AI
M. Wahde
M. Virgolin
36
5
0
31 Aug 2021
Discretized Integrated Gradients for Explaining Language Models
Discretized Integrated Gradients for Explaining Language Models
Soumya Sanyal
Xiang Ren
FAtt
17
53
0
31 Aug 2021
Towards Out-Of-Distribution Generalization: A Survey
Towards Out-Of-Distribution Generalization: A Survey
Jiashuo Liu
Zheyan Shen
Yue He
Xingxuan Zhang
Renzhe Xu
Han Yu
Peng Cui
CML
OOD
55
517
0
31 Aug 2021
How Does Adversarial Fine-Tuning Benefit BERT?
How Does Adversarial Fine-Tuning Benefit BERT?
J. Ebrahimi
Hao Yang
Wei Zhang
AAML
26
4
0
31 Aug 2021
Semi-Supervised Exaggeration Detection of Health Science Press Releases
Semi-Supervised Exaggeration Detection of Health Science Press Releases
Dustin Wright
Isabelle Augenstein
33
12
0
30 Aug 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Bo-wen Li
Xinyang Jiang
Donglin Bai
Yuge Zhang
Ningxin Zheng
Xuanyi Dong
Lu Liu
Yuqing Yang
Dongsheng Li
14
10
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text
  Understanding and Generation
LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation
Jian Guan
Zhuoer Feng
Yamei Chen
Ru He
Xiaoxi Mao
Changjie Fan
Minlie Huang
39
32
0
30 Aug 2021
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural
  Language Understanding
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding
Guoqing Zheng
Giannis Karamanolakis
Kai Shu
Ahmed Hassan Awadallah
SSL
21
1
0
28 Aug 2021
Layer-wise Model Pruning based on Mutual Information
Layer-wise Model Pruning based on Mutual Information
Chun Fan
Jiwei Li
Xiang Ao
Fei Wu
Yuxian Meng
Xiaofei Sun
46
19
0
28 Aug 2021
Self-training Improves Pre-training for Few-shot Learning in
  Task-oriented Dialog Systems
Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi
Wanhao Zhou
Feng Cai
Lingjing Kong
Minlie Huang
Boi Faltings
27
32
0
28 Aug 2021
LocTex: Learning Data-Efficient Visual Representations from Localized
  Textual Supervision
LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision
Zhijian Liu
Simon Stent
Jie Li
John Gideon
Song Han
VLM
25
10
0
26 Aug 2021
A Survey on Automated Fact-Checking
A Survey on Automated Fact-Checking
Zhijiang Guo
M. Schlichtkrull
Andreas Vlachos
27
459
0
26 Aug 2021
Just Say No: Analyzing the Stance of Neural Dialogue Generation in
  Offensive Contexts
Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts
Ashutosh Baheti
Maarten Sap
Alan Ritter
Mark O. Riedl
21
84
0
26 Aug 2021
Bilateral Denoising Diffusion Models
Bilateral Denoising Diffusion Models
Max W. Y. Lam
Jun Wang
Rongjie Huang
Dan Su
Dong Yu
DiffM
27
42
0
26 Aug 2021
Few-shot Visual Relationship Co-localization
Few-shot Visual Relationship Co-localization
Revant Teotia
Vaibhav Mishra
Mayank Maheshwari
Anand Mishra
20
1
0
26 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
780
0
24 Aug 2021
Greenformers: Improving Computation and Memory Efficiency in Transformer
  Models via Low-Rank Approximation
Greenformers: Improving Computation and Memory Efficiency in Transformer Models via Low-Rank Approximation
Samuel Cahyawijaya
26
12
0
24 Aug 2021
Prompt-Learning for Fine-Grained Entity Typing
Prompt-Learning for Fine-Grained Entity Typing
Ning Ding
Yulin Chen
Xu Han
Guangwei Xu
Pengjun Xie
Haitao Zheng
Zhiyuan Liu
Juan-Zi Li
Hong-Gee Kim
26
156
0
24 Aug 2021
Explaining Bayesian Neural Networks
Explaining Bayesian Neural Networks
Kirill Bykov
Marina M.-C. Höhne
Adelaida Creosteanu
Klaus-Robert Muller
Frederick Klauschen
Shinichi Nakajima
Marius Kloft
BDL
AAML
34
25
0
23 Aug 2021
Decomposition Multi-Objective Evolutionary Optimization: From
  State-of-the-Art to Future Opportunities
Decomposition Multi-Objective Evolutionary Optimization: From State-of-the-Art to Future Opportunities
Ke Li
19
9
0
21 Aug 2021
Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code
  Contributions
Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions
Hammond Pearce
Baleegh Ahmad
Benjamin Tan
Brendan Dolan-Gavitt
Ramesh Karri
SILM
34
392
0
20 Aug 2021
SplitGuard: Detecting and Mitigating Training-Hijacking Attacks in Split
  Learning
SplitGuard: Detecting and Mitigating Training-Hijacking Attacks in Split Learning
Ege Erdogan
Alptekin Kupcu
A. E. Cicek
AAML
22
32
0
20 Aug 2021
Knowledge Perceived Multi-modal Pretraining in E-commerce
Knowledge Perceived Multi-modal Pretraining in E-commerce
Yushan Zhu
Huaixiao Tou
Wen Zhang
Ganqiang Ye
Hui Chen
Ningyu Zhang
Huajun Chen
28
32
0
20 Aug 2021
UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label
  Inference Attacks Against Split Learning
UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label Inference Attacks Against Split Learning
Ege Erdogan
Alptekin Kupcu
A. E. Cicek
FedML
MIACV
35
77
0
20 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for
  Autoregressive Image Synthesis
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
38
156
0
19 Aug 2021
Neural Operator: Learning Maps Between Function Spaces
Neural Operator: Learning Maps Between Function Spaces
Nikola B. Kovachki
Zong-Yi Li
Burigede Liu
Kamyar Azizzadenesheli
K. Bhattacharya
Andrew M. Stuart
Anima Anandkumar
AI4CE
52
440
0
19 Aug 2021
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive
  Machine Translation
MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation
Pan Xie
Zexian Li
Xiaohui Hu
34
11
0
19 Aug 2021
Moser Flow: Divergence-based Generative Modeling on Manifolds
Moser Flow: Divergence-based Generative Modeling on Manifolds
N. Rozen
Aditya Grover
Maximilian Nickel
Y. Lipman
DRL
AI4CE
27
57
0
18 Aug 2021
Data Pricing in Machine Learning Pipelines
Data Pricing in Machine Learning Pipelines
Zicun Cong
Xuan Luo
J. Pei
Feida Zhu
Yong Zhang
28
46
0
18 Aug 2021
Toward a `Standard Model' of Machine Learning
Toward a `Standard Model' of Machine Learning
Zhiting Hu
Eric P. Xing
37
12
0
17 Aug 2021
Continual Backprop: Stochastic Gradient Descent with Persistent
  Randomness
Continual Backprop: Stochastic Gradient Descent with Persistent Randomness
Shibhansh Dohare
R. Sutton
A. R. Mahmood
CLL
47
80
0
13 Aug 2021
Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual
  Representations
Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations
Josh Beal
Hao Wu
Dong Huk Park
Andrew Zhai
Dmitry Kislyuk
ViT
21
29
0
12 Aug 2021
PatrickStar: Parallel Training of Pre-trained Models via Chunk-based
  Memory Management
PatrickStar: Parallel Training of Pre-trained Models via Chunk-based Memory Management
Jiarui Fang
Zilin Zhu
Shenggui Li
Hui Su
Yang Yu
Jie Zhou
Yang You
VLM
37
24
0
12 Aug 2021
Reimagining an autonomous vehicle
Reimagining an autonomous vehicle
Jeffrey Hawke
E. Haibo
Vijay Badrinarayanan
Alex Kendall
43
11
0
12 Aug 2021
Video Transformer for Deepfake Detection with Incremental Learning
Video Transformer for Deepfake Detection with Incremental Learning
Sohail Ahmed Khan
Hang Dai
ViT
24
62
0
11 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
45
53
0
10 Aug 2021
Privacy-Preserving Machine Learning: Methods, Challenges and Directions
Privacy-Preserving Machine Learning: Methods, Challenges and Directions
Runhua Xu
Nathalie Baracaldo
J. Joshi
32
100
0
10 Aug 2021
Noisy Channel Language Model Prompting for Few-Shot Text Classification
Noisy Channel Language Model Prompting for Few-Shot Text Classification
Sewon Min
Michael Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
VLM
40
218
0
09 Aug 2021
Token Shift Transformer for Video Classification
Token Shift Transformer for Video Classification
Hao Zhang
Y. Hao
Chong-Wah Ngo
ViT
29
116
0
05 Aug 2021
Deep multi-task mining Calabi-Yau four-folds
Deep multi-task mining Calabi-Yau four-folds
Harold Erbin
Riccardo Finotello
Robin Schneider
M. Tamaazousti
35
17
0
04 Aug 2021
How to Query Language Models?
How to Query Language Models?
Leonard Adolphs
S. Dhuliawala
Thomas Hofmann
KELM
24
15
0
04 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple
  Constraints
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
43
76
0
04 Aug 2021
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain
  Management
Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management
Cécile Logé
Emily L. Ross
D. Dadey
Saahil Jain
A. Saporta
A. Ng
Pranav Rajpurkar
24
22
0
03 Aug 2021
Previous
123...209210211...220221222
Next