ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXivPDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 10,752 papers shown
Title
I-BERT: Integer-only BERT Quantization
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
105
341
0
05 Jan 2021
Transformers in Vision: A Survey
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
227
2,431
0
04 Jan 2021
Retrieving and Reading: A Comprehensive Survey on Open-domain Question
  Answering
Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering
Fengbin Zhu
Wenqiang Lei
Chao Wang
Jianming Zheng
Soujanya Poria
Tat-Seng Chua
RALM
213
252
0
04 Jan 2021
Few-Shot Question Answering by Pretraining Span Selection
Few-Shot Question Answering by Pretraining Span Selection
Ori Ram
Yuval Kirstain
Jonathan Berant
Amir Globerson
Omer Levy
30
97
0
02 Jan 2021
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
Wangchunshu Zhou
Tao Ge
Canwen Xu
Ke Xu
Furu Wei
LRM
16
15
0
02 Jan 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
20
4,088
0
01 Jan 2021
WARP: Word-level Adversarial ReProgramming
WARP: Word-level Adversarial ReProgramming
Karen Hambardzumyan
Hrant Khachatrian
Jonathan May
AAML
254
342
0
01 Jan 2021
Studying Strategically: Learning to Mask for Closed-book QA
Studying Strategically: Learning to Mask for Closed-book QA
Qinyuan Ye
Belinda Z. Li
Sinong Wang
Benjamin Bolte
Hao Ma
Wen-tau Yih
Xiang Ren
Madian Khabsa
OffRL
21
11
0
31 Dec 2020
Shortformer: Better Language Modeling using Shorter Inputs
Shortformer: Better Language Modeling using Shorter Inputs
Ofir Press
Noah A. Smith
M. Lewis
230
89
0
31 Dec 2020
Making Pre-trained Language Models Better Few-shot Learners
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
243
1,924
0
31 Dec 2020
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots
  Matters
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters
Mengjie Zhao
Yi Zhu
Ehsan Shareghi
Ivan Vulić
Roi Reichart
Anna Korhonen
Hinrich Schütze
32
64
0
31 Dec 2020
AraGPT2: Pre-Trained Transformer for Arabic Language Generation
AraGPT2: Pre-Trained Transformer for Arabic Language Generation
Wissam Antoun
Fady Baly
Hazem M. Hajj
VLM
19
103
0
31 Dec 2020
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Ahmad Rashid
Vasileios Lioutas
Abbas Ghaddar
Mehdi Rezagholizadeh
21
27
0
31 Dec 2020
Reservoir Transformers
Reservoir Transformers
Sheng Shen
Alexei Baevski
Ari S. Morcos
Kurt Keutzer
Michael Auli
Douwe Kiela
35
17
0
30 Dec 2020
Transformer Feed-Forward Layers Are Key-Value Memories
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva
R. Schuster
Jonathan Berant
Omer Levy
KELM
39
745
0
29 Dec 2020
A Theoretical Analysis of the Repetition Problem in Text Generation
A Theoretical Analysis of the Repetition Problem in Text Generation
Z. Fu
Wai Lam
Anthony Man-Cho So
Bei Shi
77
90
0
29 Dec 2020
POPO: Pessimistic Offline Policy Optimization
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
32
10
0
26 Dec 2020
Automated Lay Language Summarization of Biomedical Scientific Reviews
Automated Lay Language Summarization of Biomedical Scientific Reviews
Yue Guo
Weijian Qiu
Yizhong Wang
T. Cohen
35
77
0
23 Dec 2020
LieTransformer: Equivariant self-attention for Lie Groups
LieTransformer: Equivariant self-attention for Lie Groups
M. Hutchinson
Charline Le Lan
Sheheryar Zaidi
Emilien Dupont
Yee Whye Teh
Hyunjik Kim
26
111
0
20 Dec 2020
Quantum Optical Convolutional Neural Network: A Novel Image Recognition
  Framework for Quantum Computing
Quantum Optical Convolutional Neural Network: A Novel Image Recognition Framework for Quantum Computing
Rishab Parthasarathy
Rohan T. Bhowmik
11
25
0
19 Dec 2020
Exploring Fluent Query Reformulations with Text-to-Text Transformers and
  Reinforcement Learning
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
154
5
0
18 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
64
2,819
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
34
613
0
17 Dec 2020
A Generalization of Transformer Networks to Graphs
A Generalization of Transformer Networks to Graphs
Vijay Prakash Dwivedi
Xavier Bresson
AI4CE
29
720
0
17 Dec 2020
Few-shot Sequence Learning with Transformers
Few-shot Sequence Learning with Transformers
Lajanugen Logeswaran
Ann Lee
Myle Ott
Honglak Lee
MarcÁurelio Ranzato
Arthur Szlam
ViT
39
12
0
17 Dec 2020
Applying Deutsch's concept of good explanations to artificial
  intelligence and neuroscience -- an initial exploration
Applying Deutsch's concept of good explanations to artificial intelligence and neuroscience -- an initial exploration
Daniel C. Elton
20
4
0
16 Dec 2020
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene
  Contexts
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts
Ji Hou
Benjamin Graham
Matthias Nießner
Saining Xie
3DPC
56
263
0
16 Dec 2020
Responsible Disclosure of Generative Models Using Scalable
  Fingerprinting
Responsible Disclosure of Generative Models Using Scalable Fingerprinting
Ning Yu
Vladislav Skripniuk
Dingfan Chen
Larry S. Davis
Mario Fritz
WIGM
46
89
0
16 Dec 2020
Open Problems in Cooperative AI
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
37
199
0
15 Dec 2020
Attention over learned object embeddings enables complex visual
  reasoning
Attention over learned object embeddings enables complex visual reasoning
David Ding
Felix Hill
Adam Santoro
Malcolm Reynolds
M. Botvinick
OCL
22
69
0
15 Dec 2020
Nested Named Entity Recognition with Partially-Observed TreeCRFs
Nested Named Entity Recognition with Partially-Observed TreeCRFs
Yao Fu
Chuanqi Tan
Mosha Chen
Songfang Huang
Fei Huang
70
48
0
15 Dec 2020
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Jiayi Zhang
Zhi Cui
Xiaoqiang Xia
Yalong Guo
Yanran Li
Chen Wei
Jianwei Cui
20
17
0
15 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
13
385
0
14 Dec 2020
Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct
  Feedback Alignment
Hardware Beyond Backpropagation: a Photonic Co-Processor for Direct Feedback Alignment
Julien Launay
Iacopo Poli
Kilian Muller
Gustave Pariente
I. Carron
L. Daudet
Florent Krzakala
S. Gigan
MoE
15
18
0
11 Dec 2020
Towards Neural Programming Interfaces
Towards Neural Programming Interfaces
Zachary Brown
Nathaniel R. Robinson
David Wingate
Nancy Fulda
AI4CE
20
5
0
10 Dec 2020
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at
  Reliable OOD Detection
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at Reliable OOD Detection
Dennis Ulmer
Giovanni Cina
OODD
35
31
0
09 Dec 2020
Topological Planning with Transformers for Vision-and-Language
  Navigation
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Marynel Vázquez
Silvio Savarese
LM&Ro
27
99
0
09 Dec 2020
Positional Encoding as Spatial Inductive Bias in GANs
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu
Xintao Wang
Kai-xiang Chen
Bolei Zhou
Chen Change Loy
GAN
27
89
0
09 Dec 2020
NavRep: Unsupervised Representations for Reinforcement Learning of Robot
  Navigation in Dynamic Human Environments
NavRep: Unsupervised Representations for Reinforcement Learning of Robot Navigation in Dynamic Human Environments
Daniel Dugas
Juan I. Nieto
Roland Siegwart
Jen Jen Chung
SSL
24
51
0
08 Dec 2020
CTRLsum: Towards Generic Controllable Text Summarization
CTRLsum: Towards Generic Controllable Text Summarization
Junxian He
Wojciech Kry'sciñski
Bryan McCann
Nazneen Rajani
Caiming Xiong
216
138
0
08 Dec 2020
Unleashing the Tiger: Inference Attacks on Split Learning
Unleashing the Tiger: Inference Attacks on Split Learning
Dario Pasquini
G. Ateniese
M. Bernaschi
FedML
28
147
0
04 Dec 2020
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting
K. Mangalam
Yang An
Harshayu Girase
Jitendra Malik
19
264
0
02 Dec 2020
Exploiting BERT to improve aspect-based sentiment analysis performance
  on Persian language
Exploiting BERT to improve aspect-based sentiment analysis performance on Persian language
H. Jafarian
Amirhosein Taghavi
Alireza Javaheri
Reza Rawassizadeh
23
19
0
02 Dec 2020
Modifying Memories in Transformer Models
Modifying Memories in Transformer Models
Chen Zhu
A. S. Rawat
Manzil Zaheer
Srinadh Bhojanapalli
Daliang Li
Felix X. Yu
Sanjiv Kumar
KELM
23
192
0
01 Dec 2020
Data-Free Model Extraction
Data-Free Model Extraction
Jean-Baptiste Truong
Pratyush Maini
R. Walls
Nicolas Papernot
MIACV
15
181
0
30 Nov 2020
Argument from Old Man's View: Assessing Social Bias in Argumentation
Argument from Old Man's View: Assessing Social Bias in Argumentation
Maximilian Spliethover
Henning Wachsmuth
6
20
0
24 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
40
66
0
24 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them
  on Images
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDL
VLM
56
337
0
20 Nov 2020
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks
Fanchao Qi
Yangyi Chen
Mukai Li
Yuan Yao
Zhiyuan Liu
Maosong Sun
AAML
28
264
0
20 Nov 2020
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture
  using Feedback-Modulated Delay Loops
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops
Florian Stelzer
André Röhm
Raul Vicente
Ingo Fischer
University of Tartu
AI4CE
19
46
0
19 Nov 2020
Previous
123...211212213214215216
Next