ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,355 papers shown
Title
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Shigang Li
Kazuki Osawa
Torsten Hoefler
160
32
0
14 Sep 2022
Out of One, Many: Using Language Models to Simulate Human Samples
Out of One, Many: Using Language Models to Simulate Human Samples
Lisa P. Argyle
Ethan C. Busby
Nancy Fulda
Joshua R Gubler
Christopher Rytting
David Wingate
SyDa
105
615
0
14 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Xi Chen
Tianlin Li
Soravit Changpinyo
A. Piergiovanni
Piotr Padlewski
...
Andreas Steiner
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
MLLMVLM
205
741
0
14 Sep 2022
vec2text with Round-Trip Translations
vec2text with Round-Trip Translations
Geoffrey Cideron
Sertan Girgin
Anton Raichuk
Olivier Pietquin
Olivier Bachem
Léonard Hussenot
91
3
0
14 Sep 2022
Language Chameleon: Transformation analysis between languages using
  Cross-lingual Post-training based on Pre-trained language models
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models
Suhyune Son
Chanjun Park
Jungseob Lee
Midan Shim
Chanhee Lee
Yoonna Jang
Jaehyung Seo
Heu-Jeoung Lim
71
0
0
14 Sep 2022
Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot
  Challenge on Conversational Task Assistance
Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance
Anna Gottardi
Osman Ipek
Giuseppe Castellucci
Shui Hu
Lavina Vaz
...
Oleg Rokhlenko
Kate Bland
Eugene Agichtein
R. Ghanadan
Y. Maarek
84
23
0
13 Sep 2022
Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks
  from The New Yorker Caption Contest
Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Jack Hessel
Ana Marasović
Jena D. Hwang
Lillian Lee
Jeff Da
Rowan Zellers
Robert Mankoff
Yejin Choi
VLM
112
91
0
13 Sep 2022
Improving Language Model Prompting in Support of Semi-autonomous Task
  Learning
Improving Language Model Prompting in Support of Semi-autonomous Task Learning
James R. Kirk
R. Wray
Peter Lindes
John E. Laird
LRM
64
11
0
13 Sep 2022
Revisiting Neural Scaling Laws in Language and Vision
Revisiting Neural Scaling Laws in Language and Vision
Ibrahim Alabdulmohsin
Behnam Neyshabur
Xiaohua Zhai
235
111
0
13 Sep 2022
Vision Transformers for Action Recognition: A Survey
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Mian
ViT
82
45
0
13 Sep 2022
Rule-adhering synthetic data -- the lingua franca of learning
Rule-adhering synthetic data -- the lingua franca of learning
Michaela D. Platzer
Ivona Krchova
119
2
0
12 Sep 2022
An Embedding-Based Grocery Search Model at Instacart
An Embedding-Based Grocery Search Model at Instacart
Yuqing Xie
Taesik Na
X. Xiao
Saurav Manchanda
Young Rao
Zhihong Xu
Guanghua Shu
Esther Vasiete
Tejaswi Tenneti
Haixun Wang
DMLRALM
56
6
0
12 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
286
501
0
12 Sep 2022
FP8 Formats for Deep Learning
FP8 Formats for Deep Learning
Paulius Micikevicius
Dusan Stosic
N. Burgess
Marius Cornea
Pradeep Dubey
...
Naveen Mellempudi
S. Oberman
Mohammad Shoeybi
Michael Siu
Hao Wu
BDLVLMMQ
156
141
0
12 Sep 2022
Factual and Informative Review Generation for Explainable Recommendation
Factual and Informative Review Generation for Explainable Recommendation
Zhouhang Xie
Sameer Singh
Julian McAuley
Bodhisattwa Prasad Majumder
104
26
0
12 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review,
  Evaluation and Recipe
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
172
150
0
12 Sep 2022
GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots
GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots
Gilbert Feng
Hongbo Zhang
Zhongyu Li
Xue Bin Peng
Bhuvan Basireddy
...
Zhitao Song
Lizhi Yang
Yunhui Liu
Koushil Sreenath
Sergey Levine
156
67
0
12 Sep 2022
Open-Domain Dialog Evaluation using Follow-Ups Likelihood
Open-Domain Dialog Evaluation using Follow-Ups Likelihood
Maxime De Bruyn
Ehsan Lotfi
Jeska Buhmann
Walter Daelemans
91
9
0
12 Sep 2022
Knowledge Base Question Answering: A Semantic Parsing Perspective
Knowledge Base Question Answering: A Semantic Parsing Perspective
Yu Gu
Vardaan Pahuja
Gong Cheng
Yu-Chuan Su
120
29
0
12 Sep 2022
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via
  Attention-based Network in Face Recognition
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition
Thanh-Dat Truong
C. Duong
Ngan Le
Marios Savvides
Khoa Luu
CVBM
103
9
0
11 Sep 2022
On The Computational Complexity of Self-Attention
On The Computational Complexity of Self-Attention
Feyza Duman Keles
Pruthuvi Maheshakya Wijewardena
Chinmay Hegde
142
130
0
11 Sep 2022
Structured Q-learning For Antibody Design
Structured Q-learning For Antibody Design
Alexander I. Cowen-Rivers
P. Gorinski
Aivar Sootla
Asif R. Khan
Liu Furui
Jun Wang
Jan Peters
H. Ammar
OffRLOnRL
89
3
0
10 Sep 2022
Improved Masked Image Generation with Token-Critic
Improved Masked Image Generation with Token-Critic
José Lezama
Huiwen Chang
Lu Jiang
Irfan Essa
DiffM
248
48
0
09 Sep 2022
Automatic Readability Assessment of German Sentences with Transformer
  Ensembles
Automatic Readability Assessment of German Sentences with Transformer Ensembles
Patrick Gustav Blaneck
Tobias Bornheim
Niklas Grieger
Stephan Bialonski
74
10
0
09 Sep 2022
Fast Neural Kernel Embeddings for General Activations
Fast Neural Kernel Embeddings for General Activations
Insu Han
A. Zandieh
Jaehoon Lee
Roman Novak
Lechao Xiao
Amin Karbasi
120
19
0
09 Sep 2022
Vision for Bosnia and Herzegovina in Artificial Intelligence Age: Global
  Trends, Potential Opportunities, Selected Use-cases and Realistic Goals
Vision for Bosnia and Herzegovina in Artificial Intelligence Age: Global Trends, Potential Opportunities, Selected Use-cases and Realistic Goals
Zlatan Ajanović
E. Alickovic
Aida Brankovic
Sead Delalic
Eldar Kurtic
S. Malikić
Adnan Mehonic
Hamza Merzic
Kenan Sehic
Bahrudin Trbalic
88
0
0
08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Rohan Taori
Tatsunori B. Hashimoto
124
48
0
08 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task
  Applications
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
90
20
0
08 Sep 2022
What does a platypus look like? Generating customized prompts for
  zero-shot image classification
What does a platypus look like? Generating customized prompts for zero-shot image classification
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
189
224
0
07 Sep 2022
AutoPruner: Transformer-Based Call Graph Pruning
AutoPruner: Transformer-Based Call Graph Pruning
Thanh Le-Cong
Hong Jin Kang
Truong-Giang Nguyen
S. A. Haryono
David Lo
X. Le
H. Thang
66
20
0
07 Sep 2022
On the Effectiveness of Compact Biomedical Transformers
On the Effectiveness of Compact Biomedical Transformers
Omid Rohanian
Mohammadmahdi Nouriborji
Samaneh Kouchaki
David Clifton
MedIm
87
31
0
07 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation
AudioLM: a Language Modeling Approach to Audio Generation
Zalan Borsos
Raphaël Marinier
Damien Vincent
Eugene Kharitonov
Olivier Pietquin
...
Dominik Roblek
O. Teboul
David Grangier
Marco Tagliasacchi
Neil Zeghidour
AuLLM
163
616
0
07 Sep 2022
SynSciPass: detecting appropriate uses of scientific text generation
SynSciPass: detecting appropriate uses of scientific text generation
Domenic Rosati
DeLMO
126
19
0
07 Sep 2022
Studying Bias in GANs through the Lens of Race
Studying Bias in GANs through the Lens of Race
V. Maluleke
Neerja Thakkar
Tim Brooks
Ethan Weber
Trevor Darrell
Alexei A. Efros
Angjoo Kanazawa
Devin Guillory
107
36
0
06 Sep 2022
Explaining Machine Learning Models in Natural Conversations: Towards a
  Conversational XAI Agent
Explaining Machine Learning Models in Natural Conversations: Towards a Conversational XAI Agent
Van Bach Nguyen
Jorg Schlotterer
C. Seifert
AILaw
38
12
0
06 Sep 2022
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer
  Models
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models
Jiangsu Du
Ziming Liu
Jiarui Fang
Shenggui Li
Yongbin Li
Yutong Lu
Yang You
MoE
52
4
0
06 Sep 2022
Few-Shot Document-Level Event Argument Extraction
Few-Shot Document-Level Event Argument Extraction
Xianjun Yang
Yujie Lu
Linda R. Petzold
69
16
0
06 Sep 2022
Evaluating the Susceptibility of Pre-Trained Language Models via
  Handcrafted Adversarial Examples
Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples
Hezekiah J. Branch
Jonathan Rodriguez Cefalu
Jeremy McHugh
Leyla Hujer
Aditya Bahl
Daniel del Castillo Iglesias
Ron Heichman
Ramesh Darwishi
ELMSILMAAML
70
56
0
05 Sep 2022
Selective Annotation Makes Language Models Better Few-Shot Learners
Selective Annotation Makes Language Models Better Few-Shot Learners
Hongjin Su
Jungo Kasai
Chen Henry Wu
Weijia Shi
Tianlu Wang
...
Rui Zhang
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
118
262
0
05 Sep 2022
A Review of Sparse Expert Models in Deep Learning
A Review of Sparse Expert Models in Deep Learning
W. Fedus
J. Dean
Barret Zoph
MoE
129
154
0
04 Sep 2022
The Effectiveness of Bidirectional Generative Patent Language Models
The Effectiveness of Bidirectional Generative Patent Language Models
Jieh-Sheng Lee
48
1
0
04 Sep 2022
Do Large Language Models know what humans know?
Do Large Language Models know what humans know?
Sean Trott
Cameron J. Jones
Tyler A. Chang
J. Michaelov
Benjamin Bergen
74
97
0
04 Sep 2022
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot
  Learning for Human-AI Interaction in Creative Applications of Generative
  Models
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot Learning for Human-AI Interaction in Creative Applications of Generative Models
Hai Dang
Lukas Mecke
Florian Lehmann
Sven Goller
Daniel Buschek
74
102
0
03 Sep 2022
HammingMesh: A Network Topology for Large-Scale Deep Learning
HammingMesh: A Network Topology for Large-Scale Deep Learning
Torsten Hoefler
Tommaso Bonato
Daniele De Sensi
Salvatore Di Girolamo
Shigang Li
Marco Heddes
Jon Belk
Deepak Goel
Miguel Castro
Steve Scott
3DHGNNAI4CE
79
23
0
03 Sep 2022
TransPolymer: a Transformer-based language model for polymer property
  predictions
TransPolymer: a Transformer-based language model for polymer property predictions
Changwen Xu
Yuyang Wang
A. Farimani
105
93
0
03 Sep 2022
Petals: Collaborative Inference and Fine-tuning of Large Models
Petals: Collaborative Inference and Fine-tuning of Large Models
Alexander Borzunov
Dmitry Baranchuk
Tim Dettmers
Max Ryabinin
Younes Belkada
Artem Chumachenko
Pavel Samygin
Colin Raffel
VLM
116
67
0
02 Sep 2022
INTERACTION: A Generative XAI Framework for Natural Language Inference
  Explanations
INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations
Jialin Yu
Alexandra I. Cristea
Anoushka Harit
Zhongtian Sun
O. Aduragba
Lei Shi
Noura Al Moubayed
74
10
0
02 Sep 2022
Multi-Modal Experience Inspired AI Creation
Multi-Modal Experience Inspired AI Creation
Qian Cao
Xu Chen
Ruihua Song
Hao Jiang
Guangyan Yang
Bo Zhao
68
3
0
02 Sep 2022
IMG2IMU: Translating Knowledge from Large-Scale Images to IMU Sensing
  Applications
IMG2IMU: Translating Knowledge from Large-Scale Images to IMU Sensing Applications
Hyungjun Yoon
Hyeong-Tae Cha
Hoang C. Nguyen
Taesik Gong
Sungyeop Lee
VLMSSL
106
1
0
02 Sep 2022
Previous
123...187188189...246247248
Next