ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,288 papers shown
Title
FL-Tuning: Layer Tuning for Feed-Forward Network in Transformer
FL-Tuning: Layer Tuning for Feed-Forward Network in Transformer
Jingping Liu
Yuqiu Song
Kui Xue
Hongli Sun
Chao Wang
Lihan Chen
Haiyun Jiang
Jiaqing Liang
Tong Ruan
69
2
0
30 Jun 2022
CTrGAN: Cycle Transformers GAN for Gait Transfer
CTrGAN: Cycle Transformers GAN for Gait Transfer
Shahar Mahpod
Noam Gaash
Hay Hoffman
Gil Ben-Artzi
ViT
69
1
0
30 Jun 2022
esCorpius: A Massive Spanish Crawling Corpus
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño
David Pérez-Fernández
Jordi Armengol-Estapé
D. Griol
Z. Callejas
99
2
0
30 Jun 2022
BigBIO: A Framework for Data-Centric Biomedical Natural Language
  Processing
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Jason Alan Fries
Leon Weber
Natasha Seelam
Gabriel Altay
Debajyoti Datta
...
Minh Chien Vu
Trishala Neeraj
Jonas Golde
Albert Villanova del Moral
Benjamin Beilharz
LM&MA
151
49
0
30 Jun 2022
Language Model-Based Emotion Prediction Methods for Emotional Speech
  Synthesis Systems
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Hyun-Wook Yoon
Ohsung Kwon
Hoyeon Lee
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
Min-Jae Hwang
128
15
0
30 Jun 2022
A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA
A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA
Yangyang Guo
Liqiang Nie
Yongkang Wong
Yebin Liu
Zhiyong Cheng
Mohan S. Kankanhalli
119
40
0
30 Jun 2022
GPTs at Factify 2022: Prompt Aided Fact-Verification
GPTs at Factify 2022: Prompt Aided Fact-Verification
Pawan Kumar Sahu
Saksham Aggarwal
Taneesh Gupta
Gyanendra Das
67
1
0
29 Jun 2022
Solving Quantitative Reasoning Problems with Language Models
Solving Quantitative Reasoning Problems with Language Models
Aitor Lewkowycz
Anders Andreassen
David Dohan
Ethan Dyer
Henryk Michalewski
...
Theo Gutman-Solo
Yuhuai Wu
Behnam Neyshabur
Guy Gur-Ari
Vedant Misra
ReLMELMLRM
216
863
0
29 Jun 2022
SoK: Content Moderation in Social Media, from Guidelines to Enforcement,
  and Research to Practice
SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice
Mohit Singhal
Chen Ling
Pujan Paudel
Poojitha Thota
Nihal Kumarswamy
Gianluca Stringhini
Shirin Nilizadeh
156
33
0
29 Jun 2022
On the Robustness of Dialogue History Representation in Conversational
  Question Answering: A Comprehensive Study and a New Prompt-based Method
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method
Zorik Gekhman
Nadav Oved
Orgad Keller
Idan Szpektor
Roi Reichart
71
8
0
29 Jun 2022
TweetNLP: Cutting-Edge Natural Language Processing for Social Media
TweetNLP: Cutting-Edge Natural Language Processing for Social Media
Jose Camacho-Collados
Kiamehr Rezaee
Talayeh Riahi
Asahi Ushio
Daniel Loureiro
...
Eugenio Martínez-Cámara
Gonzalo Medina
T. Buhrmann
Leonardo Neves
Francesco Barbieri
VLMAI4MH
102
144
0
29 Jun 2022
Extreme compression of sentence-transformer ranker models: faster
  inference, longer battery life, and less storage on edge devices
Extreme compression of sentence-transformer ranker models: faster inference, longer battery life, and less storage on edge devices
Amit Chaulwar
Lukas Malik
Maciej Krajewski
Felix Reichel
Leif-Nissen Lundbæk
M. Huth
B. Matejczyk
VLM
32
3
0
29 Jun 2022
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
Test2Vec: An Execution Trace Embedding for Test Case Prioritization
E. Jabbar
Soheila Zangeneh
Hadi Hemmati
R. Feldt
87
5
0
28 Jun 2022
Continual Learning with Transformers for Image Classification
Continual Learning with Transformers for Image Classification
Beyza Ermis
Giovanni Zappella
Martin Wistuba
Aditya Rawal
Cédric Archambeau
CLL
83
22
0
28 Jun 2022
Towards Lexical Gender Inference: A Scalable Methodology using Online
  Databases
Towards Lexical Gender Inference: A Scalable Methodology using Online Databases
Marion Bartl
Susan Leavy
41
1
0
28 Jun 2022
Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible
  Products Prediction
Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible Products Prediction
Rongzhi Zhang
Rebecca West
Xiquan Cui
Chao Zhang
98
6
0
28 Jun 2022
Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation
  and Instance Generation
Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation
Jiaxin Huang
Yu Meng
Jiawei Han
78
20
0
28 Jun 2022
Studying Generalization Through Data Averaging
Studying Generalization Through Data Averaging
C. Gomez-Uribe
FedML
133
0
0
28 Jun 2022
Materials Transformers Language Models for Generative Materials Design:
  a benchmark study
Materials Transformers Language Models for Generative Materials Design: a benchmark study
Nihang Fu
Lai Wei
Yuqi Song
Qinyang Li
Rui Xin
Sadman Sadeed Omee
Rongzhi Dong
Edirisuriya M Dilanga Siriwardane
Jianjun Hu
50
2
0
27 Jun 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
Junting Pan
Ziyi Lin
Xiatian Zhu
Jing Shao
Hongsheng Li
96
206
0
27 Jun 2022
Prompting Decision Transformer for Few-Shot Policy Generalization
Prompting Decision Transformer for Few-Shot Policy Generalization
Mengdi Xu
Songlin Yang
Shun Zhang
Yuchen Lu
Ding Zhao
J. Tenenbaum
Chuang Gan
OffRL
88
150
0
27 Jun 2022
ProGen2: Exploring the Boundaries of Protein Language Models
ProGen2: Exploring the Boundaries of Protein Language Models
Erik Nijkamp
Jeffrey A. Ruffolo
Eli N. Weinstein
Nikhil Naik
Ali Madani
AI4TS
76
314
0
27 Jun 2022
Endowing Language Models with Multimodal Knowledge Graph Representations
Endowing Language Models with Multimodal Knowledge Graph Representations
Ningyuan Huang
Y. Deshpande
Yibo Liu
Houda Alberts
Kyunghyun Cho
Clara Vania
Iacer Calixto
VLM
72
15
0
27 Jun 2022
Leveraging Language for Accelerated Learning of Tool Manipulation
Leveraging Language for Accelerated Learning of Tool Manipulation
Allen Z. Ren
Bharat Govil
Tsung-Yen Yang
Karthik Narasimhan
Anirudha Majumdar
LM&Ro
85
36
0
27 Jun 2022
Long Range Language Modeling via Gated State Spaces
Long Range Language Modeling via Gated State Spaces
Harsh Mehta
Ankit Gupta
Ashok Cutkosky
Behnam Neyshabur
Mamba
138
243
0
27 Jun 2022
Repository-Level Prompt Generation for Large Language Models of Code
Repository-Level Prompt Generation for Large Language Models of Code
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
103
143
0
26 Jun 2022
Adversarial Self-Attention for Language Understanding
Adversarial Self-Attention for Language Understanding
Hongqiu Wu
Ruixue Ding
Hai Zhao
Pengjun Xie
Fei Huang
Min Zhang
81
12
0
25 Jun 2022
MVP: Multi-task Supervised Pre-training for Natural Language Generation
MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
120
24
0
24 Jun 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
100
22
0
24 Jun 2022
A Disability Lens towards Biases in GPT-3 Generated Open-Ended Languages
A Disability Lens towards Biases in GPT-3 Generated Open-Ended Languages
Akhter Al Amin
Kazi Sinthia Kabir
84
5
0
23 Jun 2022
Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic
  Graphs
Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
Yi-Lun Liao
Tess E. Smidt
190
246
0
23 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
188
120
0
23 Jun 2022
Sample Condensation in Online Continual Learning
Sample Condensation in Online Continual Learning
Mattia Sangermano
Antonio Carta
Andrea Cossu
D. Bacciu
DD
81
44
0
23 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online
  Videos
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
153
304
0
23 Jun 2022
AST-Probe: Recovering abstract syntax trees from hidden representations
  of pre-trained language models
AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models
José Antonio Hernández López
Martin Weyssow
Jesús Sánchez Cuadrado
H. Sahraoui
55
23
0
23 Jun 2022
Few-Shot Non-Parametric Learning with Deep Latent Variable Model
Few-Shot Non-Parametric Learning with Deep Latent Variable Model
Zhiying Jiang
Yi-Zhu Dai
Ji Xin
Ming Li
Jimmy J. Lin
72
5
0
23 Jun 2022
Evaluating Generative Patent Language Models
Evaluating Generative Patent Language Models
Jieh-Sheng Lee
ELM
97
17
0
23 Jun 2022
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
Baolin Peng
Michel Galley
Pengcheng He
Chris Brockett
Lars Liden
E. Nouri
Zhou Yu
Bill Dolan
Jianfeng Gao
VLM
95
75
0
22 Jun 2022
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming
Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming
Chuan Wen
Jianing Qian
Jierui Lin
Jiaye Teng
Dinesh Jayaraman
Yang Gao
AAML
97
18
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
214
1,134
0
22 Jun 2022
Generative Pretraining for Black-Box Optimization
Generative Pretraining for Black-Box Optimization
S. Krishnamoorthy
Satvik Mashkaria
Aditya Grover
OffRLAI4CE
134
31
0
22 Jun 2022
Using cognitive psychology to understand GPT-3
Using cognitive psychology to understand GPT-3
Marcel Binz
Eric Schulz
ELMLLMAG
351
490
0
21 Jun 2022
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders
Gang Li
Heliang Zheng
Daqing Liu
Chaoyue Wang
Fuchun Sun
Changwen Zheng
119
130
0
21 Jun 2022
Review Neural Networks about Image Transformation Based on IGC Learning
  Framework with Annotated Information
Review Neural Networks about Image Transformation Based on IGC Learning Framework with Annotated Information
Yuanjie Yan
Suorong Yang
Yan Wang
Jian Zhao
S. Furao
57
0
0
21 Jun 2022
Insights into Pre-training via Simpler Synthetic Tasks
Insights into Pre-training via Simpler Synthetic Tasks
Yuhuai Wu
Felix Li
Percy Liang
AIMat
92
21
0
21 Jun 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable
  Search Index with Query Generation
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
106
68
0
21 Jun 2022
SoteriaFL: A Unified Framework for Private Federated Learning with
  Communication Compression
SoteriaFL: A Unified Framework for Private Federated Learning with Communication Compression
Zhize Li
Haoyu Zhao
Boyue Li
Yuejie Chi
FedML
81
41
0
20 Jun 2022
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender
  Bias
Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender Bias
Yarden Tal
Inbal Magar
Roy Schwartz
77
36
0
20 Jun 2022
Resource-Efficient Separation Transformer
Resource-Efficient Separation Transformer
Luca Della Libera
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Frédéric Lepoutre
François Grondin
VLM
99
18
0
19 Jun 2022
Towards Unified Conversational Recommender Systems via
  Knowledge-Enhanced Prompt Learning
Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning
Xiaolei Wang
Kun Zhou
Ji-Rong Wen
Wayne Xin Zhao
HAIAI4TS
73
137
0
19 Jun 2022
Previous
123...192193194...244245246
Next