ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,431 papers shown
Title
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text
  Generation
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
Swarnadeep Saha
Xinyan Velocity Yu
Joey Tianyi Zhou
Ramakanth Pasunuru
Asli Celikyilmaz
ReLMLRM
59
11
0
16 Dec 2022
Dense Feature Memory Augmented Transformers for COVID-19 Vaccination
  Search Classification
Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification
Jai Gupta
Yi Tay
C. Kamath
Vinh Q. Tran
Donald Metzler
S. Bavadekar
Mimi Sun
E. Gabrilovich
MedIm
37
0
0
16 Dec 2022
Teaching Small Language Models to Reason
Teaching Small Language Models to Reason
Lucie Charlotte Magister
Jonathan Mallinson
Jakub Adamek
Eric Malmi
Aliaksei Severyn
LRMAI4CEReLM
233
267
0
16 Dec 2022
Decoder Tuning: Efficient Language Understanding as Decoding
Decoder Tuning: Efficient Language Understanding as Decoding
Ganqu Cui
Wentao Li
Ning Ding
Longtao Huang
Zhiyuan Liu
Maosong Sun
78
6
0
16 Dec 2022
Lessons learned from the evaluation of Spanish Language Models
Lessons learned from the evaluation of Spanish Language Models
Rodrigo Agerri
Eneko Agirre
ELM
90
15
0
16 Dec 2022
Feature Dropout: Revisiting the Role of Augmentations in Contrastive
  Learning
Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning
Alex Tamkin
Margalit Glasgow
Xiluo He
Noah D. Goodman
SSL
118
7
0
16 Dec 2022
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP
  Tasks
FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks
Weilong Dong
Xinwei Wu
Junzhuo Li
Shuangzhi Wu
Chao Bian
Deyi Xiong
FedML
106
6
0
16 Dec 2022
Convolution-enhanced Evolving Attention Networks
Convolution-enhanced Evolving Attention Networks
Yujing Wang
Yaming Yang
Zhuowan Li
Jiangang Bai
Mingliang Zhang
Xiangtai Li
Jiahao Yu
Ce Zhang
Gao Huang
Yu Tong
ViT
102
6
0
16 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image
  Transformers Help 3D Representation Learning?
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
115
91
0
16 Dec 2022
Controllable Text Generation via Probability Density Estimation in the
  Latent Space
Controllable Text Generation via Probability Density Estimation in the Latent Space
Yuxuan Gu
Xiaocheng Feng
Sicheng Ma
Lingyuan Zhang
Heng Gong
Weihong Zhong
Bing Qin
90
18
0
16 Dec 2022
ALERT: Adapting Language Models to Reasoning Tasks
ALERT: Adapting Language Models to Reasoning Tasks
Ping Yu
Tianlu Wang
O. Yu. Golovneva
Badr AlKhamissi
Siddharth Verma
Zhijing Jin
Gargi Ghosh
Mona T. Diab
Asli Celikyilmaz
ReLMLRM
85
19
0
16 Dec 2022
Werewolf Among Us: A Multimodal Dataset for Modeling Persuasion
  Behaviors in Social Deduction Games
Werewolf Among Us: A Multimodal Dataset for Modeling Persuasion Behaviors in Social Deduction Games
Bolin Lai
Hongxin Zhang
Miao Liu
Aryan Pariani
Fiona Ryan
Wenqi Jia
Shirley Anugrah Hayati
James M. Rehg
Diyi Yang
56
10
0
16 Dec 2022
Improving Chess Commentaries by Combining Language Models with Symbolic
  Reasoning Engines
Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines
Andrew Lee
David Wu
Emily Dinan
M. Lewis
LRM
87
7
0
15 Dec 2022
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources
  in Natural Language Understanding Systems
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems
Akshatha Arodi
Martin Pömsl
Kaheer Suleman
Adam Trischler
Alexandra Olteanu
Jackie C.K. Cheung
ELM
75
5
0
15 Dec 2022
FiDO: Fusion-in-Decoder optimized for stronger performance and faster
  inference
FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference
Michiel de Jong
Yury Zemlyanskiy
Joshua Ainslie
Nicholas FitzGerald
Sumit Sanghai
Fei Sha
William W. Cohen
VLM
73
36
0
15 Dec 2022
Efficient Long Sequence Modeling via State Space Augmented Transformer
Efficient Long Sequence Modeling via State Space Augmented Transformer
Simiao Zuo
Xiaodong Liu
Jian Jiao
Denis Xavier Charles
Eren Manavoglu
Tuo Zhao
Jianfeng Gao
175
37
0
15 Dec 2022
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue
  Systems
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems
Denis Emelin
Daniele Bonadiman
Sawsan Alqahtani
Yi Zhang
Saab Mansour
82
17
0
15 Dec 2022
MAViL: Masked Audio-Video Learners
MAViL: Masked Audio-Video Learners
Po-Yao (Bernie) Huang
Vasu Sharma
Hu Xu
Chaitanya K. Ryali
Haoqi Fan
Yanghao Li
Shang-Wen Li
Gargi Ghosh
Jitendra Malik
Christoph Feichtenhofer
81
54
0
15 Dec 2022
Objaverse: A Universe of Annotated 3D Objects
Objaverse: A Universe of Annotated 3D Objects
Matt Deitke
Dustin Schwenk
Jordi Salvador
Luca Weihs
Oscar Michel
Eli VanderBilt
Ludwig Schmidt
Kiana Ehsani
Aniruddha Kembhavi
Ali Farhadi
116
975
0
15 Dec 2022
CLIPPO: Image-and-Language Understanding from Pixels Only
CLIPPO: Image-and-Language Understanding from Pixels Only
Michael Tschannen
Basil Mustafa
N. Houlsby
CLIPVLM
102
49
0
15 Dec 2022
Attributed Question Answering: Evaluation and Modeling for Attributed
  Large Language Models
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Bernd Bohnet
Vinh Q. Tran
Pat Verga
Roee Aharoni
D. Andor
...
Michael Collins
Dipanjan Das
Donald Metzler
Slav Petrov
Kellie Webster
115
65
0
15 Dec 2022
Revisiting the Gold Standard: Grounding Summarization Evaluation with
  Robust Human Evaluation
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Yixin Liu
Alexander R. Fabbri
Pengfei Liu
Yilun Zhao
Linyong Nan
...
Simeng Han
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
ALM
86
134
0
15 Dec 2022
Visually-augmented pretrained language models for NLP tasks without
  images
Visually-augmented pretrained language models for NLP tasks without images
Hangyu Guo
Kun Zhou
Wayne Xin Zhao
Qinyu Zhang
Ji-Rong Wen
VLM
56
10
0
15 Dec 2022
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning
O. Yu. Golovneva
Moya Chen
Spencer Poff
Martin Corredor
Luke Zettlemoyer
Maryam Fazel-Zarandi
Asli Celikyilmaz
ReLMLRM
104
152
0
15 Dec 2022
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
Hang Lai
Weinan Zhang
Xialin He
Chen Yu
Zheng Tian
Yong Yu
Jun Wang
114
21
0
15 Dec 2022
Transformers learn in-context by gradient descent
Transformers learn in-context by gradient descent
J. Oswald
Eyvind Niklasson
E. Randazzo
João Sacramento
A. Mordvintsev
A. Zhmoginov
Max Vladymyrov
MLT
148
497
0
15 Dec 2022
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual
  Machine Translation
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation
Maha Elbayad
Anna Y. Sun
Shruti Bhosale
MoE
86
10
0
15 Dec 2022
Build-a-Bot: Teaching Conversational AI Using a Transformer-Based Intent
  Recognition and Question Answering Architecture
Build-a-Bot: Teaching Conversational AI Using a Transformer-Based Intent Recognition and Question Answering Architecture
Kate Pearce
Sharifa Alghowinem
C. Breazeal
75
19
0
14 Dec 2022
Efficient Self-supervised Learning with Contextualized Target
  Representations for Vision, Speech and Language
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Alexei Baevski
Arun Babu
Wei-Ning Hsu
Michael Auli
VLMSSL
129
97
0
14 Dec 2022
Analytical Engines With Context-Rich Processing: Towards Efficient
  Next-Generation Analytics
Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics
Viktor Sanca
Anastasia Ailamaki
116
4
0
14 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Reproducible scaling laws for contrastive language-image learning
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLMCLIP
139
824
0
14 Dec 2022
A Hierarchical Framework for Collaborative Artificial Intelligence
A Hierarchical Framework for Collaborative Artificial Intelligence
James L. Crowley
J. Coutaz
Jasmin Grosinger
Javier Vázquez-Salceda
C. Angulo
Alberto Sanfeliu
Luca Iocchi
Anthony G. Cohn
30
6
0
14 Dec 2022
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled
  Videos
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Hao-Wen Dong
Naoya Takahashi
Yuki Mitsufuji
Julian McAuley
Taylor Berg-Kirkpatrick
VLMCLIP
81
29
0
14 Dec 2022
Efficient Speech Representation Learning with Low-Bit Quantization
Efficient Speech Representation Learning with Low-Bit Quantization
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Abdel-rahman Mohamed
MQ
49
10
0
14 Dec 2022
Pre-trained Language Models Can be Fully Zero-Shot Learners
Pre-trained Language Models Can be Fully Zero-Shot Learners
Xuandong Zhao
Siqi Ouyang
Zhiguo Yu
Ming-li Wu
Lei Li
VLMLRM
103
34
0
14 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and
  Methods
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
127
26
0
13 Dec 2022
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma
Jerry Hong
Mustafa Omer Gul
Mona Gandhi
Irena Gao
Ranjay Krishna
CoGe
94
142
0
13 Dec 2022
Foresight -- Generative Pretrained Transformer (GPT) for Modelling of
  Patient Timelines using EHRs
Foresight -- Generative Pretrained Transformer (GPT) for Modelling of Patient Timelines using EHRs
Z. Kraljevic
D. Bean
Anthony Shek
R. Bendayan
H. Hemingway
...
Alfie Baston
Jack Ross
Esther Idowu
J. Teo
Richard J. B. Dobson
AI4TS
76
23
0
13 Dec 2022
Diverse Demonstrations Improve In-context Compositional Generalization
Diverse Demonstrations Improve In-context Compositional Generalization
Itay Levy
Ben Bogin
Jonathan Berant
96
146
0
13 Dec 2022
Gradient flow in the gaussian covariate model: exact solution of
  learning curves and multiple descent structures
Gradient flow in the gaussian covariate model: exact solution of learning curves and multiple descent structures
Antione Bodin
N. Macris
79
4
0
13 Dec 2022
Benchmarking Large Language Models for Automated Verilog RTL Code
  Generation
Benchmarking Large Language Models for Automated Verilog RTL Code Generation
Shailja Thakur
Baleegh Ahmad
Zhenxing Fan
Hammond Pearce
Benjamin Tan
Ramesh Karri
Brendan Dolan-Gavitt
S. Garg
68
141
0
13 Dec 2022
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Y. Hao
Yutao Sun
Li Dong
Zhixiong Han
Yuxian Gu
Furu Wei
LRM
62
75
0
13 Dec 2022
OAMixer: Object-aware Mixing Layer for Vision Transformers
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
119
4
0
13 Dec 2022
FastMIM: Expediting Masked Image Modeling Pre-training for Vision
FastMIM: Expediting Masked Image Modeling Pre-training for Vision
Jianyuan Guo
Kai Han
Han Wu
Yehui Tang
Yunhe Wang
Chang Xu
80
10
0
13 Dec 2022
Quant 4.0: Engineering Quantitative Investment with Automated,
  Explainable and Knowledge-driven Artificial Intelligence
Quant 4.0: Engineering Quantitative Investment with Automated, Explainable and Knowledge-driven Artificial Intelligence
Jian Guo
Saizhuo Wang
L. Ni
H. Shum
AIFin
99
8
0
13 Dec 2022
Position: Considerations for Differentially Private Learning with
  Large-Scale Public Pretraining
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Florian Tramèr
Gautam Kamath
Nicholas Carlini
SILM
131
72
0
13 Dec 2022
Technical Report -- Competition Solution for Prompt Tuning using
  Pretrained Language Model
Technical Report -- Competition Solution for Prompt Tuning using Pretrained Language Model
Jiang-Long Song
Wuhe Zou
Feng Li
Xiaolei Qin
Weidong Zhang
61
0
0
13 Dec 2022
Attentive Deep Neural Networks for Legal Document Retrieval
Attentive Deep Neural Networks for Legal Document Retrieval
Nguyen Ha Thanh
Manh-Kien Phi
Xuan-Bach Ngo
Vu Tran
Le-Minh Nguyen
Minh-Phuong Tu
AILaw
50
30
0
13 Dec 2022
Despite "super-human" performance, current LLMs are unsuited for
  decisions about ethics and safety
Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety
Joshua Albrecht
Ellie Kitanidis
Abraham J. Fetterman
ELMReLMALMLRM
84
19
0
13 Dec 2022
Jointly Learning Visual and Auditory Speech Representations from Raw
  Data
Jointly Learning Visual and Auditory Speech Representations from Raw Data
A. Haliassos
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
Maja Pantic
SSL
92
49
0
12 Dec 2022
Previous
123...171172173...247248249
Next