ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,482 papers shown
Title
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
Chengyi Wang
Sanyuan Chen
Yu-Huan Wu
Zi-Hua Zhang
Long Zhou
...
Huaming Wang
Jinyu Li
Lei He
Sheng Zhao
Furu Wei
193
727
0
05 Jan 2023
Corrupted by Algorithms? How AI-generated and Human-written Advice Shape
  (Dis)honesty
Corrupted by Algorithms? How AI-generated and Human-written Advice Shape (Dis)honesty
Margarita Leib
N. Köbis
Rainer Michael Rilke
Marloes H. J. Hagens
Bernd Irlenbusch
38
30
0
05 Jan 2023
Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Critical Perspectives: A Benchmark Revealing Pitfalls in PerspectiveAPI
Lorena Piedras
Lucas Rosenblatt
Julia Wilkins
79
10
0
05 Jan 2023
Serenity: Library Based Python Code Analysis for Code Completion and
  Automated Machine Learning
Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning
Wenting Zhao
Ibrahim Abdelaziz
Julian T Dolby
Kavitha Srinivas
M. Helali
Essam Mansour
59
0
0
05 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
124
67
0
04 Jan 2023
MessageNet: Message Classification using Natural Language Processing and
  Meta-data
MessageNet: Message Classification using Natural Language Processing and Meta-data
Adar Kahana
Oren Elisha
26
0
0
04 Jan 2023
A comprehensive review of automatic text summarization techniques:
  method, data, evaluation and coding
A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding
D. Cajueiro
A. G. Nery
Igor Tavares
Maísa Kely de Melo
Silvia A. dos Reis
Weigang Li
V. R. R. Celestino
86
15
0
04 Jan 2023
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng
Ziyang Chen
Andrew Owens
85
78
0
04 Jan 2023
UniHD at TSAR-2022 Shared Task: Is Compute All We Need for Lexical
  Simplification?
UniHD at TSAR-2022 Shared Task: Is Compute All We Need for Lexical Simplification?
Dennis Aumiller
Michael Gertz
53
22
0
04 Jan 2023
Text sampling strategies for predicting missing bibliographic links
Text sampling strategies for predicting missing bibliographic links
F. V. Krasnova
I. S. Smaznevicha
E. Baskakova
81
1
0
04 Jan 2023
On the Convergence of Stochastic Gradient Descent in Low-precision
  Number Formats
On the Convergence of Stochastic Gradient Descent in Low-precision Number Formats
M. Cacciola
A. Frangioni
M. Asgharian
Alireza Ghaffari
V. Nia
86
4
0
04 Jan 2023
Infomaxformer: Maximum Entropy Transformer for Long Time-Series
  Forecasting Problem
Infomaxformer: Maximum Entropy Transformer for Long Time-Series Forecasting Problem
Peiwang Tang
Xianchao Zhang
AI4TS
116
6
0
04 Jan 2023
Audio-Visual Efficient Conformer for Robust Speech Recognition
Audio-Visual Efficient Conformer for Robust Speech Recognition
Maxime Burchi
Radu Timofte
VLM
78
35
0
04 Jan 2023
A Survey On Few-shot Knowledge Graph Completion with Structural and
  Commonsense Knowledge
A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge
Haodi Ma
D. Wang
101
8
0
03 Jan 2023
Language Models are Drummers: Drum Composition with Natural Language
  Pre-Training
Language Models are Drummers: Drum Composition with Natural Language Pre-Training
Li Zhang
Chris Callison-Burch
86
5
0
03 Jan 2023
On the causality-preservation capabilities of generative modelling
On the causality-preservation capabilities of generative modelling
Yves-Cédric Bauwelinckx
Jan Dhaene
Tim Verdonck
Milan van den Heuvel
CMLAI4CE
98
1
0
03 Jan 2023
Semi-Structured Object Sequence Encoders
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
124
0
0
03 Jan 2023
Data Valuation Without Training of a Model
Data Valuation Without Training of a Model
Nohyun Ki
Hoyong Choi
Hye Won Chung
TDI
82
33
0
03 Jan 2023
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement
  Understanding
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding
Steven H. Wang
Antoine Scardigli
Leonard Tang
Wei Chen
D.M. Levkin
Anya Chen
Spencer Ball
Thomas Woodside
Oliver Zhang
Dan Hendrycks
AILawELM
72
22
0
02 Jan 2023
CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection
CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection
Jie Liu
Yixiao Zhang
Jieneng Chen
Junfei Xiao
Yongyi Lu
Bennett A. Landman
Yixuan Yuan
Alan Yuille
Yucheng Tang
Zongwei Zhou
VLMMedIm
145
211
0
02 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
278
560
0
02 Jan 2023
Transformer Based Geocoding
Transformer Based Geocoding
Yuval Solaz
Vitaly Shalumov
30
0
0
02 Jan 2023
DMOps: Data Management Operation and Recipes
DMOps: Data Management Operation and Recipes
E. Choi
Chanjun Park
81
7
0
02 Jan 2023
Argoverse 2: Next Generation Datasets for Self-Driving Perception and
  Forecasting
Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting
Benjamin Wilson
William Qi
Tanmay Agarwal
John Lambert
Jagjeet Singh
...
Andrew Hartnett
J. K. Pontes
Deva Ramanan
Peter Carr
James Hays
3DPCAI4TS
138
647
0
02 Jan 2023
CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation
CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation
Ge Zhang
Yizhi Li
Yaoyao Wu
Linyuan Zhang
Chenghua Lin
Jiayi Geng
Shi Wang
Jie Fu
96
14
0
01 Jan 2023
Deep Learning Technique for Human Parsing: A Survey and Outlook
Deep Learning Technique for Human Parsing: A Survey and Outlook
Lu Yang
Wenhe Jia
Shane Li
Q. Song
ViT
143
20
0
01 Jan 2023
Second Thoughts are Best: Learning to Re-Align With Human Values from
  Text Edits
Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits
Ruibo Liu
Chenyan Jia
Ge Zhang
Ziyu Zhuang
Tony X. Liu
Soroush Vosoughi
189
36
0
01 Jan 2023
Rethinking with Retrieval: Faithful Large Language Model Inference
Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He
Hongming Zhang
Dan Roth
KELMLRM
242
169
0
31 Dec 2022
Unpacking the "Black Box" of AI in Education
Unpacking the "Black Box" of AI in Education
Nabeel Gillani
R. Eynon
Catherine Chiabaut
Kelsey Finkel
76
59
0
31 Dec 2022
A Survey on In-context Learning
A Survey on In-context Learning
Qingxiu Dong
Lei Li
Damai Dai
Ce Zheng
Jingyuan Ma
...
Zhiyong Wu
Baobao Chang
Xu Sun
Lei Li
Zhifang Sui
ReLMAIMat
157
547
0
31 Dec 2022
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition
  with Pre-trained Vision-Language Models
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
174
53
0
31 Dec 2022
Inconsistencies in Masked Language Models
Inconsistencies in Masked Language Models
Tom Young
Yunan Chen
Yang You
74
2
0
30 Dec 2022
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on
  Simplified Radiology Reports
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports
Katharina Jeblick
B. Schachtner
Jakob Dexl
Andreas Mittermeier
Anna Theresa Stüber
...
Tobias Weber
Philipp Wesp
B. Sabel
J. Ricke
Michael Ingrisch
LM&MAMedIm
176
403
0
30 Dec 2022
An Analysis of Attention via the Lens of Exchangeability and Latent
  Variable Models
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models
Yufeng Zhang
Boyi Liu
Qi Cai
Lingxiao Wang
Zhaoran Wang
128
13
0
30 Dec 2022
DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and
  Classification for Diabetic Retinopathy Grading
DRG-Net: Interactive Joint Learning of Multi-lesion Segmentation and Classification for Diabetic Retinopathy Grading
Hasan Md Tusfiqur
D. M. Nguyen
M. T. N. Truong
T. A. Nguyen
Binh Duc Nguyen
...
H. Profitlich
Ngoc T. T. Than
Ngan Le
P. Xie
Daniel Sonntag
MedIm
83
8
0
30 Dec 2022
MAUVE Scores for Generative Models: Theory and Practice
MAUVE Scores for Generative Models: Theory and Practice
Krishna Pillutla
Lang Liu
John Thickstun
Sean Welleck
Swabha Swayamdipta
Rowan Zellers
Sewoong Oh
Yejin Choi
Zaïd Harchaoui
EGVM
123
23
0
30 Dec 2022
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
93
8
0
30 Dec 2022
Improving Visual Representation Learning through Perceptual
  Understanding
Improving Visual Representation Learning through Perceptual Understanding
Samyakh Tukra
Frederick Hoffman
Ken Chatfield
86
5
0
30 Dec 2022
Examining Political Rhetoric with Epistemic Stance Detection
Examining Political Rhetoric with Epistemic Stance Detection
Ankita Gupta
Su Lin Blodgett
Justin H. Gross
Brendan O'Connor
56
0
0
29 Dec 2022
Efficient Movie Scene Detection using State-Space Transformers
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
45
0
29 Dec 2022
GPT Takes the Bar Exam
GPT Takes the Bar Exam
M. Bommarito
Daniel Martin Katz
ELM
81
156
0
29 Dec 2022
Eliminating Meta Optimization Through Self-Referential Meta Learning
Eliminating Meta Optimization Through Self-Referential Meta Learning
Louis Kirsch
Jürgen Schmidhuber
58
7
0
29 Dec 2022
Foreground-Background Separation through Concept Distillation from
  Generative Image Foundation Models
Foreground-Background Separation through Concept Distillation from Generative Image Foundation Models
Mischa Dombrowski
Hadrien Reynaud
Matthew Baugh
Bernhard Kainz
DiffM
91
3
0
29 Dec 2022
3D Masked Modelling Advances Lesion Classification in Axial T2w Prostate
  MRI
3D Masked Modelling Advances Lesion Classification in Axial T2w Prostate MRI
Alvaro Fernandez-Quilez
C. Andersen
T. Eftestøl
S. R. Kjosavik
K. Oppedal
25
2
0
29 Dec 2022
Maximizing Use-Case Specificity through Precision Model Tuning
Maximizing Use-Case Specificity through Precision Model Tuning
Pranjal Awasthi
David Recio-Mitter
Yosuke Kyle Sugi
LM&MA
27
1
0
29 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
117
91
0
28 Dec 2022
Demonstrate-Search-Predict: Composing retrieval and language models for
  knowledge-intensive NLP
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Omar Khattab
Keshav Santhanam
Xiang Lisa Li
David Leo Wright Hall
Percy Liang
Christopher Potts
Matei A. Zaharia
RALMKELM
114
269
0
28 Dec 2022
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Daniel Y. Fu
Tri Dao
Khaled Kamal Saab
A. Thomas
Atri Rudra
Christopher Ré
159
404
0
28 Dec 2022
Automatic Recognition and Classification of Future Work Sentences from
  Academic Articles in a Specific Domain
Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain
Chengzhi Zhang
Yi Xiang
Wenke Hao
Zhicheng Li
Yuchen Qian
Yuzhuo Wang
48
11
0
28 Dec 2022
Knowledge-Guided Data-Centric AI in Healthcare: Progress, Shortcomings,
  and Future Directions
Knowledge-Guided Data-Centric AI in Healthcare: Progress, Shortcomings, and Future Directions
Edward Y. Chang
MedIm
52
7
0
27 Dec 2022
Previous
123...168169170...248249250
Next