ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,362 papers shown
Title
Predicting the Future of AI with AI: High-quality link prediction in an
  exponentially growing knowledge network
Predicting the Future of AI with AI: High-quality link prediction in an exponentially growing knowledge network
Mario Krenn
L. Buffoni
B. Coutinho
S. Eppel
J. Foster
...
Ngoc M. Tran
Francisco Valente
Yangxinyu Xie
Rose Yu
Michael K Kopp
104
51
0
23 Sep 2022
An Interdisciplinary Perspective on Evaluation and Experimental Design
  for Visual Text Analytics: Position Paper
An Interdisciplinary Perspective on Evaluation and Experimental Design for Visual Text Analytics: Position Paper
Kostiantyn Kucher
N. Sultanum
Angel Daza
Vasiliki Simaki
Maria Skeppstedt
Barbara Plank
Jean-Daniel Fekete
Narges Mahyar
50
5
0
23 Sep 2022
Variational Open-Domain Question Answering
Variational Open-Domain Question Answering
Valentin Liévin
Andreas Geert Motzfeldt
Ida Riis Jensen
Ole Winther
OODBDL
76
9
0
23 Sep 2022
MetaPrompting: Learning to Learn Better Prompts
MetaPrompting: Learning to Learn Better Prompts
Yutai Hou
Hongyuan Dong
Xinghao Wang
Bohan Li
Wanxiang Che
VLM
83
30
0
23 Sep 2022
Zero-shot Domain Adaptation for Neural Machine Translation with
  Retrieved Phrase-level Prompts
Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts
Zewei Sun
Qingnan Jiang
Shujian Huang
Jun Cao
Shanbo Cheng
Mingxuan Wang
VLM
114
8
0
23 Sep 2022
Improving Conversational Recommender System via Contextual and
  Time-Aware Modeling with Less Domain-Specific Knowledge
Improving Conversational Recommender System via Contextual and Time-Aware Modeling with Less Domain-Specific Knowledge
Lingzhi Wang
Shafiq Joty
Wei Gao
Xingshan Zeng
Kam-Fai Wong
65
10
0
23 Sep 2022
Towards Faithful Model Explanation in NLP: A Survey
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
237
120
0
22 Sep 2022
ProgPrompt: Generating Situated Robot Task Plans using Large Language
  Models
ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh
Valts Blukis
Arsalan Mousavian
Ankit Goyal
Danfei Xu
Jonathan Tremblay
Dieter Fox
Jesse Thomason
Animesh Garg
LM&RoLLMAG
179
659
0
22 Sep 2022
Poisson Flow Generative Models
Poisson Flow Generative Models
Yilun Xu
Ziming Liu
M. Tegmark
Tommi Jaakkola
195
88
0
22 Sep 2022
A Case Report On The "A.I. Locked-In Problem": social concerns with
  modern NLP
A Case Report On The "A.I. Locked-In Problem": social concerns with modern NLP
Yoshija Walter
LLMAG
50
2
0
22 Sep 2022
Prompting for a conversation: How to control a dialog model?
Prompting for a conversation: How to control a dialog model?
Josef Valvoda
Yimai Fang
David Vandyke
219
5
0
22 Sep 2022
Efficient Few-Shot Learning Without Prompts
Efficient Few-Shot Learning Without Prompts
Lewis Tunstall
Nils Reimers
Unso Eun Seo Jo
Luke Bates
Daniel Korat
Moshe Wasserblat
Oren Pereg
VLM
93
197
0
22 Sep 2022
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question
  Generation
Selecting Better Samples from Pre-trained LLMs: A Case Study on Question Generation
Xingdi Yuan
Tong Wang
Yen-Hsiang Wang
Emery Fine
Rania Abdelghani
Pauline Lucas
Hélene Sauzéon
Pierre-Yves Oudeyer
94
30
0
22 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for
  vision based Deep Reinforcement Learning
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
107
6
0
22 Sep 2022
A novel corrective-source term approach to modeling unknown physics in
  aluminum extraction process
A novel corrective-source term approach to modeling unknown physics in aluminum extraction process
Haakon Robinson
E. Lundby
Adil Rasheed
J. Gravdahl
51
5
0
22 Sep 2022
DFX: A Low-latency Multi-FPGA Appliance for Accelerating
  Transformer-based Text Generation
DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation
Seongmin Hong
Seungjae Moon
Junsoo Kim
Sungjae Lee
Minsub Kim
Dongsoo Lee
Joo-Young Kim
171
83
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
Deep Lake: a Lakehouse for Deep Learning
S. Hambardzumyan
Abhina Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
90
20
0
22 Sep 2022
Learning Model Predictive Controllers with Real-Time Attention for
  Real-World Navigation
Learning Model Predictive Controllers with Real-Time Attention for Real-World Navigation
Xuesu Xiao
Tingnan Zhang
K. Choromanski
Edward J. Lee
Anthony G. Francis
...
Leila Takayama
Roy Frostig
Jie Tan
Carolina Parada
Vikas Sindhwani
150
55
0
22 Sep 2022
MulBot: Unsupervised Bot Detection Based on Multivariate Time Series
MulBot: Unsupervised Bot Detection Based on Multivariate Time Series
Lorenzo Mannocci
S. Cresci
A. Monreale
A. Vakali
Maurizio Tesconi
67
9
0
21 Sep 2022
Is More Data Better? Re-thinking the Importance of Efficiency in Abusive
  Language Detection with Transformers-Based Active Learning
Is More Data Better? Re-thinking the Importance of Efficiency in Abusive Language Detection with Transformers-Based Active Learning
Hannah Rose Kirk
Bertie Vidgen
Scott A. Hale
51
10
0
21 Sep 2022
FAL-CUR: Fair Active Learning using Uncertainty and Representativeness
  on Fair Clustering
FAL-CUR: Fair Active Learning using Uncertainty and Representativeness on Fair Clustering
R. Fajri
A. Saxena
Yulong Pei
Mykola Pechenizkiy
FaML
51
3
0
21 Sep 2022
A Comprehensive Survey on Trustworthy Recommender Systems
A Comprehensive Survey on Trustworthy Recommender Systems
Wenqi Fan
Xiangyu Zhao
Xiao Chen
Jingran Su
Jingtong Gao
...
Qidong Liu
Yiqi Wang
Hanfeng Xu
Lei Chen
Qing Li
FaML
107
48
0
21 Sep 2022
Generate rather than Retrieve: Large Language Models are Strong Context
  Generators
Generate rather than Retrieve: Large Language Models are Strong Context Generators
Wenhao Yu
Dan Iter
Shuohang Wang
Yichong Xu
Mingxuan Ju
Soumya Sanyal
Chenguang Zhu
Michael Zeng
Meng Jiang
RALMAIMat
360
341
0
21 Sep 2022
Exploring Optimal Granularity for Extractive Summarization of
  Unstructured Health Records: Analysis of the Largest Multi-Institutional
  Archive of Health Records in Japan
Exploring Optimal Granularity for Extractive Summarization of Unstructured Health Records: Analysis of the Largest Multi-Institutional Archive of Health Records in Japan
Kenichiro Ando
T. Okumura
Mamoru Komachi
Hiromasa Horiguchi
Yuji Matsumoto
103
7
0
20 Sep 2022
Relaxed Attention for Transformer Models
Relaxed Attention for Transformer Models
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
53
12
0
20 Sep 2022
Sparse Vicious Attacks on Graph Neural Networks
Sparse Vicious Attacks on Graph Neural Networks
Giovanni Trappolini
Valentino Maiorca
Silvio Severino
Emanuele Rodolà
Fabrizio Silvestri
Gabriele Tolomei
AAML
62
8
0
20 Sep 2022
Knowledge-Aware Bayesian Deep Topic Model
Knowledge-Aware Bayesian Deep Topic Model
Dongsheng Wang
Yishi Xu
Miaoge Li
Zhibin Duan
Chaojie Wang
Bo Chen
Mingyuan Zhou
BDL
101
16
0
20 Sep 2022
Learn to Explain: Multimodal Reasoning via Thought Chains for Science
  Question Answering
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
Ashwin Kalyan
ELMReLMLRM
295
1,301
0
20 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for
  Open-world Detection
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIPVLM
179
160
0
20 Sep 2022
Automatic Label Sequence Generation for Prompting Sequence-to-sequence
  Models
Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models
Zichun Yu
Tianyu Gao
Zhengyan Zhang
Yankai Lin
Zhiyuan Liu
Maosong Sun
Jie Zhou
VLMLRM
51
1
0
20 Sep 2022
Will It Blend? Mixing Training Paradigms & Prompting for Argument
  Quality Prediction
Will It Blend? Mixing Training Paradigms & Prompting for Argument Quality Prediction
Michiel van der Meer
Myrthe Reuver
Urja Khurana
Lea Krause
Selene Báez Santamaría
78
14
0
19 Sep 2022
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous
  Driving
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving
Xiwen Liang
Yangxin Wu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
106
37
0
19 Sep 2022
NL2INTERFACE: Interactive Visualization Interface Generation from
  Natural Language Queries
NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries
Yiru Chen
Ryan Li
Austin Mac
Tianbao Xie
Tao Yu
Eugene Wu
73
12
0
19 Sep 2022
Joint Language Semantic and Structure Embedding for Knowledge Graph
  Completion
Joint Language Semantic and Structure Embedding for Knowledge Graph Completion
Jianhao Shen
Chenguang Wang
Linyuan Gong
Dawn Song
119
32
0
19 Sep 2022
Automated MeSH Term Suggestion for Effective Query Formulation in
  Systematic Reviews Literature Search
Automated MeSH Term Suggestion for Effective Query Formulation in Systematic Reviews Literature Search
Shuai Wang
Harrisen Scells
Bevan Koopman
Guido Zuccon
AI4CE
73
18
0
19 Sep 2022
Enabling Conversational Interaction with Mobile UI using Large Language
  Models
Enabling Conversational Interaction with Mobile UI using Large Language Models
Bryan Wang
Gang Li
Yang Li
219
144
0
18 Sep 2022
Bootstrap Generalization Ability from Loss Landscape Perspective
Bootstrap Generalization Ability from Loss Landscape Perspective
Huanran Chen
Shitong Shao
Ziyi Wang
Zirui Shang
Jin Chen
Xiaofeng Ji
Xinxiao Wu
OOD
134
19
0
18 Sep 2022
EEG-Based Epileptic Seizure Prediction Using Temporal Multi-Channel
  Transformers
EEG-Based Epileptic Seizure Prediction Using Temporal Multi-Channel Transformers
Ricardo V. Godoy
Tharik J. S. Reis
Paulo H. Polegato
G. J. G. Lahr
R. Saute
F. Nakano
H. Machado
A. Sakamoto
Marcelo Becker
G. Caurin
45
7
0
18 Sep 2022
Unsupervised Lexical Substitution with Decontextualised Embeddings
Unsupervised Lexical Substitution with Decontextualised Embeddings
Takashi Wada
Timothy Baldwin
Yuji Matsumoto
Jey Han Lau
145
7
0
17 Sep 2022
Selective Token Generation for Few-shot Natural Language Generation
Selective Token Generation for Few-shot Natural Language Generation
DaeJin Jo
Taehwan Kwon
Eun-Sol Kim
Sungwoong Kim
70
1
0
17 Sep 2022
Psychologically-informed chain-of-thought prompts for metaphor
  understanding in large language models
Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models
Ben Prystawski
P. Thibodeau
Christopher Potts
Noah D. Goodman
ReLMLRMAI4CE
76
21
0
16 Sep 2022
Malicious Source Code Detection Using Transformer
Malicious Source Code Detection Using Transformer
Chen Tsfaty
Michael Fire
62
4
0
16 Sep 2022
The Whole Truth and Nothing But the Truth: Faithful and Controllable
  Dialogue Response Generation with Dataflow Transduction and Constrained
  Decoding
The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained Decoding
Hao Fang
Anusha Balakrishnan
Harsh Jhamtani
Jonathan Bufe
J. Crawford
Jayant Krishnamurthy
Adam Pauls
J. Eisner
Jacob Andreas
Dan Klein
90
6
0
16 Sep 2022
DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under
  Missing Data
DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under Missing Data
Kai Zhang
Qinmin Yang
Chong Li
AI4TS
21
0
0
16 Sep 2022
SQ-Swin: a Pretrained Siamese Quadratic Swin Transformer for Lettuce
  Browning Prediction
SQ-Swin: a Pretrained Siamese Quadratic Swin Transformer for Lettuce Browning Prediction
Dayang Wang
Boce Zhang
Yongshun Xu
Yaguang Luo
Hengyong Yu
ViT
91
1
0
16 Sep 2022
Can There be Art Without an Artist?
Can There be Art Without an Artist?
A. Ghosh
Genoveva Fossas
106
25
0
16 Sep 2022
On the Relation between Sensitivity and Accuracy in In-context Learning
On the Relation between Sensitivity and Accuracy in In-context Learning
Yanda Chen
Chen Zhao
Zhou Yu
Kathleen McKeown
He He
265
80
0
16 Sep 2022
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for
  Multilingual Tweet Representations at Twitter
TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter
Xinyang Zhang
Yury Malkov
Omar U. Florez
Serim Park
Brian McWilliams
Jiawei Han
Ahmed El-Kishky
VLM
107
94
0
15 Sep 2022
LAVIS: A Library for Language-Vision Intelligence
LAVIS: A Library for Language-Vision Intelligence
Dongxu Li
Junnan Li
Hung Le
Guangsen Wang
Silvio Savarese
Guosheng Lin
VLM
192
56
0
15 Sep 2022
Distribution Aware Metrics for Conditional Natural Language Generation
Distribution Aware Metrics for Conditional Natural Language Generation
David M. Chan
Yiming Ni
David A. Ross
Sudheendra Vijayanarasimhan
Austin Myers
John F. Canny
77
4
0
15 Sep 2022
Previous
123...186187188...246247248
Next