ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,379 papers shown
Title
Large Language Models Meet Harry Potter: A Bilingual Dataset for
  Aligning Dialogue Agents with Characters
Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters
Nuo Chen
Yan Wang
Haiyun Jiang
Deng Cai
Yuhan Li
Ziyang Chen
Longyue Wang
Jia Li
88
8
0
13 Nov 2022
Build generally reusable agent-environment interaction models
Build generally reusable agent-environment interaction models
Jun Jin
Hongming Zhang
Jun Luo
49
0
0
13 Nov 2022
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Taehoon Kim
Mark A Marsden
Pyunghwan Ahn
Sangyun Kim
Sihaeng Lee
Alessandra Sala
S. Kim
VLM
62
4
0
13 Nov 2022
NLPeer: A Unified Resource for the Computational Study of Peer Review
NLPeer: A Unified Resource for the Computational Study of Peer Review
Nils Dycke
Ilia Kuznetsov
Iryna Gurevych
78
39
0
12 Nov 2022
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic
  Fusion Prompts
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic Fusion Prompts
Xiaocui Yang
Shi Feng
Daling Wang
Pengfei Hong
Soujanya Poria
82
23
0
12 Nov 2022
Design of Unmanned Air Vehicles Using Transformer Surrogate Models
Design of Unmanned Air Vehicles Using Transformer Surrogate Models
Adam D. Cobb
Anirban Roy
Daniel Elenius
Susmit Jha
AI4CE
48
1
0
11 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELMVLM
160
137
0
11 Nov 2022
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class
  Classification
CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification
Yongbin Li
Canran Xu
Guodong Long
Tao Shen
Chongyang Tao
Jing Jiang
75
1
0
11 Nov 2022
Steps towards prompt-based creation of virtual worlds
Steps towards prompt-based creation of virtual worlds
Jasmine Roberts
Andrzej Banburski-Fahey
J. Lanier
62
14
0
10 Nov 2022
Climate Policy Tracker: Pipeline for automated analysis of public
  climate policies
Climate Policy Tracker: Pipeline for automated analysis of public climate policies
Artur .Zólkowski
Mateusz Krzyzinski
Piotr Wilczyñski
Stanislaw Giziñski
Emilia Wisnios
Bartosz Pieliñski
Julian Sienkiewicz
P. Biecek
73
3
0
10 Nov 2022
The CRINGE Loss: Learning what language not to model
The CRINGE Loss: Learning what language not to model
Leonard Adolphs
Tianyu Gao
Jing Xu
Kurt Shuster
Sainbayar Sukhbaatar
Jason Weston
MU
95
37
0
10 Nov 2022
InternImage: Exploring Large-Scale Vision Foundation Models with
  Deformable Convolutions
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
Wenhai Wang
Jifeng Dai
Zhe Chen
Zhenhang Huang
Zhiqi Li
...
Tong Lu
Lewei Lu
Hongsheng Li
Xiaogang Wang
Yu Qiao
VLM
180
698
0
10 Nov 2022
Impact of Adversarial Training on Robustness and Generalizability of
  Language Models
Impact of Adversarial Training on Robustness and Generalizability of Language Models
Enes Altinisik
Hassan Sajjad
Husrev Taha Sencar
Safa Messaoud
Sanjay Chawla
AAML
59
11
0
10 Nov 2022
On Optimizing the Communication of Model Parallelism
On Optimizing the Communication of Model Parallelism
Yonghao Zhuang
Hexu Zhao
Lianmin Zheng
Zhuohan Li
Eric P. Xing
Qirong Ho
Joseph E. Gonzalez
Ion Stoica
Haotong Zhang
111
28
0
10 Nov 2022
Contrastive Self-Supervised Learning for Skeleton Representations
Contrastive Self-Supervised Learning for Skeleton Representations
N. Lingg
Miguel Sarabia
Luca Zappella
B. Theobald
SSL
47
0
0
10 Nov 2022
Collateral facilitation in humans and language models
Collateral facilitation in humans and language models
J. Michaelov
Benjamin Bergen
110
11
0
09 Nov 2022
Grammatical Error Correction: A Survey of the State of the Art
Grammatical Error Correction: A Survey of the State of the Art
Christopher Bryant
Zheng Yuan
Muhammad Reza Qorib
Hannan Cao
Hwee Tou Ng
Ted Briscoe
3DV
87
87
0
09 Nov 2022
Large Language Models with Controllable Working Memory
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
135
171
0
09 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in
  Diffusion Models
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
157
312
0
09 Nov 2022
Efficiently Scaling Transformer Inference
Efficiently Scaling Transformer Inference
Reiner Pope
Sholto Douglas
Aakanksha Chowdhery
Jacob Devlin
James Bradbury
Anselm Levskaya
Jonathan Heek
Kefan Xiao
Shivani Agrawal
J. Dean
116
326
0
09 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
477
2,398
0
09 Nov 2022
What is Wrong with Language Models that Can Not Tell a Story?
What is Wrong with Language Models that Can Not Tell a Story?
Ivan P. Yamshchikov
Alexey Tikhonov
67
7
0
09 Nov 2022
Creative Writing with an AI-Powered Writing Assistant: Perspectives from
  Professional Writers
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
Daphne Ippolito
Ann Yuan
Andy Coenen
Sehmon Burnam
103
101
0
09 Nov 2022
Detecting Languages Unintelligible to Multilingual Models through Local
  Structure Probes
Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes
Louis Clouâtre
Prasanna Parthasarathi
Payel Das
Sarath Chandar
75
3
0
09 Nov 2022
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker
Aldo Pacchiano
OffRLOnRL
103
41
0
09 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLMOffRLLRM
70
9
0
09 Nov 2022
Distribution-based Emotion Recognition in Conversation
Distribution-based Emotion Recognition in Conversation
Wen Wu
Chuxu Zhang
P. Woodland
84
4
0
09 Nov 2022
Few-Shot Character Understanding in Movies as an Assessment to
  Meta-Learning of Theory-of-Mind
Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind
Mo Yu
Qiujing Wang
Shunchi Zhang
Yisi Sang
Kangsheng Pu
...
Han Wang
Liyan Xu
Jing Li
Yue Yu
Jie Zhou
89
20
0
09 Nov 2022
Learning to Follow Instructions in Text-Based Games
Learning to Follow Instructions in Text-Based Games
Mathieu Tuli
Andrew C. Li
Pashootan Vaezipoor
Toryn Q. Klassen
Scott Sanner
Sheila A. McIlraith
79
13
0
08 Nov 2022
Detecting and Accommodating Novel Types and Concepts in an Embodied
  Simulation Environment
Detecting and Accommodating Novel Types and Concepts in an Embodied Simulation Environment
Sadaf Ghaffari
Nikhil Krishnaswamy
38
7
0
08 Nov 2022
QuantPipe: Applying Adaptive Post-Training Quantization for Distributed
  Transformer Pipelines in Dynamic Edge Environments
QuantPipe: Applying Adaptive Post-Training Quantization for Distributed Transformer Pipelines in Dynamic Edge Environments
Hong Wang
Connor Imes
Souvik Kundu
Peter A. Beerel
S. Crago
J. Walters
MQ
59
7
0
08 Nov 2022
Active Example Selection for In-Context Learning
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILMLRM
114
206
0
08 Nov 2022
Self-conditioned Embedding Diffusion for Text Generation
Self-conditioned Embedding Diffusion for Text Generation
Robin Strudel
Corentin Tallec
Florent Altché
Yilun Du
Yaroslav Ganin
...
Will Grathwohl
Nikolay Savinov
Sander Dieleman
Laurent Sifre
Rémi Leblond
DiffM
89
88
0
08 Nov 2022
Conciseness: An Overlooked Language Task
Conciseness: An Overlooked Language Task
Felix Stahlberg
Aashish Kumar
Chris Alberti
Shankar Kumar
40
1
0
08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
Hao Peng
Xiaozhi Wang
Shengding Hu
Hailong Jin
Lei Hou
Juanzi Li
Zhiyuan Liu
Qun Liu
89
25
0
08 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRLOnRLAI4CE
81
23
0
08 Nov 2022
Looking at the Overlooked: An Analysis on the Word-Overlap Bias in
  Natural Language Inference
Looking at the Overlooked: An Analysis on the Word-Overlap Bias in Natural Language Inference
S. Rajaee
Yadollah Yaghoobzadeh
Mohammad Taher Pilehvar
73
5
0
07 Nov 2022
On minimal variations for unsupervised representation learning
On minimal variations for unsupervised representation learning
Vivien A. Cabannes
A. Bietti
Randall Balestriero
SSLDRL
92
8
0
07 Nov 2022
Generalizable Re-Identification from Videos with Cycle Association
Generalizable Re-Identification from Videos with Cycle Association
Zhongdao Wang
Zhaopeng Dou
Jingwei Zhang
Liang Zhen
Yifan Sun
Yali Li
Shengjin Wang
BDL
66
2
0
07 Nov 2022
From Denoising Diffusions to Denoising Markov Models
From Denoising Diffusions to Denoising Markov Models
Joe Benton
Yuyang Shi
Valentin De Bortoli
George Deligiannidis
Arnaud Doucet
DiffM
119
35
0
07 Nov 2022
TLP: A Deep Learning-based Cost Model for Tensor Program Tuning
TLP: A Deep Learning-based Cost Model for Tensor Program Tuning
Yiqiang Zhai
Yu Zhang
Shuo Liu
Xiaomeng Chu
Jie Peng
Jianmin Ji
Yanyong Zhang
47
33
0
07 Nov 2022
Knowledge Graph Embedding: A Survey from the Perspective of
  Representation Spaces
Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces
Jiahang Cao
Jinyuan Fang
Zaiqiao Meng
Shangsong Liang
104
75
0
07 Nov 2022
Generative Transformers for Design Concept Generation
Generative Transformers for Design Concept Generation
Qihao Zhu
Jianxi Luo
AI4CE
79
50
0
07 Nov 2022
Probing neural language models for understanding of words of estimative
  probability
Probing neural language models for understanding of words of estimative probability
Damien Sileo
Marie-Francine Moens
51
12
0
07 Nov 2022
Contrastive Learning with Prompt-derived Virtual Semantic Prototypes for
  Unsupervised Sentence Embedding
Contrastive Learning with Prompt-derived Virtual Semantic Prototypes for Unsupervised Sentence Embedding
Jiali Zeng
Yongjing Yin
Yu Jiang
Shuangzhi Wu
Yunbo Cao
SSL
68
13
0
07 Nov 2022
Fixing Model Bugs with Natural Language Patches
Fixing Model Bugs with Natural Language Patches
Shikhar Murty
Christopher D. Manning
Scott M. Lundberg
Marco Tulio Ribeiro
KELM
80
39
0
07 Nov 2022
Complex Reading Comprehension Through Question Decomposition
Complex Reading Comprehension Through Question Decomposition
Xiao-Yu Guo
Yuan-Fang Li
Gholamreza Haffari
ReLM
74
10
0
07 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
107
65
0
07 Nov 2022
On the Domain Adaptation and Generalization of Pretrained Language
  Models: A Survey
On the Domain Adaptation and Generalization of Pretrained Language Models: A Survey
Xu Guo
Han Yu
LM&MAVLM
145
30
0
06 Nov 2022
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement
  Learning
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning
D. Elbaz
Gal Novik
Oren Salzman
OffRL
148
0
0
06 Nov 2022
Previous
123...176177178...246247248
Next