ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,427 papers shown
Title
Towards using Few-Shot Prompt Learning for Automating Model Completion
Towards using Few-Shot Prompt Learning for Automating Model Completion
Meriem Ben Chaaben
Lola Burgueño
H. Sahraoui
VLMLRM
108
34
0
07 Dec 2022
Counterfactual reasoning: Do language models need world knowledge for
  causal understanding?
Counterfactual reasoning: Do language models need world knowledge for causal understanding?
Jiaxuan Li
Lang-Chi Yu
Allyson Ettinger
CMLLRM
43
2
0
06 Dec 2022
Visual Query Tuning: Towards Effective Usage of Intermediate
  Representations for Parameter and Memory Efficient Transfer Learning
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning
Cheng-Hao Tu
Zheda Mai
Wei-Lun Chao
57
48
0
06 Dec 2022
CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain
CySecBERT: A Domain-Adapted Language Model for the Cybersecurity Domain
Markus Bayer
Philip D. . Kuehn
Ramin Shanehsaz
Christian A. Reuter
64
49
0
06 Dec 2022
Towards human-compatible autonomous car: A study of non-verbal Turing
  test in automated driving with affective transition modelling
Towards human-compatible autonomous car: A study of non-verbal Turing test in automated driving with affective transition modelling
Zhaoning Li
Qiaoli Jiang
Zhengming Wu
Anqi Liu
Haiyan Wu
Miner Huang
Kai Huang
Y. Ku
64
2
0
06 Dec 2022
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context
  Tuning
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning
Praveen Venkateswaran
Evelyn Duesterwald
Vatche Isahagian
92
9
0
06 Dec 2022
Adaptive Testing of Computer Vision Models
Adaptive Testing of Computer Vision Models
Irena Gao
Gabriel Ilharco
Scott M. Lundberg
Marco Tulio Ribeiro
VLM
84
43
0
06 Dec 2022
Sources of Noise in Dialogue and How to Deal with Them
Sources of Noise in Dialogue and How to Deal with Them
Derek Chen
Zhou Yu
58
2
0
06 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
81
2
0
06 Dec 2022
Codex Hacks HackerRank: Memorization Issues and a Framework for Code
  Synthesis Evaluation
Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation
Anjan Karmakar
Julian Aron Prenner
Marco DÁmbros
Romain Robbes
ELM
73
17
0
06 Dec 2022
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
HungYueh Chiang
N. Frumkin
Feng Liang
Diana Marculescu
MQ
77
12
0
05 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual
  Learning
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLMMLLM
159
262
0
05 Dec 2022
Meta-Learning Fast Weight Language Models
Meta-Learning Fast Weight Language Models
Kevin Clark
Kelvin Guu
Ming-Wei Chang
Panupong Pasupat
Geoffrey E. Hinton
Mohammad Norouzi
KELM
80
14
0
05 Dec 2022
In-context Examples Selection for Machine Translation
In-context Examples Selection for Machine Translation
Sweta Agrawal
Chunting Zhou
M. Lewis
Luke Zettlemoyer
Marjan Ghazvininejad
LRM
120
198
0
05 Dec 2022
I2MVFormer: Large Language Model Generated Multi-View Document
  Supervision for Zero-Shot Image Classification
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification
Muhammad Ferjad Naeem
Muhammad Gul Zain Ali Khan
Yongqin Xian
Muhammad Zeshan Afzal
D. Stricker
Luc Van Gool
F. Tombari
VLM
81
56
0
05 Dec 2022
Improving Few-Shot Performance of Language Models via Nearest Neighbor
  Calibration
Improving Few-Shot Performance of Language Models via Nearest Neighbor Calibration
Feng Nie
Meixi Chen
Zhirui Zhang
Xuan Cheng
65
33
0
05 Dec 2022
Legal Prompt Engineering for Multilingual Legal Judgement Prediction
Legal Prompt Engineering for Multilingual Legal Judgement Prediction
Dietrich Trautmann
Alina Petrova
Frank Schilder
ELMAILaw
99
80
0
05 Dec 2022
Automatic Generation of Factual News Headlines in Finnish
Automatic Generation of Factual News Headlines in Finnish
Maximilian Koppatz
Khalid Alnajjar
Mika Hämäläinen
Thierry Poibeau
69
2
0
05 Dec 2022
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Ana Kotarcic
Dominik Hangartner
Fabrizio Gilardi
Selina Kurer
K. Donnay
60
3
0
05 Dec 2022
Breaking the Spurious Causality of Conditional Generation via Fairness
  Intervention with Corrective Sampling
Breaking the Spurious Causality of Conditional Generation via Fairness Intervention with Corrective Sampling
J. Nam
Sangwoo Mo
Jaeho Lee
Jinwoo Shin
96
7
0
05 Dec 2022
Cross-lingual Similarity of Multilingual Representations Revisited
Cross-lingual Similarity of Multilingual Representations Revisited
Maksym Del
Mark Fishel
74
3
0
04 Dec 2022
Understanding How Model Size Affects Few-shot Instruction Prompting
Understanding How Model Size Affects Few-shot Instruction Prompting
Ayrton San Joaquin
Ardy Haroen
39
0
0
04 Dec 2022
Persona-Based Conversational AI: State of the Art and Challenges
Persona-Based Conversational AI: State of the Art and Challenges
Junfeng Liu
Christopher T. Symons
R. Vatsavai
63
12
0
04 Dec 2022
Toward Efficient Language Model Pretraining and Downstream Adaptation
  via Self-Evolution: A Case Study on SuperGLUE
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
Qihuang Zhong
Liang Ding
Yibing Zhan
Yu Qiao
Yonggang Wen
...
Yixin Chen
Xinbo Gao
Steven C. H. Hoi
Xiaoou Tang
Dacheng Tao
VLMELM
124
35
0
04 Dec 2022
Constructing Highly Inductive Contexts for Dialogue Safety through
  Controllable Reverse Generation
Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation
Zhexin Zhang
Jiale Cheng
Hao Sun
Jiawen Deng
Fei Mi
Yasheng Wang
Lifeng Shang
Minlie Huang
SILM
148
9
0
04 Dec 2022
MiLMo:Minority Multilingual Pre-trained Language Model
MiLMo:Minority Multilingual Pre-trained Language Model
Sisi Liu
Hanru Shi
Xinhe Yu
Wugedele Bao
Yuan Sun
Xiaobing Zhao
81
0
0
04 Dec 2022
Precise Energy Consumption Measurements of Heterogeneous Artificial
  Intelligence Workloads
Precise Energy Consumption Measurements of Heterogeneous Artificial Intelligence Workloads
R. Caspart
Sebastian Ziegler
Arvid Weyrauch
Holger Obermaier
Simon Raffeiner
...
Marco Nolden
I. Reinartz
Hyunjin Park
Markus Goetz
Charlotte Debus
21
10
0
03 Dec 2022
Language Models as Agent Models
Language Models as Agent Models
Jacob Andreas
LLMAG
85
141
0
03 Dec 2022
A Survey on Medical Document Summarization
A Survey on Medical Document Summarization
Raghav Jain
Anubhav Jangra
S. Saha
Adam Jatowt
3DGSMedIm
82
19
0
03 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual
  Representation
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
57
13
0
03 Dec 2022
RHO ($ρ$): Reducing Hallucination in Open-domain Dialogues with
  Knowledge Grounding
RHO (ρρρ): Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding
Ziwei Ji
Zihan Liu
Nayeon Lee
Tiezheng Yu
Bryan Wilie
Mini Zeng
Pascale Fung
HILM
93
55
0
03 Dec 2022
Meta Learning for Few-Shot Medical Text Classification
Meta Learning for Few-Shot Medical Text Classification
Pankaj Sharma
Imran Qureshi
Minh Tran
OOD
41
0
0
03 Dec 2022
Exploring the Limits of Differentially Private Deep Learning with
  Group-wise Clipping
Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping
Jiyan He
Xuechen Li
Da Yu
Huishuai Zhang
Janardhan Kulkarni
Y. Lee
A. Backurs
Nenghai Yu
Jiang Bian
118
49
0
03 Dec 2022
Event knowledge in large language models: the gap between the impossible
  and the unlikely
Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
122
70
0
02 Dec 2022
Compound Tokens: Channel Fusion for Vision-Language Representation
  Learning
Compound Tokens: Channel Fusion for Vision-Language Representation Learning
Maxwell Mbabilla Aladago
A. Piergiovanni
64
2
0
02 Dec 2022
An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
Hong Jun Jeon
Benjamin Van Roy
72
0
0
02 Dec 2022
Nonparametric Masked Language Modeling
Nonparametric Masked Language Modeling
Sewon Min
Weijia Shi
M. Lewis
Xilun Chen
Wen-tau Yih
Hannaneh Hajishirzi
Luke Zettlemoyer
RALM
160
50
0
02 Dec 2022
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Fang Yu
Lee Quartey
Frank Schilder
ELMLRM
54
69
0
02 Dec 2022
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation
MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation
Lukas Hoyer
Dengxin Dai
Haoran Wang
Luc Van Gool
139
230
0
02 Dec 2022
SumREN: Summarizing Reported Speech about Events in News
SumREN: Summarizing Reported Speech about Events in News
R. Reddy
Heba Elfardy
Hou Pong Chan
Kevin Small
Chenhui Xu
62
5
0
02 Dec 2022
SoftCorrect: Error Correction with Soft Detection for Automatic Speech
  Recognition
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Yichong Leng
Xu Tan
Wenjie Liu
Kaitao Song
Rui Wang
Xiang-Yang Li
Tao Qin
Ed Lin
Tie-Yan Liu
114
16
0
02 Dec 2022
Programming Is Hard -- Or at Least It Used to Be: Educational
  Opportunities And Challenges of AI Code Generation
Programming Is Hard -- Or at Least It Used to Be: Educational Opportunities And Challenges of AI Code Generation
Brett A. Becker
Paul Denny
James Finnie-Ansley
Andrew Luxton-Reilly
James Prather
E. Santos
91
287
0
02 Dec 2022
a survey on GPT-3
a survey on GPT-3
M. Zong
Bhaskar Krishnamachari
100
35
0
01 Dec 2022
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual
  Grounding
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Ronghang Hu
Xinlei Chen
Matthias Nießner
Angel X. Chang
120
54
0
01 Dec 2022
Multi-Class Segmentation from Aerial Views using Recursive Noise
  Diffusion
Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion
Benedikt Kolbeinsson
K. Mikolajczyk
DiffM
81
13
0
01 Dec 2022
Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis
Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis
Odysseas S. Chlapanis
Georgios Paraskevopoulos
Alexandros Potamianos
87
9
0
01 Dec 2022
Extensible Prompts for Language Models on Zero-shot Language Style
  Customization
Extensible Prompts for Language Models on Zero-shot Language Style Customization
Tao Ge
Jing Hu
Li Dong
Shaoguang Mao
Yanqiu Xia
Xun Wang
Si-Qing Chen
Furu Wei
VLM
87
7
0
01 Dec 2022
Convolution, aggregation and attention based deep neural networks for
  accelerating simulations in mechanics
Convolution, aggregation and attention based deep neural networks for accelerating simulations in mechanics
Saurabh Deshpande
Raúl I. Sosa
Stéphane P. A. Bordas
J. Lengiewicz
AI4CE
77
20
0
01 Dec 2022
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
Hamish Ivison
Noah A. Smith
Hannaneh Hajishirzi
Pradeep Dasigi
122
23
0
01 Dec 2022
CREPE: Open-Domain Question Answering with False Presuppositions
CREPE: Open-Domain Question Answering with False Presuppositions
Xinyan Velocity Yu
Sewon Min
Luke Zettlemoyer
Hannaneh Hajishirzi
105
54
0
30 Nov 2022
Previous
123...173174175...247248249
Next