Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL

Papers citing "Language Models are Few-Shot Learners"

50 / 12,362 papers shown
Linearly Mapping from Image to Text Space
Jack Merullo
Louis Castricato
Carsten Eickhoff
Ellie Pavlick
VLM
248
117
0
30 Sep 2022
PART: Pre-trained Authorship Representation Transformer
Javier Huertas-Tato
Álvaro Huertas-García
Alejandro Martín
135
9
0
30 Sep 2022
Improving Molecular Pretraining with Complementary Featurizations
Yanqiao Zhu
Dingshuo Chen
Yuanqi Du
Yingze Wang
Qiang Liu
Shu Wu
AI4CE
68
7
0
29 Sep 2022
Unpacking Large Language Models with Conceptual Consistency
Pritish Sahu
Michael Cogswell
Yunye Gong
Ajay Divakaran
LRM
116
17
0
29 Sep 2022
Toward Trustworthy Neural Program Synthesis
Darren Key
Wen-Ding Li
Kevin Ellis
NAI
171
6
0
29 Sep 2022
Scaling Laws for a Multi-Agent Reinforcement Learning Model
Oren Neumann
C. Gros
92
27
0
29 Sep 2022
Compositional Semantic Parsing with Large Language Models
Andrew Drozdov
Nathanael Schärli
Ekin Akyürek
Nathan Scales
Xinying Song
Xinyun Chen
Olivier Bousquet
Denny Zhou
ReLM, LRM
269
93
0
29 Sep 2022
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals
Piotr Wojciech Mirowski
Kory W. Mathewson
Jaylen Pittman
Richard Evans
HAI
115
267
0
29 Sep 2022
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
116
46
0
29 Sep 2022
Repairing Bugs in Python Assignments Using Large Language Models
Jialu Zhang
J. Cambronero
Sumit Gulwani
Vu Le
R. Piskac
Gustavo Soares
Gust Verbruggen
KELM
77
56
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM, VGen
97
1,439
0
29 Sep 2022
A Multiagent Framework for the Asynchronous and Collaborative Extension of Multitask ML Systems
Andrea Gesmundo
104
2
0
29 Sep 2022
Is Complexity Required for Neural Network Pruning? A Case Study on Global Magnitude Pruning
Manas Gupta
Efe Camci
Vishandi Rudy Keneta
Abhishek Vaidyanathan
Ritwik Kanodia
Chuan-Sheng Foo
Wu Min
Lin Jie
71
14
0
29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
ReLM, LRM
207
300
0
29 Sep 2022
Dataset Distillation Using Parameter Pruning
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
DD
142
22
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
140
47
0
29 Sep 2022
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling
Yusong Wu
Josh Gardner
Ethan Manilow
Ian Simon
Curtis Hawthorne
Jesse Engel
91
10
0
28 Sep 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
108
33
0
28 Sep 2022
Clinical Language Understanding Evaluation (CLUE)
Travis R. Goodwin
Dina Demner-Fushman
ELM, LM&MA
28
1
0
28 Sep 2022
Who is GPT-3? An Exploration of Personality, Values and Demographics
Marilù Miotto
Nicola Rossberg
Bennett Kleinberg
ELM, PILM
83
116
0
28 Sep 2022
Causal Proxy Models for Concept-Based Model Explanations
Zhengxuan Wu
Karel D'Oosterlinck
Atticus Geiger
Amir Zur
Christopher Potts
MILM
132
37
0
28 Sep 2022
Prompt-driven efficient Open-set Semi-supervised Learning
Haoran Li
Chun-Mei Feng
Tao Zhou
Yong Xu
Xiaojun Chang
126
4
0
28 Sep 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks
Zhiyang Chen
Yousong Zhu
Zhaowen Li
Fan Yang
Wei Li
...
Chaoyang Zhao
Liwei Wu
Rui Zhao
Jinqiao Wang
Ming Tang
VLM, VOS
124
16
0
28 Sep 2022
YATO: Yet Another deep learning based Text analysis Open toolkit
Zeqiang Wang
Yile Wang
Jiageng Wu
Zhiyang Teng
Jie Yang
100
3
0
28 Sep 2022
ButterflyFlow: Building Invertible Layers with Butterfly Matrices
Chenlin Meng
Linqi Zhou
Kristy Choi
Tri Dao
Stefano Ermon
TPM
163
12
0
28 Sep 2022
PROD: Progressive Distillation for Dense Retrieval
Zhenghao Lin
Yeyun Gong
Xiao Liu
Hang Zhang
Chen Lin
...
Jian Jiao
Jing Lu
Daxin Jiang
Rangan Majumder
Nan Duan
129
27
0
27 Sep 2022
EditEval: An Instruction-Based Benchmark for Text Improvements
Jane Dwivedi-Yu
Timo Schick
Zhengbao Jiang
Maria Lomeli
Patrick Lewis
Gautier Izacard
Edouard Grave
Sebastian Riedel
Fabio Petroni
104
28
0
27 Sep 2022
Deep Generative Multimedia Children's Literature
Matthew Lyle Olson
35
0
0
27 Sep 2022
Improving Radiology Report Generation Systems by Removing Hallucinated References to Non-existent Priors
Vignav Ramesh
Nathan Chi
Pranav Rajpurkar
MedIm
93
50
0
27 Sep 2022
Local Grammar-Based Coding Revisited
L. Debowski
74
0
0
27 Sep 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
52
1
0
26 Sep 2022
Lex2Sent: A bagging approach to unsupervised sentiment analysis
Kai-Robin Lange
Jonas Rieger
Carsten Jentsch
SSL
32
2
0
26 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
156
69
0
26 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers
Nurullah Sevim
Ege Ozan Özyedek
Furkan Şahinuç
Aykut Koç
85
12
0
26 Sep 2022
Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour
Fangyu Liu
Julian Martin Eisenschlos
Jeremy R. Cole
Nigel Collier
90
4
0
26 Sep 2022
Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts
Joel Jang
Seonghyeon Ye
Minjoon Seo
ELM, LRM
162
64
0
26 Sep 2022
Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs
Đorđe Miladinović
Kumar Shridhar
Kushal Kumar Jain
Max B. Paulus
J. M. Buhmann
Mrinmaya Sachan
Carl Allen
DRL
97
5
0
26 Sep 2022
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding
Erica K. Shimomoto
Edison Marrese-Taylor
Hiroya Takamura
Ichiro Kobayashi
Hideki Nakayama
Yusuke Miyao
88
7
0
26 Sep 2022
Entailment Semantics Can Be Extracted from an Ideal Language Model
William Merrill
Alex Warstadt
Tal Linzen
179
14
0
26 Sep 2022
Knowledge Representation for Conceptual, Motivational, and Affective Processes in Natural Language Communication
Seng-Beng Ho
Zhaoxia Wang
B. Quek
Min Zhang
36
3
0
26 Sep 2022
News Summarization and Evaluation in the Era of GPT-3
Tanya Goyal
Junyi Jessy Li
Greg Durrett
ELM
124
411
0
26 Sep 2022
Re-contextualizing Fairness in NLP: The Case of India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
101
61
0
25 Sep 2022
WinoDict: Probing language models for in-context word acquisition
Julian Martin Eisenschlos
Jeremy R. Cole
Fangyu Liu
William W. Cohen
KELM
58
13
0
25 Sep 2022
Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity
Gabriel Simmons
181
67
0
24 Sep 2022
Interventional Causal Representation Learning
Kartik Ahuja
Divyat Mahajan
Yixin Wang
Yoshua Bengio
CML
156
95
0
24 Sep 2022
Whodunit? Learning to Contrast for Authorship Attribution
Bo Ai
Yuchen Wang
Yugin Tan
Samson Tan
SSL
164
19
0
23 Sep 2022
Multiple-Choice Question Generation: Towards an Automated Assessment Framework
Vatsal Raina
Mark Gales
AI4Ed, ELM
75
34
0
23 Sep 2022
Promptagator: Few-shot Dense Retrieval From 8 Examples
Zhuyun Dai
Vincent Zhao
Ji Ma
Yi Luan
Jianmo Ni
Jing Lu
A. Bakalov
Kelvin Guu
Keith B. Hall
Ming-Wei Chang
RALM
104
242
0
23 Sep 2022
Visual representations in the human brain are aligned with large language models
Adrien Doerig
Tim C Kietzmann
Emily J. Allen
Yihan Wu
Thomas Naselaris
Kendrick Norris Kay
I. Charest
95
24
0
23 Sep 2022
Best Prompts for Text-to-Image Models and How to Find Them
Nikita Pavlichenko
Dmitry Ustalov
DiffM
85
63
0
23 Sep 2022