Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
v1
v2
v3
v4 (latest)
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 1,609 papers shown
Title
MedPix 2.0: A Comprehensive Multimodal Biomedical Data set for Advanced AI Applications
Irene Siragusa
Salvatore Contino
Massimo La Ciura
Rosario Alicata
Roberto Pirrone
169
3
0
03 Jul 2024
GraCoRe: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models
Zike Yuan
Ming Liu
Hui Wang
Bing Qin
LRM
ELM
102
2
0
03 Jul 2024
Single Image Rolling Shutter Removal with Diffusion Models
Zhanglei Yang
Haipeng Li
Mingbo Hong
Chen-Lin Zhang
Shuaicheng Liu
Shuaicheng Liu
DiffM
65
4
0
03 Jul 2024
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific Languages
Mehant Kammakomati
Sameer Pimparkhede
Srikanth G. Tamilselvam
Praveen Venkateswaran
Pushpak Bhattacharyya
ALM
107
0
0
03 Jul 2024
AdaCQR: Enhancing Query Reformulation for Conversational Search via Sparse and Dense Retrieval Alignment
Yilong Lai
Jialong Wu
Congzhi Zhang
Haowen Sun
Deyu Zhou
93
4
0
02 Jul 2024
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models
Daking Rai
Yilun Zhou
Shi Feng
Abulhair Saparov
Ziyu Yao
160
33
0
02 Jul 2024
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang
Peng Wang
Tong Zhou
Yushun Dong
Zhen Tan
Jundong Li
CoGe
114
9
0
02 Jul 2024
RegMix: Data Mixture as Regression for Language Model Pre-training
Qian Liu
Xiaosen Zheng
Niklas Muennighoff
Guangtao Zeng
Longxu Dou
Tianyu Pang
Jing Jiang
Min Lin
MoE
156
52
1
01 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark Gales
Kate Knill
124
3
0
01 Jul 2024
Ground Every Sentence: Improving Retrieval-Augmented LLMs with Interleaved Reference-Claim Generation
Sirui Xia
Xintao Wang
Jiaqing Liang
Yifei Zhang
Weikang Zhou
Jiaji Deng
Fei Yu
Yanghua Xiao
RALM
138
8
0
01 Jul 2024
Revisiting Random Walks for Learning on Graphs
Jinwoo Kim
Olga Zaghen
Ayhan Suleymanzade
Youngmin Ryou
Seunghoon Hong
130
1
0
01 Jul 2024
From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning
Nan Xu
Fei Wang
Sheng Zhang
Hoifung Poon
Muhao Chen
110
7
0
01 Jul 2024
Expressivity of Neural Networks with Random Weights and Learned Biases
Ezekiel Williams
Avery Hee-Woon Ryoo
Thomas Jiralerspong
Alexandre Payeur
M. Perich
Luca Mazzucato
Guillaume Lajoie
105
2
0
01 Jul 2024
Large Language Models are Zero-Shot Recognizers for Activities of Daily Living
Gabriele Civitarese
Michele Fiori
Priyankar Choudhary
Claudio Bettini
65
5
0
01 Jul 2024
Eliminating Position Bias of Language Models: A Mechanistic Approach
Ziqi Wang
Hanlin Zhang
Xiner Li
Kuan-Hao Huang
Chi Han
Shuiwang Ji
Sham Kakade
Hao Peng
Heng Ji
123
19
0
01 Jul 2024
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
Nikhil Abhyankar
Vivek Gupta
Dan Roth
Chandan K. Reddy
LMTD
95
4
0
29 Jun 2024
Understanding and Mitigating Language Confusion in LLMs
Kelly Marchisio
Wei-Yin Ko
Alexandre Berard
Théo Dehaze
Sebastian Ruder
122
32
0
28 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
113
3
0
28 Jun 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
167
0
0
27 Jun 2024
Resolving Discrepancies in Compute-Optimal Scaling of Language Models
Tomer Porian
Mitchell Wortsman
J. Jitsev
Ludwig Schmidt
Y. Carmon
121
26
0
27 Jun 2024
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation
Sanggeon Yun
Ryozo Masukawa
Minhyoung Na
Mohsen Imani
99
8
0
27 Jun 2024
RouteLLM: Learning to Route LLMs with Preference Data
Isaac Ong
Amjad Almahairi
Vincent Wu
Wei-Lin Chiang
Tianhao Wu
Joseph E. Gonzalez
M. W. Kadous
Ion Stoica
119
102
0
26 Jun 2024
LABOR-LLM: Language-Based Occupational Representations with Large Language Models
Tianyu Du
Ayush Kanodia
Herman Brunborg
Keyon Vafa
Susan Athey
62
4
0
25 Jun 2024
Entropy-Based Decoding for Retrieval-Augmented Large Language Models
Zexuan Qiu
Zijing Ou
Bin Wu
Jingjing Li
Aiwei Liu
Irwin King
KELM
RALM
113
6
0
25 Jun 2024
NormTab: Improving Symbolic Reasoning in LLMs Through Tabular Data Normalization
Md Mahadi Hasan Nahid
Davood Rafiei
LMTD
100
5
0
25 Jun 2024
GMT: Guided Mask Transformer for Leaf Instance Segmentation
Feng Chen
Sotirios A. Tsaftaris
M. Giuffrida
66
1
0
24 Jun 2024
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging
Deyuan Liu
Zhan Qin
Han Wang
Zhao Yang
Zecheng Wang
...
Zhao Lv
Zhiying Tu
Dianhui Chu
Bo Li
Dianbo Sui
86
2
0
24 Jun 2024
Large Vocabulary Size Improves Large Language Models
Sho Takase
Ryokan Ri
Shun Kiyono
Takuya Kato
106
4
0
24 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
164
1
0
24 Jun 2024
Finding Transformer Circuits with Edge Pruning
Adithya Bhaskar
Alexander Wettig
Dan Friedman
Danqi Chen
185
20
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
142
38
0
24 Jun 2024
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Roy Xie
Junlin Wang
Ruomin Huang
Minxing Zhang
Rong Ge
Jian Pei
Neil Zhenqiang Gong
Bhuwan Dhingra
MIALM
118
17
0
23 Jun 2024
Unveiling LLM Mechanisms Through Neural ODEs and Control Theory
Yukun Zhang
Qi Dong
84
0
0
23 Jun 2024
Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Orevaoghene Ahia
Shuyue Stella Li
Vidhisha Balachandran
Sunayana Sitaram
Yulia Tsvetkov
128
7
0
22 Jun 2024
RuleR: Improving LLM Controllability by Rule-based Data Recycling
Ming Li
Han Chen
Chenguang Wang
Dang Nguyen
Dianqi Li
Dinesh Manocha
114
10
0
22 Jun 2024
ToVo: Toxicity Taxonomy via Voting
Tinh Son Luong
Thanh-Thien Le
Thang Viet Doan
Linh Ngo Van
Thien Huu Nguyen
Diep Thi-Ngoc Nguyen
70
0
0
21 Jun 2024
GOAL: A Generalist Combinatorial Optimization Agent Learner
Darko Drakulic
Sofia Michel
J. Andreoli
74
10
0
21 Jun 2024
An Investigation of Prompt Variations for Zero-shot LLM-based Rankers
Shuoqi Sun
Shengyao Zhuang
Shuai Wang
Guido Zuccon
102
9
0
20 Jun 2024
CodeRAG-Bench: Can Retrieval Augment Code Generation?
Zora Z. Wang
Akari Asai
Xinyan Velocity Yu
Frank F. Xu
Yiqing Xie
Graham Neubig
Daniel Fried
RALM
184
41
0
20 Jun 2024
APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking
Can Jin
Hongwu Peng
Shiyu Zhao
Zhenting Wang
Wujiang Xu
Ligong Han
Jiahui Zhao
Kai Zhong
Sanguthevar Rajasekaran
Dimitris N. Metaxas
KELM
95
33
0
20 Jun 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal
Tinghao Xie
Xiangyu Qi
Yi Zeng
Yangsibo Huang
Udari Madhushani Sehwag
...
Bo Li
Kai Li
Danqi Chen
Peter Henderson
Prateek Mittal
ALM
ELM
154
75
0
20 Jun 2024
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing
Han Jiang
Xiaoyuan Yi
Zhihua Wei
Ziang Xiao
Shu Wang
Xing Xie
ELM
ALM
132
8
0
20 Jun 2024
Encoder vs Decoder: Comparative Analysis of Encoder and Decoder Language Models on Multilingual NLU Tasks
Dan S. Nielsen
Kenneth Enevoldsen
Peter Schneider-Kamp
ELM
100
9
0
19 Jun 2024
Neuro-symbolic Training for Reasoning over Spatial Language
Tanawan Premsri
Parisa Kordjamshidi
LRM
NAI
64
6
0
19 Jun 2024
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Aman Singh Thakur
Kartik Choudhary
Venkat Srinik Ramayapally
Sankaran Vaidyanathan
Dieuwke Hupkes
ELM
ALM
139
65
0
18 Jun 2024
What Did I Do Wrong? Quantifying LLMs' Sensitivity and Consistency to Prompt Engineering
Federico Errica
G. Siracusano
D. Sanvito
Roberto Bifulco
151
25
0
18 Jun 2024
Generative Artificial Intelligence-Guided User Studies: An Application for Air Taxi Services
Shengdi Xiao
Jingjing Li
Tatsuki Fushimi
Yoichi Ochiai
70
0
0
18 Jun 2024
VoCo-LLaMA: Towards Vision Compression with Large Language Models
Xubing Ye
Yukang Gan
Xiaoke Huang
Yixiao Ge
Yansong Tang
MLLM
VLM
90
28
0
18 Jun 2024
From Instance Training to Instruction Learning: Task Adapters Generation from Instructions
Huanxuan Liao
Yao Xu
Shizhu He
Yuanzhe Zhang
Yanchao Hao
Shengping Liu
Kang Liu
Jun Zhao
126
1
0
18 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
Yujin Baek
Taehee Kim
Jaegul Choo
66
2
0
18 Jun 2024
Previous
1
2
3
...
22
23
24
...
31
32
33
Next