Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.14165
Cited By
v1
v2
v3
v4 (latest)
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,288 papers shown
Title
Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study
Yujun Zhou
Jiayi Ye
Zipeng Ling
Yufei Han
Yue Huang
...
Zhenwen Liang
Kehan Guo
Taicheng Guo
Xiangqi Wang
Xiangliang Zhang
ReLM
LRM
118
1
0
05 Jun 2025
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers
Yutao Hou
Zeguan Xiao
Fei Yu
Yihan Jiang
Xuetao Wei
Hailiang Huang
Yun-Nung Chen
Guanhua Chen
LRM
99
0
0
05 Jun 2025
Stable Vision Concept Transformers for Medical Diagnosis
Lijie Hu
Songning Lai
Yuan Hua
Shu Yang
Jingfeng Zhang
Di Wang
MedIm
96
0
0
05 Jun 2025
Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques
Jisu An
Junseok Lee
Jeoungeun Lee
Yongseok Son
149
0
0
05 Jun 2025
Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety
Seongmin Lee
Aeree Cho
Grace C. Kim
ShengYun Peng
Mansi Phute
Duen Horng Chau
LM&MA
AI4CE
63
0
0
05 Jun 2025
Gen-n-Val: Agentic Image Data Generation and Validation
Jing-En Huang
I-Sheng Fang
Tzuhsuan Huang
Chih-Yu Wang
Jun-Cheng Chen
VLM
112
0
0
05 Jun 2025
RewardAnything: Generalizable Principle-Following Reward Models
Zhuohao Yu
Jiali Zeng
Weizheng Gu
Yidong Wang
Jindong Wang
Fandong Meng
Jie Zhou
Yue Zhang
Shikun Zhang
Wei Ye
LRM
100
1
0
04 Jun 2025
Relational reasoning and inductive bias in transformers trained on a transitive inference task
J. Geerts
Stephanie Chan
Claudia Clopath
Kimberly L. Stachenfeld
LRM
31
0
0
04 Jun 2025
Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models
Seungcheol Park
Jeongin Bae
Beomseok Kwon
Minjun Kim
Byeongwook Kim
S. Kwon
U. Kang
Dongsoo Lee
MQ
129
0
0
04 Jun 2025
A Statistical Physics of Language Model Reasoning
Jack David Carson
Amir Reisizadeh
LRM
AI4CE
78
0
0
04 Jun 2025
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
Zhiyu Zhang
Wei Chen
Youfang Lin
Huaiyu Wan
OffRL
CLL
111
0
0
04 Jun 2025
Multimodal Tabular Reasoning with Privileged Structured Information
Jun-Peng Jiang
Yu Xia
Hai-Long Sun
Shiyin Lu
Qing-Guo Chen
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
LMTD
LRM
89
0
0
04 Jun 2025
Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science
Peter Alexander Jansen
Samiah Hassan
Ruoyao Wang
37
0
0
04 Jun 2025
Schema Generation for Large Knowledge Graphs Using Large Language Models
Bohui Zhang
Yuan He
Lydia Pintscher
Albert Meroño-Peñuela
Elena Simperl
46
0
0
04 Jun 2025
Delta-KNN: Improving Demonstration Selection in In-Context Learning for Alzheimer's Disease Detection
Chuyuan Li
Raymond Li
Thalia S. Field
Giuseppe Carenini
120
0
0
04 Jun 2025
MELABenchv1: Benchmarking Large Language Models against Smaller Fine-Tuned Models for Low-Resource Maltese NLP
Kurt Micallef
Claudia Borg
20
0
0
04 Jun 2025
Does Prompt Design Impact Quality of Data Imputation by LLMs?
Shreenidhi Srinivasan
Lydia Manikonda
SyDa
103
0
0
04 Jun 2025
Backbone Augmented Training for Adaptations
Jae Wan Park
Junhyeok Kim
Youngjun Jun
Hyunah Ko
Seong Jae Hwang
22
0
0
04 Jun 2025
Through the Stealth Lens: Rethinking Attacks and Defenses in RAG
Sarthak Choudhary
Nils Palumbo
Ashish Hooda
Krishnamurthy Dvijotham
Somesh Jha
45
0
0
04 Jun 2025
Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems
Sven Kirchner
Alois Knoll
47
0
0
04 Jun 2025
Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints
Utkarsh Utkarsh
Pengfei Cai
Alan Edelman
Rafael Gomez-Bombarelli
Christopher Rackauckas
AI4CE
76
0
0
04 Jun 2025
Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation
Mingxuan Xia
Haobo Wang
Yixuan Li
Zewei Yu
Jindong Wang
Junbo Zhao
Runze Wu
86
1
0
04 Jun 2025
Algorithms for estimating linear function in data mining
Thomas Hoang
15
0
0
04 Jun 2025
KG-BiLM: Knowledge Graph Embedding via Bidirectional Language Models
Zirui Chen
Xin Eric Wang
Zhao Li
Wenbin Guo
Dongxiao He
96
0
0
04 Jun 2025
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Kejian Zhu
Shangqing Tu
Zhuoran Jin
Lei Hou
Juanzi Li
Jun Zhao
KELM
76
0
0
04 Jun 2025
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information
Seungcheol Park
Sojin Lee
Jongjin Kim
Jinsik Lee
Hyunjik Jo
U. Kang
73
2
0
04 Jun 2025
The Latent Space Hypothesis: Toward Universal Medical Representation Learning
Salil Patel
181
0
0
04 Jun 2025
Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models
Fangrui Zhu
Hanhui Wang
Yiming Xie
Jing Gu
Tianye Ding
Jianwei Yang
Huaizu Jiang
3DV
LRM
97
0
0
04 Jun 2025
Explainability-Based Token Replacement on LLM-Generated Text
Hadi Mohammadi
Anastasia Giachanou
Daniel L. Oberski
Ayoub Bagheri
DeLMO
85
0
0
04 Jun 2025
ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch
Jiawei Chen
Xinyan Guan
Qianhao Yuan
Guozhao Mo
Weixiang Zhou
Yaojie Lu
Hongyu Lin
Ben He
Le Sun
Xianpei Han
ALM
LRM
76
0
0
04 Jun 2025
Learning to Insert [PAUSE] Tokens for Better Reasoning
Eunki Kim
Sangryul Kim
James Thorne
LRM
41
0
0
04 Jun 2025
From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context
Yuanchen Bai
Zijian Ding
Angelique Taylor
64
0
0
04 Jun 2025
Structured Pruning for Diverse Best-of-N Reasoning Optimization
Hieu Trung Nguyen
Bao Nguyen
Viet Anh Nguyen
LRM
62
0
0
04 Jun 2025
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
Hao Yu
Tangyu Jiang
Shuning Jia
Shannan Yan
Shunning Liu
Haolong Qian
Guanghao Li
Shuting Dong
Huaisong Zhang
Chun Yuan
96
0
0
04 Jun 2025
Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks
Lin Mu
Guowei Chu
Li Ni
Lei Sang
Zhize Wu
Peiquan Jin
Yiwen Zhang
83
0
0
04 Jun 2025
Zero-Shot Open-Schema Entity Structure Discovery
Xueqiang Xu
Jinfeng Xiao
James Barry
Mohab Elkaref
Jiaru Zou
Pengcheng Jiang
Yunyi Zhang
Max Giammona
Geeth de Mel
Jiawei Han
29
0
0
04 Jun 2025
Attention-Only Transformers via Unrolled Subspace Denoising
Peng Wang
Yifu Lu
Yaodong Yu
Druv Pai
Qing Qu
Yi Ma
ViT
116
1
0
04 Jun 2025
SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling
Anhao Zhao
Fanghua Ye
Yingqi Fan
Junlong Tong
Zhiwei Fei
Hui Su
Xiaoyu Shen
66
0
0
04 Jun 2025
TokAlign: Efficient Vocabulary Adaptation via Token Alignment
Chong Li
Jiajun Zhang
Chengqing Zong
VLM
53
0
0
04 Jun 2025
R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
Qingfei Zhao
Ruobing Wang
Dingling Xu
Daren Zha
Limin Liu
AI4TS
KELM
LRM
70
0
0
04 Jun 2025
QQSUM: A Novel Task and Model of Quantitative Query-Focused Summarization for Review-based Product Question Answering
A. Tang
Xiuzhen Zhang
M. Dinh
Zhuang Li
RALM
55
0
0
04 Jun 2025
When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
Beatrix M. G. Nielsen
Emanuele Marconato
Andrea Dittadi
Luigi Gresele
49
0
0
04 Jun 2025
EALG: Evolutionary Adversarial Generation of Language Model-Guided Generators for Combinatorial Optimization
Ruibo Duan
Yuxin Liu
Xinyao Dong
Chenglin Fan
47
0
0
03 Jun 2025
Sign Language: Towards Sign Understanding for Robot Autonomy
Ayush Agrawal
Joel Loo
Nicky Zimmerman
David Hsu
SLR
70
0
0
03 Jun 2025
The Future of Continual Learning in the Era of Foundation Models: Three Key Directions
Jack Bell
Luigi Quarantiello
Eric Nuertey Coleman
Lanpei Li
Malio Li
Mauro Madeddu
Elia Piccoli
Vincenzo Lomonaco
KELM
23
0
0
03 Jun 2025
LLMs Can Also Do Well! Breaking Barriers in Semantic Role Labeling via Large Language Models
Xinxin Li
H. Chen
Chengjun Liu
Jing Li
Meishan Zhang
Jun-chen Yu
Min Zhang
15
0
0
03 Jun 2025
Adaptive Task Vectors for Large Language Models
Joonseong Kang
Soojeong Lee
Subeen Park
Sumin Park
Taero Kim
Jihee Kim
Ryunyi Lee
Kyungwoo Song
27
0
0
03 Jun 2025
Zero-Shot Time Series Forecasting with Covariates via In-Context Learning
Andreas Auer
Raghul Parthipan
Pedro Mercado
Abdul Fatir Ansari
Lorenzo Stella
Bernie Wang
Michael Bohlke-Schneider
Syama Sundar Rangapuram
AI4TS
60
0
0
03 Jun 2025
Design of Trimmed Helicoid Soft-Rigid Hybrid Robots
Zach J. Patterson
Emily R. Sologuren
Daniela Rus
30
0
0
03 Jun 2025
Asymptotics of SGD in Sequence-Single Index Models and Single-Layer Attention Networks
Luca Arnaboldi
Bruno Loureiro
Ludovic Stephan
Florent Krzakala
Lenka Zdeborová
53
0
0
03 Jun 2025
Previous
1
2
3
...
6
7
8
...
244
245
246
Next