2005.14165
Language Models are Few-Shot Learners
28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
Papers citing
"Language Models are Few-Shot Learners"
50 / 12,277 papers shown
Title
Towards Multi-modal Graph Large Language Model
Xin Wang
Zeyang Zhang
Linxin Xiao
Haibo Chen
Chendi Ge
Wenwu Zhu
48
0
0
11 Jun 2025
A quantum semantic framework for natural language processing
Christopher J. Agostino
Quan Le Thien
Molly Apsel
Denizhan Pak
Elina Lesyk
Ashabari Majumdar
55
0
0
11 Jun 2025
Enhancing Traffic Accident Classifications: Application of NLP Methods for City Safety
Enes Özeren
Alexander Ulbrich
Sascha Filimon
David Rügamer
Andreas Bender
5
0
0
11 Jun 2025
Outside Knowledge Conversational Video (OKCV) Dataset -- Dialoguing over Videos
Benjamin Z. Reichman
Constantin Patsch
Jack Truxal
Atishay Jain
Larry Heck
35
0
0
11 Jun 2025
Is Fine-Tuning an Effective Solution? Reassessing Knowledge Editing for Unstructured Data
Hao Xiong
Chuanyuan Tan
Wenliang Chen
KELM
47
0
0
11 Jun 2025
Team Anotheroption at SemEval-2025 Task 8: Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA
Nikolas Evkarpidi
Elena Tutubalina
LMTD
79
0
0
11 Jun 2025
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
Xiangchen Li
Dimitrios Spatharakis
Saeid Ghafouri
Jiakun Fan
Dimitrios Nikolopoulos
Deepu John
Bo Ji
Dimitrios S. Nikolopoulos
43
0
0
11 Jun 2025
Latent Multi-Head Attention for Small Language Models
Sushant Mehta
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
RALM
39
0
0
11 Jun 2025
Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives
Wei Zeng
Hengshu Zhu
Chuan Qin
Han Wu
Yihang Cheng
...
Xiaowei Jin
Yinuo Shen
Zhenxing Wang
Feimin Zhong
Hui Xiong
AI4TS
65
0
0
11 Jun 2025
From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
Irving Fang
Juexiao Zhang
Shengbang Tong
Chen Feng
LM&Ro
48
1
0
11 Jun 2025
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
Xiyao Wang
Zhengyuan Yang
Chao Feng
Yongyuan Liang
Yuhang Zhou
...
Chung-Ching Lin
Kevin Lin
Linjie Li
Furong Huang
L. Wang
OffRL
LRM
52
0
0
11 Jun 2025
Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models
Haoyi Song
Ruihan Ji
Naichen Shi
Fan Lai
Raed Al Kontar
75
0
0
11 Jun 2025
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
C. L. Philip Chen
Yunpeng Zhai
Yifan Zhao
Jinyang Gao
Bolin Ding
Jia Li
36
0
0
11 Jun 2025
Reasoning Models Are More Easily Gaslighted Than You Think
B. Zhu
Hailong Yin
Jingjing Chen
Yu Jiang
LRM
73
0
0
11 Jun 2025
IntPhys 2: Benchmarking Intuitive Physics Understanding In Complex Synthetic Environments
Florian Bordes
Q. Garrido
Justine T Kao
Adina Williams
Michael G. Rabbat
Emmanuel Dupoux
PINN
76
0
0
11 Jun 2025
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
Shiting Huang
Zhen Fang
Zehui Chen
Siyu Yuan
Junjie Ye
Y. Zeng
Lin Yen-Chen
Qi Mao
Feng Zhao
LLMAG
KELM
14
0
0
11 Jun 2025
GLD-Road: A global-local decoding road network extraction model for remote sensing images
Ligao Deng
Yupeng Deng
Yu Meng
Jingbo Chen
Zhihao Xi
Diyou Liu
Qifeng Chu
56
0
0
11 Jun 2025
Memorization in Language Models through the Lens of Intrinsic Dimension
Stefan Arnold
PILM
102
0
0
11 Jun 2025
AI shares emotion with humans across languages and cultures
Xiuwen Wu
Hao Wang
Zhiang Yan
Xiaohan Tang
Pengfei Xu
Wai-Ting Siok
P. Li
Jia-Hong Gao
Bingjiang Lyu
Lang Qin
12
0
0
11 Jun 2025
Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models
Shuai Wang
Zhenhua Liu
Jiaheng Wei
Xuanwu Yin
Dong Li
E. Barsoum
LRM
75
0
0
11 Jun 2025
Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring
Gusseppe Bravo Rocca
Peini Liu
Jordi Guitart
Rodrigo M Carrillo-Larco
Ajay Dholakia
David Ellison
LLMAG
75
0
0
11 Jun 2025
Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework
Xiao Wei
Xiaobao Wang
Ning Zhuang
Chenyang Wang
L. Wang
Jianwu Dang
14
0
0
10 Jun 2025
Superposed Parameterised Quantum Circuits
Viktoria Patapovich
Mo Kordzanganeh
A. Melnikov
23
0
0
10 Jun 2025
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency
Chenlong Wang
Yuanning Feng
Dongping Chen
Zhaoyang Chu
Ranjay Krishna
Tianyi Zhou
LRM
18
0
0
10 Jun 2025
Graph Prompting for Graph Learning Models: Recent Advances and Future Directions
Xingbo Fu
Zehong Wang
Zihan Chen
Jiazheng Li
Yaochen Zhu
Zhenyu Lei
Cong Shen
Yanfang Ye
Chuxu Zhang
Jundong Li
AI4CE
VLM
20
0
0
10 Jun 2025
ORFS-agent: Tool-Using Agents for Chip Design Optimization
Amur Ghose
Andrew B. Kahng
Sayak Kundu
Zhiang Wang
AI4CE
18
0
0
10 Jun 2025
Enhanced Whole Page Optimization via Mixed-Grained Reward Mechanism-Adapted Language Models
Xinyuan Wang
Liang Wu
Yanjie Fu
23
0
0
10 Jun 2025
Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving
Yuxuan Zhou
Xien Liu
Chenwei Yan
Chen Ning
X. Zhang
...
Xiangling Fu
Shijin Wang
Guoping Hu
Yu Wang
Ji Wu
ELM
27
0
0
10 Jun 2025
Olica: Efficient Structured Pruning of Large Language Models without Retraining
Jiujun He
Huazhen Lin
24
0
0
10 Jun 2025
Factors affecting the in-context learning abilities of LLMs for dialogue state tracking
Pradyoth Hegde
Santosh Kesiraju
J. Svec
Šimon Sedláček
Bolaji Yusuf
Oldrich Plchot
D. K T
Ján Černocký
10
0
0
10 Jun 2025
Safeguarding Multimodal Knowledge Copyright in the RAG-as-a-Service Environment
Tianyu Chen
Jian Lou
Wenjie Wang
18
0
0
10 Jun 2025
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
Dongge Han
Menglin Xia
Daniel Madrigal Diaz
Samuel Kessler
Ankur Mallick
Xuchao Zhang
Mirian Hipolito Garcia
Jin Xu
Victor Rühle
Saravan Rajmohan
LRM
42
0
0
10 Jun 2025
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation
Zheqi He
Yesheng Liu
Jing-shu Zheng
Xuejing Li
Richeng Xuan
Jin-Ge Yao
Xi Yang
MLLM
VLM
37
0
0
10 Jun 2025
ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts
Ruiran Su
Jiasheng Si
Zhijiang Guo
J. Pierrehumbert
70
0
0
10 Jun 2025
LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs
Manooshree Patel
Rayna Bhattacharyya
Thomas Lu
Arnav Mehta
Niels Voss
Narges Norouzi
Gireeja Ranade
37
0
0
10 Jun 2025
CC-RAG: Structured Multi-Hop Reasoning via Theme-Based Causal Graphs
Jash Rajesh Parekh
Pengcheng Jiang
Jiawei Han
LRM
15
0
0
10 Jun 2025
Enhancing Accuracy and Maintainability in Nuclear Plant Data Retrieval: A Function-Calling LLM Approach Over NL-to-SQL
Mishca de Costa
Muhammad Anwar
Dave Mercier
Mark Randall
Issam Hammad
21
0
0
10 Jun 2025
Bayesian Inverse Physics for Neuro-Symbolic Robot Learning
Octavio Arriaga
Rebecca Adam
Melvin Laux
L. Gutzeit
Marco Ragni
Jan Peters
Frank Kirchner
PINN
AI4CE
28
0
0
10 Jun 2025
Quantifying Mix Network Privacy Erosion with Generative Models
Vasilios Mavroudis
Tariq Elahi
23
0
0
10 Jun 2025
Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents
Irene Testini
José Hernández-Orallo
Lorenzo Pacchiardi
18
0
0
10 Jun 2025
Towards Secure and Private Language Models for Nuclear Power Plants
Muhammad Anwar
Mishca de Costa
Issam Hammad
Daniel Lau
18
0
0
10 Jun 2025
Stronger Language Models Produce More Human-Like Errors
Andrew Keenan Richardson
Ryan Othniel Kearns
Sean Moss
Vincent Wang-Maścianica
Philipp Koralus
ReLM
LRM
26
0
0
10 Jun 2025
Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations
Yuxin Dong
Jiachen Jiang
Zhihui Zhu
Xia Ning
18
0
0
10 Jun 2025
Sample Efficient Demonstration Selection for In-Context Learning
Kiran Purohit
Venktesh V
Sourangshu Bhattacharya
Avishek Anand
41
0
0
10 Jun 2025
Multimodal Representation Alignment for Cross-modal Information Retrieval
Fan Xu
Luis A. Leiva
9
0
0
10 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
34
0
0
10 Jun 2025
On Finetuning Tabular Foundation Models
Ivan Rubachev
Akim Kotelnikov
Nikolay Kartashev
Artem Babenko
22
0
0
10 Jun 2025
ASRJam: Human-Friendly AI Speech Jamming to Prevent Automated Phone Scams
Freddie Grabovski
Gilad Gressel
Yisroel Mirsky
20
0
0
10 Jun 2025
Foundation Models in Medical Imaging -- A Review and Outlook
Vivien van Veldhuizen
Vanessa Botha
C. Lu
Melis Erdal Cesur
Kevin Groot Lipman
...
Cees Snoek
Lodewyk Wessels
Ritse Mann
Eric Marcus
Jonas Teuwen
MedIm
VLM
AI4CE
49
0
0
10 Jun 2025
Info-Coevolution: An Efficient Framework for Data Model Coevolution
Ziheng Qin
Hailun Xu
Wei Chee Yew
Qi Jia
Yang Luo
Kanchan Sarkar
Danhui Guan
Kai Wang
Yang You
22
0
0
09 Jun 2025