ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,278 papers shown
Title
United Minds or Isolated Agents? Exploring Coordination of LLMs under Cognitive Load Theory
United Minds or Isolated Agents? Exploring Coordination of LLMs under Cognitive Load Theory
HaoYang Shang
Xuan Liu
Zi Liang
J. Zhang
Haibo Hu
Song Guo
LLMAG
24
0
0
07 Jun 2025
FREE: Fast and Robust Vision Language Models with Early Exits
FREE: Fast and Robust Vision Language Models with Early Exits
Divya J. Bajpai
M. Hanawal
VLM
15
0
0
07 Jun 2025
MedCite: Can Language Models Generate Verifiable Text for Medicine?
MedCite: Can Language Models Generate Verifiable Text for Medicine?
Xiao Wang
Mengjue Tan
Qiao Jin
Guangzhi Xiong
Yu Hu
Aidong Zhang
Zhiyong Lu
Minjia Zhang
18
0
0
07 Jun 2025
Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?
Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?
Paulius Sasnauskas
Yiğit Yalın
Goran Radanović
10
0
0
07 Jun 2025
RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints
RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints
Tan-Hanh Pham
Chris Ngo
OffRLLRM
23
0
0
07 Jun 2025
Modeling Earth-Scale Human-Like Societies with One Billion Agents
Modeling Earth-Scale Human-Like Societies with One Billion Agents
Haoxiang Guan
Jiyan He
Liyang Fan
Zhenzhen Ren
Shaobin He
Xin Yu
Yuan Chen
Shuxin Zheng
Tie-Yan Liu
Zhen Liu
AI4CE
19
0
0
07 Jun 2025
Exploring Visual Prompting: Robustness Inheritance and Beyond
Exploring Visual Prompting: Robustness Inheritance and Beyond
Qi Li
Liangzhi Li
Zhouqiang Jiang
Bowen Wang
Keke Tang
VPVLMVLM
18
0
0
07 Jun 2025
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit
Charles Goddard
Fernando Fernandes Neto
22
0
0
07 Jun 2025
Label-semantics Aware Generative Approach for Domain-Agnostic Multilabel Classification
Label-semantics Aware Generative Approach for Domain-Agnostic Multilabel Classification
Subhendu Khatuya
Shashwat Naidu
Saptarshi Ghosh
Pawan Goyal
Niloy Ganguly
VLM
17
0
0
07 Jun 2025
Quantile Regression with Large Language Models for Price Prediction
Quantile Regression with Large Language Models for Price Prediction
Nikhita Vedula
Dushyanta Dhyani
Laleh Jalali
Boris Oreshkin
Mohsen Bayati
S. Malmasi
15
0
0
07 Jun 2025
MarginSel : Max-Margin Demonstration Selection for LLMs
MarginSel : Max-Margin Demonstration Selection for LLMs
Rajeev Bhatt Ambati
James Lester
Shashank Srivastava
Snigdha Chaturvedi
23
0
0
07 Jun 2025
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning
Yuan Yuan
Yukun Liu
Chonghua Han
Jie Feng
Yong Li
12
0
0
07 Jun 2025
Text-to-LoRA: Instant Transformer Adaption
Text-to-LoRA: Instant Transformer Adaption
Rujikorn Charakorn
Edoardo Cetin
Yujin Tang
Robert Tjarko Lange
AI4CE
51
0
0
06 Jun 2025
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Yuheng Lei
Sitong Mao
Shunbo Zhou
Hongyuan Zhang
Xuelong Li
Ping Luo
CLL
31
0
0
06 Jun 2025
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Adarsh Prasad Behera
J. Champati
Roberto Morabito
Sasu Tarkoma
J. Gross
16
0
0
06 Jun 2025
Building Models of Neurological Language
Building Models of Neurological Language
Henry Watkins
43
0
0
06 Jun 2025
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
Yichi Zhang
Xin Luna Dong
Zhaojiang Lin
Andrea Madotto
Anuj Kumar
Babak Damavandi
J. Chai
Seungwhan Moon
48
0
0
06 Jun 2025
RecGPT: A Foundation Model for Sequential Recommendation
RecGPT: A Foundation Model for Sequential Recommendation
Yangqin Jiang
Xubin Ren
Lianghao Xia
Da Luo
Kangyi Lin
Chao Huang
LRM
97
0
0
06 Jun 2025
When to Trust Context: Self-Reflective Debates for Context Reliability
When to Trust Context: Self-Reflective Debates for Context Reliability
Zeqi Zhou
Fang Wu
Shayan Talaei
Haokai Zhao
Cheng Meixin
Tinson Xu
Amin Saberi
Yejin Choi
HILM
47
0
0
06 Jun 2025
Transformative or Conservative? Conservation laws for ResNets and Transformers
Transformative or Conservative? Conservation laws for ResNets and Transformers
Sibylle Marcotte
Rémi Gribonval
Gabriel Peyré
35
0
0
06 Jun 2025
Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning
Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning
Yujia Huo
Jianchun Liu
Hongli Xu
Zhenguo Ma
Shilong Wang
Liusheng Huang
CLL
38
0
0
06 Jun 2025
Hey, That's My Data! Label-Only Dataset Inference in Large Language Models
Hey, That's My Data! Label-Only Dataset Inference in Large Language Models
Chen Xiong
Zihao Wang
Rui Zhu
Tsung-Yi Ho
Pin-Yu Chen
Jingwei Xiong
Haixu Tang
Lucila Ohno-Machado
47
0
0
06 Jun 2025
CP-Bench: Evaluating Large Language Models for Constraint Modelling
CP-Bench: Evaluating Large Language Models for Constraint Modelling
Kostis Michailidis
Dimos Tsouros
Tias Guns
55
0
0
06 Jun 2025
Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR
Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR
Fardis Nadimi
Payam Abdisarabshali
Kasra Borazjani
Jacob Chakareski
Seyyedali Hosseinalipour
42
0
0
06 Jun 2025
A Systematic Review of Poisoning Attacks Against Large Language Models
A Systematic Review of Poisoning Attacks Against Large Language Models
Neil Fendley
Edward W. Staley
Joshua Carney
William Redman
Marie Chau
Nathan G. Drenkow
AAMLPILM
21
0
0
06 Jun 2025
MIRIAD: Augmenting LLMs with millions of medical query-response pairs
MIRIAD: Augmenting LLMs with millions of medical query-response pairs
Qinyue Zheng
Salman Abdullah
Sam Rawal
C. Zakka
Sophie Ostmeier
Maximilian Purk
E. Reis
Eric J. Topol
J. Leskovec
Michael Moor
LM&MAAI4MH
54
1
0
06 Jun 2025
Large Language Models are Good Relational Learners
Large Language Models are Good Relational Learners
Fang Wu
Vijay Prakash Dwivedi
Jure Leskovec
33
0
0
06 Jun 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
Large Language Models are Demonstration Pre-Selectors for Themselves
Jiarui Jin
Yuwei Wu
Haoxuan Li
Xiaoting He
Weinan Zhang
Y. Yang
Yong Yu
Jun Wang
Mengyue Yang
51
0
0
06 Jun 2025
Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
D. Kunin
Giovanni Luca Marchetti
F. Chen
Dhruva Karkada
James B. Simon
M. DeWeese
Surya Ganguli
Nina Miolane
26
0
0
06 Jun 2025
Masked Language Models are Good Heterogeneous Graph Generalizers
Masked Language Models are Good Heterogeneous Graph Generalizers
Jinyu Yang
Cheng Yang
Shanyuan Cui
Zeyuan Guo
Liangwei Yang
Muhan Zhang
Chuan Shi
51
0
0
06 Jun 2025
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Jiatao Gu
Tianrong Chen
David Berthelot
Huangjie Zheng
Yuyang Wang
Ruixiang Zhang
Laurent Dinh
Miguel Angel Bautista
Josh Susskind
Shuangfei Zhai
43
0
0
06 Jun 2025
Contextually Guided Transformers via Low-Rank Adaptation
Contextually Guided Transformers via Low-Rank Adaptation
A. Zhmoginov
Jihwan Lee
Max Vladymyrov
Mark Sandler
OffRL
55
0
0
06 Jun 2025
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers
Yutao Hou
Zeguan Xiao
Fei Yu
Yihan Jiang
Xuetao Wei
Hailiang Huang
Yun-Nung Chen
Guanhua Chen
LRM
99
0
0
05 Jun 2025
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Tennison Liu
M. Schaar
AIFinLRM
119
0
0
05 Jun 2025
When can in-context learning generalize out of task distribution?
When can in-context learning generalize out of task distribution?
Chase Goddard
Lindsay M. Smith
Vudtiwat Ngampruetikorn
David J. Schwab
OOD
25
0
0
05 Jun 2025
Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study
Yujun Zhou
Jiayi Ye
Zipeng Ling
Yufei Han
Yue Huang
...
Zhenwen Liang
Kehan Guo
Taicheng Guo
Xiangqi Wang
Xiangliang Zhang
ReLMLRM
118
1
0
05 Jun 2025
Gen-n-Val: Agentic Image Data Generation and Validation
Jing-En Huang
I-Sheng Fang
Tzuhsuan Huang
Chih-Yu Wang
Jun-Cheng Chen
VLM
112
0
0
05 Jun 2025
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Neural Network Reprogrammability: A Unified Theme on Model Reprogramming, Prompt Tuning, and Prompt Instruction
Zesheng Ye
C. Cai
Ruijiang Dong
Jianzhong Qi
Lei Feng
Pin-Yu Chen
Feng Liu
197
0
0
05 Jun 2025
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation
Charles Hong
Brendan Roberts
Huijae An
Alex Um
Advay Ratan
Y. Shao
115
0
0
05 Jun 2025
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs
Ananth Muppidi
Abhilash Nandy
Sambaran Bandyopadhyay
14
0
0
05 Jun 2025
Stable Vision Concept Transformers for Medical Diagnosis
Lijie Hu
Songning Lai
Yuan Hua
Shu Yang
Jingfeng Zhang
Di Wang
MedIm
96
0
0
05 Jun 2025
Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety
Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety
Seongmin Lee
Aeree Cho
Grace C. Kim
ShengYun Peng
Mansi Phute
Duen Horng Chau
LM&MAAI4CE
63
0
0
05 Jun 2025
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning
Tanmay Parekh
Kartik Mehta
Ninareh Mehrabi
Kai-Wei Chang
Nanyun Peng
89
0
0
05 Jun 2025
Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification
Chengwu Liu
Ye Yuan
Yichun Yin
Yan Xu
Xin Xu
Zaoyu Chen
Yasheng Wang
Lifeng Shang
Qun Liu
Ming Zhang
LRM
134
0
0
05 Jun 2025
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark
Junjie Xing
Yeye He
Mengyu Zhou
Haoyu Dong
Shi Han
Lingjiao Chen
Dongmei Zhang
S. Chaudhuri
H. V. Jagadish
LMTDELMLRM
32
0
0
05 Jun 2025
SoK: Are Watermarks in LLMs Ready for Deployment?
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
25
0
0
05 Jun 2025
SeedEdit 3.0: Fast and High-Quality Generative Image Editing
SeedEdit 3.0: Fast and High-Quality Generative Image Editing
Peng Wang
Yichun Shi
Xiaochen Lian
Zhonghua Zhai
Xin Xia
Xuefeng Xiao
Weilin Huang
Jianchao Yang
119
0
0
05 Jun 2025
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Sample Complexity and Representation Ability of Test-time Scaling Paradigms
Baihe Huang
Shanda Li
Tianhao Wu
Yiming Yang
Ameet Talwalkar
Kannan Ramchandran
Michael I. Jordan
Jiantao Jiao
LRM
102
0
0
05 Jun 2025
Counterfactual reasoning: an analysis of in-context emergence
Moritz Miller
Bernhard Schölkopf
Siyuan Guo
ReLMLRM
157
0
0
05 Jun 2025
Survey on the Evaluation of Generative Models in Music
Alexander Lerch
Claire Arthur
Nick Bryan-Kinns
Corey Ford
Qianyi Sun
Ashvala Vinay
157
0
0
05 Jun 2025
Previous
123...567...244245246
Next