ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 1,609 papers shown
Title
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Dong In Lee
Hyeongcheol Park
Jiyoung Seo
Eunbyung Park
Hyunje Park
Ha Dam Baek
Shin Sangheon
Sangmin kim
Sangpil Kim
3DGS
175
2
0
16 Dec 2024
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models
Boyang Xue
Fei Mi
Qi Zhu
Hongru Wang
Rui Wang
Sheng Wang
Erxin Yu
Xuming Hu
Kam-Fai Wong
HILM
191
2
0
16 Dec 2024
A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options
A recent evaluation on the performance of LLMs on radiation oncology physics using questions of randomly shuffled options
Peilong Wang
J. Holmes
Ziqiang Liu
Dequan Chen
Tianming Liu
Jiajian Shen
Wen Liu
LRMELMLM&MA
145
0
0
14 Dec 2024
Video Diffusion Transformers are In-Context Learners
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGenDiffM
370
3
0
14 Dec 2024
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs
Hortense Fong
George Gui
HAI
125
0
0
13 Dec 2024
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers
Zehao Chen
Rong Pan
130
2
0
13 Dec 2024
Olympus: A Universal Task Router for Computer Vision Tasks
Olympus: A Universal Task Router for Computer Vision Tasks
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Philip Torr
VLMObjD
485
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
157
2
0
12 Dec 2024
AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework
AutoPrep: Natural Language Question-Aware Data Preparation with a Multi-Agent Framework
Meihao Fan
Ju Fan
Nan Tang
Lei Cao
Guoliang Li
Xiaoyong Du
LMTD
180
0
0
10 Dec 2024
Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs
Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHRs
Michael Wornow
Suhana Bedi
Miguel Angel Fuentes Hernandez
E. Steinberg
Jason Alan Fries
Christopher Ré
Sanmi Koyejo
N. Shah
166
5
0
09 Dec 2024
Bridging Conversational and Collaborative Signals for Conversational Recommendation
Bridging Conversational and Collaborative Signals for Conversational Recommendation
Ahmad Bin Rabiah
Nafis Sadeq
Julian McAuley
169
0
0
09 Dec 2024
Chimera: Improving Generalist Model with Domain-Specific Experts
Chimera: Improving Generalist Model with Domain-Specific Experts
Tianshuo Peng
Mingxing Li
Hongbin Zhou
Renqiu Xia
Renrui Zhang
...
Aojun Zhou
Botian Shi
Tao Chen
Bo Zhang
Xiangyu Yue
159
5
0
08 Dec 2024
Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
Haizhou Shi
Yibin Wang
Ligong Han
Huatian Zhang
Hao Wang
UQCV
196
2
0
07 Dec 2024
LABIIUM: AI-Enhanced Zero-configuration Measurement Automation System
LABIIUM: AI-Enhanced Zero-configuration Measurement Automation System
Emmanuel A. Olowe
Danial Chitnis
104
0
0
07 Dec 2024
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
Bingchen Li
Xin Li
Yiting Lu
Zhibo Chen
155
1
0
05 Dec 2024
Unifying KV Cache Compression for Large Language Models with LeanKV
Unifying KV Cache Compression for Large Language Models with LeanKV
Yanqi Zhang
Yuwei Hu
Runyuan Zhao
John C. S. Lui
Haibo Chen
MQ
252
7
0
04 Dec 2024
DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators
DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators
Tejumade Afonja
Hui-Po Wang
Raouf Kerkouche
Mario Fritz
SyDa
160
2
0
03 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELMCLL
145
0
0
02 Dec 2024
VideoSAVi: Self-Aligned Video Language Models without Human Supervision
VideoSAVi: Self-Aligned Video Language Models without Human Supervision
Yogesh Kulkarni
Pooyan Fazli
VLM
193
2
0
01 Dec 2024
GloCOM: A Short Text Neural Topic Model via Global Clustering Context
GloCOM: A Short Text Neural Topic Model via Global Clustering Context
Quang Nguyen
Tung Nguyen
Duc Anh Nguyen
Linh Ngo Van
Sang Dinh
Truong Nguyen
154
1
0
30 Nov 2024
Unified Parameter-Efficient Unlearning for LLMs
Chenlu Ding
Jiancan Wu
Yancheng Yuan
Jinda Lu
Kai Zhang
Alex Su
Xiang Wang
Xiangnan He
MUKELM
155
7
0
30 Nov 2024
Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models
Nadeen Fathallah
Monika Bhole
Steffen Staab
135
0
0
30 Nov 2024
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
Kaustubh Ponkshe
Raghav Singhal
Eduard A. Gorbunov
Alexey Tumanov
Samuel Horváth
Praneeth Vepakomma
247
7
0
29 Nov 2024
On Domain-Specific Post-Training for Multimodal Large Language Models
On Domain-Specific Post-Training for Multimodal Large Language Models
Daixuan Cheng
Shaohan Huang
Ziyu Zhu
Xintong Zhang
Wayne Xin Zhao
Zhongzhi Luan
Bo Dai
Zhenliang Zhang
VLM
144
5
0
29 Nov 2024
Marconi: Prefix Caching for the Era of Hybrid LLMs
Marconi: Prefix Caching for the Era of Hybrid LLMs
Rui Pan
Zhuang Wang
Zhen Jia
Can Karakus
Luca Zancato
Tri Dao
Ravi Netravali
Yida Wang
150
4
0
28 Nov 2024
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
318
4
0
28 Nov 2024
Sneaking Syntax into Transformer Language Models with Tree Regularization
Sneaking Syntax into Transformer Language Models with Tree Regularization
Ananjan Nandi
Christopher D. Manning
Shikhar Murty
121
0
0
28 Nov 2024
The importance of visual modelling languages in generative software engineering
The importance of visual modelling languages in generative software engineering
Roberto Rossi
141
1
0
27 Nov 2024
DRS: Deep Question Reformulation With Structured Output
DRS: Deep Question Reformulation With Structured Output
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Nanyun Peng
Kai-Wei Chang
KELM
159
0
0
27 Nov 2024
MetaphorShare: A Dynamic Collaborative Repository of Open Metaphor Datasets
MetaphorShare: A Dynamic Collaborative Repository of Open Metaphor Datasets
Joanne Boisson
Arif Mehmood
Jose Camacho-Collados
99
0
0
27 Nov 2024
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Zhiyang Guo
Jinxu Xiang
Kai Ma
Wengang Zhou
Haoyang Li
Ran Zhang
3DH
135
3
0
27 Nov 2024
SoK: Decentralized AI (DeAI)
SoK: Decentralized AI (DeAI)
Zhipeng Wang
Rui Sun
Elizabeth Lui
Vatsal Shah
Xihan Xiong
Jiahao Sun
Davide Crapis
William Knottenbelt
181
1
0
26 Nov 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Xinhao Liu
Jiajian Li
Yichen Jiang
Niranjan Sujay
Zhiyong Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
151
2
0
26 Nov 2024
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
Zhu Xu
Zhiqiang Zhao
Zihan Zhang
Yuchi Liu
Quanwei Shen
Fei Liu
Yu Kuang
Jian He
Conglin Liu
132
2
0
26 Nov 2024
Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models
Enhancing Answer Reliability Through Inter-Model Consensus of Large Language Models
Alireza Amiri-Margavi
Iman Jebellat
Ehsan Jebellat
Seyed Pouyan Mousavi Davoudi
155
3
0
25 Nov 2024
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
Yongwei Chen
Yushi Lan
Shangchen Zhou
Tengfei Wang
Xingang Pan
195
6
0
25 Nov 2024
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou
Xiaoyu Zhang
Yongchuan Tang
MLLMDiffM
172
1
0
24 Nov 2024
Aligning Generalisation Between Humans and Machines
Aligning Generalisation Between Humans and Machines
Filip Ilievski
Barbara Hammer
F. V. Harmelen
Benjamin Paassen
S. Saralajew
...
Vered Shwartz
Gabriella Skitalinskaya
Clemens Stachl
Gido M. van de Ven
T. Villmann
282
1
0
23 Nov 2024
On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
Elita Lobo
Chirag Agarwal
Himabindu Lakkaraju
LRM
140
10
0
22 Nov 2024
Planning-Driven Programming: A Large Language Model Programming Workflow
Planning-Driven Programming: A Large Language Model Programming Workflow
Chao Lei
Yanchuan Chang
Nir Lipovetzky
Krista A. Ehinger
159
6
0
21 Nov 2024
FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers
FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers
Zehua Pei
Hui-Ling Zhen
Xianzhi Yu
Sinno Jialin Pan
Mingxuan Yuan
Bei Yu
AI4CE
225
3
0
21 Nov 2024
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Jiange Yang
Haoyi Zhu
Yanjie Wang
Gangshan Wu
Tong He
Limin Wang
171
3
0
21 Nov 2024
Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction
Safety Without Semantic Disruptions: Editing-free Safe Image Generation via Context-preserving Dual Latent Reconstruction
Jordan Vice
Naveed Akhtar
Leonid Sigal
Ajmal Mian
Ajmal Mian
DiffM
129
0
0
21 Nov 2024
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
Javier Ferrando
Oscar Obeso
Senthooran Rajamanoharan
Neel Nanda
153
29
0
21 Nov 2024
Enhancing LLMs for Power System Simulations: A Feedback-driven Multi-agent Framework
Enhancing LLMs for Power System Simulations: A Feedback-driven Multi-agent Framework
Mengshuo Jia
Zeyu Cui
Gabriela Hug
127
1
0
21 Nov 2024
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation
Seokil Ham
H. Kim
Sangmin Woo
Changick Kim
Mamba
472
0
0
21 Nov 2024
Teaching VLMs to Localize Specific Objects from In-context Examples
Teaching VLMs to Localize Specific Objects from In-context Examples
Sivan Doveh
Nimrod Shabtay
Wei Lin
Eli Schwartz
Hilde Kuehne
...
Leonid Karlinsky
James Glass
Assaf Arbelle
S. Ullman
Muhammad Jehanzeb Mirza
VLM
153
1
0
20 Nov 2024
UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction
UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction
Yuan Yuan
Jingtao Ding
Chonghua Han
Depeng Jin
Yong Li
Yong Li
AI4TSAI4CE
148
2
0
20 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
166
1
0
20 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAGLRM
196
20
0
20 Nov 2024
Previous
123...131415...313233
Next