ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.14165
  4. Cited By
Language Models are Few-Shot Learners
v1v2v3v4 (latest)

Language Models are Few-Shot Learners

28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Ma-teusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL
ArXiv (abs)PDFHTML

Papers citing "Language Models are Few-Shot Learners"

50 / 12,272 papers shown
Title
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
Ziyu Yao
Xuxin Cheng
Zhiqi Huang
Lei Li
159
2
0
01 Jul 2025
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections
Haven Kim
Cheng-i Wang
Weihan Xu
Julian McAuley
Hao-Wen Dong
VGen
24
0
0
01 Jul 2025
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM
OrderChain: Towards General Instruct-Tuning for Stimulating the Ordinal Understanding Ability of MLLM
Jinhong Wang
Shuo Tong
Jian Liu
Dongqi Tang
Weiqiang Wang
Wentong Li
Hongxia Xu
Danny Chen
Jintai Chen
Jian Wu
LRM
74
0
0
01 Jul 2025
A Pre-trained Sequential Recommendation Framework: Popularity Dynamics for Zero-shot Transfer
A Pre-trained Sequential Recommendation Framework: Popularity Dynamics for Zero-shot Transfer
Junting Wang
Praneet Rathi
Hari Sundaram
HAIVLM
31
5
0
01 Jul 2025
SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization
SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization
Hao Zhang
Shuo Shao
Song Li
Zhenyu Zhong
Yan Liu
Zhan Qin
K. Ren
12
0
0
20 Jun 2025
Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs
Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs
Thomas Marwitz
Alexander Colsmann
Ben Breitung
Christoph Brabec
Christoph Kirchlechner
...
Michael Hirtz
Pavel A. Levkin
Yolita M. Eggeler
Tobias Schlöder
Pascal Friederich
AI4CE
28
0
0
20 Jun 2025
Latent Concept Disentanglement in Transformer-based Language Models
Latent Concept Disentanglement in Transformer-based Language Models
Guan Zhe Hong
Bhavya Vasudeva
Vatsal Sharan
Cyrus Rashtchian
Prabhakar Raghavan
Rina Panigrahy
ReLMLRM
12
0
0
20 Jun 2025
Language-driven Description Generation and Common Sense Reasoning for Video Action Recognition
Language-driven Description Generation and Common Sense Reasoning for Video Action Recognition
Xiaodan Hu
Chuhang Zou
Suchen Wang
Jaechul Kim
Narendra Ahuja
LRM
12
0
0
20 Jun 2025
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
Jiashun Cheng
Aochuan Chen
Nuo Chen
Ziqi Gao
Yuhan Li
Jia Li
Fugee Tsung
12
0
0
20 Jun 2025
A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset
A Common Pool of Privacy Problems: Legal and Technical Lessons from a Large-Scale Web-Scraped Machine Learning Dataset
Rachel Hong
Jevan Hutson
William Agnew
Imaad Huda
Tadayoshi Kohno
Jamie Morgenstern
AILaw
23
0
0
20 Jun 2025
Deep generative models as the probability transformation functions
Deep generative models as the probability transformation functions
Vitalii Bondar
Vira Babenko
Roman Trembovetskyi
Yurii Korobeinyk
Viktoriya Dzyuba
17
0
0
20 Jun 2025
Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models
Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models
Semin Kim
Yeonwoo Cha
Jaehoon Yoo
Seunghoon Hong
EGVM
27
0
0
20 Jun 2025
Co-VisiON: Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes
Co-VisiON: Co-Visibility ReasONing on Sparse Image Sets of Indoor Scenes
Chao-Yeh Chen
Nobel Dang
Juexiao Zhang
Wenkai Sun
Pengfei Zheng
Xuhang He
Yimeng Ye
Taarun Srinivas
Chen Feng
3DV
16
0
0
20 Jun 2025
Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects
Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects
Zihan Hong
Yushi Wu
Zhiting Zhao
Shanshan Feng
Jianghong Ma
Jiao Liu
Tianjun Wei
3DV
17
0
0
20 Jun 2025
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse
Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse
Paulina DeVito
Akhil Vallala
Sean Mcmahon
Yaroslav Hinda
Benjamin Thaw
Hanqi Zhuang
Hari Kalva
10
0
0
19 Jun 2025
AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models
AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models
Yuan Zhang
Chun-Kai Fan
Tao Huang
Ming Lu
Sicheng Yu
Junwen Pan
Kuan Cheng
Qi She
Shanghang Zhang
VLMLRM
14
0
0
19 Jun 2025
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang
Jing Xu
Franziska Boenisch
Michael Backes
Christopher A. Choquette-Choo
Adam Dziedzic
AAML
14
0
0
19 Jun 2025
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
From General to Targeted Rewards: Surpassing GPT-4 in Open-Ended Long-Context Generation
Zhihan Guo
Jiele Wu
Wenqian Cui
Yifei Zhang
Minda Hu
Yufei Wang
Irwin King
ALMLRM
12
0
0
19 Jun 2025
Revela: Dense Retriever Learning via Language Modeling
Revela: Dense Retriever Learning via Language Modeling
Fengyu Cai
Tong Chen
Xinran Zhao
Sihao Chen
Hongming Zhang
Sherry Tongshuang Wu
Iryna Gurevych
Heinz Koeppl
RALMVLM
13
0
0
19 Jun 2025
Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning
Self-Critique-Guided Curiosity Refinement: Enhancing Honesty and Helpfulness in Large Language Models via In-Context Learning
Duc Hieu Ho
Chenglin Fan
HILMLRM
13
0
0
19 Jun 2025
Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System
Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System
Mustafa Akben
Aaron Satko
10
0
0
19 Jun 2025
EvoLM: In Search of Lost Language Model Training Dynamics
EvoLM: In Search of Lost Language Model Training Dynamics
Zhenting Qi
Fan Nie
Alexandre Alahi
James Zou
Himabindu Lakkaraju
Yilun Du
Eric P. Xing
Sham Kakade
Hanlin Zhang
21
1
0
19 Jun 2025
TrainVerify: Equivalence-Based Verification for Distributed LLM Training
TrainVerify: Equivalence-Based Verification for Distributed LLM Training
Yunchi Lu
Youshan Miao
Cheng Tan
Peng Huang
Yi Zhu
Xian Zhang
Fan Yang
LRM
14
0
0
19 Jun 2025
Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation
Joint Tensor-Train Parameterization for Efficient and Expressive Low-Rank Adaptation
Jun Qi
Chen-Yu Liu
Sabato Marco Siniscalchi
Chao-Han Huck Yang
Min-hsiu Hsieh
12
0
0
19 Jun 2025
GFlowGR: Fine-tuning Generative Recommendation Frameworks with Generative Flow Networks
GFlowGR: Fine-tuning Generative Recommendation Frameworks with Generative Flow Networks
Y. X. R. Wang
Shengyu Zhou
Jinyu Lu
Qidong Liu
Xinhang Li
...
Feng Li
Pengjie Wang
Jian Xu
Bo Zheng
Xiangyu Zhao
12
0
0
19 Jun 2025
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Next-Token Prediction Should be Ambiguity-Sensitive: A Meta-Learning Perspective
Léo Gagnon
Eric Elmoznino
Sarthak Mittal
Tom Marty
Tejas Kasetty
Dhanya Sridhar
Guillaume Lajoie
7
0
0
19 Jun 2025
Semantic Outlier Removal with Embedding Models and LLMs
Semantic Outlier Removal with Embedding Models and LLMs
Eren Akbiyik
João Almeida
Rik Melis
Ritu Sriram
Viviana Petrescu
Vilhjálmur Vilhjálmsson
12
0
0
19 Jun 2025
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Knee-Deep in C-RASP: A Transformer Depth Hierarchy
Andy Yang
Michaël Cadilhac
David Chiang
12
0
0
19 Jun 2025
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
Fenghua Cheng
Jinxiang Wang
Sen Wang
Zi Huang
Xue Li
LRM
14
0
0
19 Jun 2025
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning
Natapong Nitarach
Warit Sirichotedumrong
Panop Pitchayarthorn
Pittawat Taveekitworachai
Potsawee Manakul
Kunat Pipatanakul
ReLMLRM
10
0
0
19 Jun 2025
Multi-use LLM Watermarking and the False Detection Problem
Multi-use LLM Watermarking and the False Detection Problem
Zihao Fu
Chris Russell
WaLM
26
0
0
19 Jun 2025
RiOT: Efficient Prompt Refinement with Residual Optimization Tree
RiOT: Efficient Prompt Refinement with Residual Optimization Tree
Chenyi Zhou
Zhengyan Shi
Yuan Yao
Lei Liang
H. Chen
Qiang Zhang
13
0
0
19 Jun 2025
Subspace-Boosted Model Merging
Subspace-Boosted Model Merging
Ronald Skorobogat
Karsten Roth
Mariana-Iuliana Georgescu
Zeynep Akata
MoMe
18
0
0
19 Jun 2025
Bridging Brain with Foundation Models through Self-Supervised Learning
Hamdi Altaheri
Fakhri Karray
Md. Milon Islam
S M Taslim Uddin Raju
Amir-Hossein Karimi
10
0
0
19 Jun 2025
Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement
Exploring and Exploiting the Inherent Efficiency within Large Reasoning Models for Self-Guided Efficiency Enhancement
Weixiang Zhao
Jiahe Guo
Yang Deng
Xingyu Sui
Yulin Hu
Yanyan Zhao
Wanxiang Che
Bing Qin
Tat-Seng Chua
Ting Liu
LRM
38
0
0
18 Jun 2025
Architecture is All You Need: Improving LLM Recommenders by Dropping the Text
Architecture is All You Need: Improving LLM Recommenders by Dropping the Text
Kevin Foley
Shaghayegh Agah
Kavya Priyanka Kakinada
7
0
0
18 Jun 2025
Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning
Context Matters! Relaxing Goals with LLMs for Feasible 3D Scene Planning
Emanuele Musumeci
Michele Brienza
F. Argenziano
Vincenzo Suriani
Daniele Nardi
D. Bloisi
9
0
0
18 Jun 2025
RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation
Xinnuo Xu
Rachel Lawrence
Kshitij Dubey
Atharva Pandey
Risa Ueno
Fabian Falck
A. Nori
Rahul Sharma
Amit Sharma
Javier González
LRM
14
0
0
18 Jun 2025
COSMMIC: Comment-Sensitive Multimodal Multilingual Indian Corpus for Summarization and Headline Generation
COSMMIC: Comment-Sensitive Multimodal Multilingual Indian Corpus for Summarization and Headline Generation
Raghvendra Kumar
S. A. Mohammed Salman
Aryan Sahu
Tridib Nandi
Pragathi Y. P.
S. Saha
Jose G. Moreno
12
0
0
18 Jun 2025
Finance Language Model Evaluation (FLaME)
Finance Language Model Evaluation (FLaME)
Glenn Matlin
Mika Okamoto
Huzaifa Pardawala
Yang Yang
Sudheer Chava
AIFinLRM
23
0
0
18 Jun 2025
Enhancing Hyperbole and Metaphor Detection with Their Bidirectional Dynamic Interaction and Emotion Knowledge
Enhancing Hyperbole and Metaphor Detection with Their Bidirectional Dynamic Interaction and Emotion Knowledge
Li Zheng
Sihang Wang
Hao Fei
Zuquan Peng
Fei Li
Jianming Fu
Chong Teng
Donghong Ji
10
0
0
18 Jun 2025
Multimodal Large Language Models for Medical Report Generation via Customized Prompt Tuning
Multimodal Large Language Models for Medical Report Generation via Customized Prompt Tuning
Chunlei Li
Jingyang Hou
Yilei Shi
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
LM&MA
25
0
0
18 Jun 2025
MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Zijian Zhou
Ao Qu
Zhaoxuan Wu
Sunghwan Kim
Alok Prakash
Daniela Rus
Jinhua Zhao
Bryan Kian Hsiang Low
Paul Liang
LLMAGOffRLLRM
10
0
0
18 Jun 2025
SecFwT: Efficient Privacy-Preserving Fine-Tuning of Large Language Models Using Forward-Only Passes
SecFwT: Efficient Privacy-Preserving Fine-Tuning of Large Language Models Using Forward-Only Passes
Jinglong Luo
Zhuo Zhang
Yehong Zhang
Shiyu Liu
Ye Dong
Xun Zhou
Hui Wang
Yue Yu
Zenglin Xu
12
0
0
18 Jun 2025
A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals
A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Enrico Motta
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
14
0
0
18 Jun 2025
From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem
From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem
Yanxu Mao
Tiehan Cui
Peipei Liu
Datao You
Hongsong Zhu
AAML
12
0
0
18 Jun 2025
When and How Unlabeled Data Provably Improve In-Context Learning
When and How Unlabeled Data Provably Improve In-Context Learning
Yingcong Li
Xiangyu Chang
Muti Kara
Xiaofeng Liu
Amit K. Roy-Chowdhury
Samet Oymak
12
0
0
18 Jun 2025
Uncovering Intention through LLM-Driven Code Snippet Description Generation
Uncovering Intention through LLM-Driven Code Snippet Description Generation
Yusuf Sulistyo Nugroho
Farah Danisha Salam
Brittany Reid
R. Kula
Kazumasa Shimari
Kenichi Matsumoto
12
0
0
18 Jun 2025
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture
SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models' Knowledge of Indian Culture
Arijit Maji
Raghvendra Kumar
Akash Ghosh
Anushka
Sriparna Saha
ELM
14
0
0
18 Jun 2025
Zero-Shot Reinforcement Learning Under Partial Observability
Zero-Shot Reinforcement Learning Under Partial Observability
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
17
0
0
18 Jun 2025
1234...244245246
Next