ResearchTrend.AI
Language Models are Few-Shot Learners


28 May 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
Prafulla Dhariwal
Arvind Neelakantan
Pranav Shyam
Girish Sastry
Amanda Askell
Sandhini Agarwal
Ariel Herbert-Voss
Gretchen Krueger
T. Henighan
R. Child
Aditya A. Ramesh
Daniel M. Ziegler
Jeff Wu
Clemens Winter
Christopher Hesse
Mark Chen
Eric Sigler
Mateusz Litwin
Scott Gray
B. Chess
Jack Clark
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
    BDL

Papers citing "Language Models are Few-Shot Learners"

50 / 10,698 papers shown
EAVIT: Efficient and Accurate Human Value Identification from Text data via LLMs
Wenhao Zhu
Yuhang Xie
Guojie Song
Xin Zhang
5
0
0
19 May 2025
CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents
Rebecca Westhäußer
Frederik Berenz
Wolfgang Minker
Sebastian Zepf
5
0
0
19 May 2025
Reasoning BO: Enhancing Bayesian Optimization with Long-Context Reasoning Power of LLMs
Zhuo Yang
Lingli Ge
Dong Han
Tianfan Fu
Yuqiang Li
5
0
0
19 May 2025
Curriculum Abductive Learning
Wen-Chao Hu
Qi-Jie Li
Lin Jia
Cunjing Ge
Yu-Feng Li
Yuan Jiang
Zhi-Hua Zhou
0
0
0
18 May 2025
Model alignment using inter-modal bridges
Ali Gholamzadeh
Noor Sajid
0
0
0
18 May 2025
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia
Wenxuan Huang
Yuntian Tang
Junbo Qiao
Jincheng Liao
...
Lin Chen
Fei Zhao
Zihan Wang
Yuan Xie
Shaohui Lin
CoGe
10
0
0
18 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao-quan Song
0
0
0
17 May 2025
Enhanced Multimodal Hate Video Detection via Channel-wise and Modality-wise Fusion
Yinghui Zhang
Tailin Chen
Yuchen Zhang
Zeyu Fu
0
0
0
17 May 2025
Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
Xuannan Liu
Zekun Li
Zheqi He
Peipei Li
Shuhan Xia
Xing Cui
Huaibo Huang
Xi Yang
Ran He
EGVM
AAML
14
0
0
17 May 2025
Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models
Camille Couturier
Spyros Mastorakis
Haiying Shen
Saravan Rajmohan
Victor Rühle
KELM
17
0
0
16 May 2025
CAMEO: Collection of Multilingual Emotional Speech Corpora
Iwona Christop
Maciej Czajka
19
0
0
16 May 2025
When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs
Xiaomin Li
Zhou Yu
Zhiwei Zhang
Xupeng Chen
Ziji Zhang
Yingying Zhuang
Narayanan Sadagopan
Anurag Beniwal
LRM
7
0
0
16 May 2025
REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?
Chenxi Jiang
Chuhao Zhou
Jianfei Yang
9
0
0
16 May 2025
SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning
Yige Xu
Xu Guo
Zhiwei Zeng
Chunyan Miao
BDL
LRM
12
0
0
16 May 2025
Visual Planning: Let's Think Only with Images
Yi Xu
Chengzu Li
Han Zhou
Xingchen Wan
Caiqi Zhang
Anna Korhonen
Ivan Vulić
LM&Ro
LRM
7
0
0
16 May 2025
Automated Identification of Logical Errors in Programs: Advancing Scalable Analysis of Student Misconceptions
Muntasir Hoq
Ananya Rao
Reisha Jaishankar
Krish Piryani
Nithya Janapati
Jessica Vandenberg
Bradford Mott
Narges Norouzi
James Lester
Bita Akram
12
0
0
16 May 2025
Illusion or Algorithm? Investigating Memorization, Emergence, and Symbolic Processing in In-Context Learning
Jingcheng Niu
Subhabrata Dutta
Ahmed Elshabrawy
Harish Tayyar Madabushi
Iryna Gurevych
19
0
0
16 May 2025
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRL
VLM
14
0
0
16 May 2025
Parkour in the Wild: Learning a General and Extensible Agile Locomotion Policy Using Multi-expert Distillation and RL Fine-tuning
Nikita Rudin
Junzhe He
Joshua Aurand
Marco Hutter
17
0
0
16 May 2025
Modeling cognitive processes of natural reading with transformer-based Language Models
Bruno Bianchi
Fermín Travi
Juan E. Kamienkowski
14
0
0
16 May 2025
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction
Mohammadtaha Bagherifard
Sahar Rajabi
Ali Edalat
Yadollah Yaghoobzadeh
KELM
26
0
0
16 May 2025
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision
Alexey Magay
Dhurba Tripathi
Yu Hao
Yi Fang
14
0
0
16 May 2025
Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis
Akarsh Kumar
Jeff Clune
Joel Lehman
Kenneth O. Stanley
OOD
4
0
0
16 May 2025
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
14
0
0
16 May 2025
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
Jacob Trauger
Ambuj Tewari
12
0
0
16 May 2025
Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation
Massimiliano Cassia
Luca Guarnera
Mirko Casu
Ignazio Zangara
Sebastiano Battiato
17
0
0
16 May 2025
DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models
Giulia Bertazzini
Daniele Baracchi
Dasara Shullani
Isao Echizen
Alessandro Piva
27
0
0
16 May 2025
ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks
Zhixiong Zhuang
Maria-Irina Nicolae
Hui-Po Wang
Mario Fritz
AAML
SILM
28
0
0
16 May 2025
Feasibility with Language Models for Open-World Compositional Zero-Shot Learning
Jae Myung Kim
Stephan Alaniz
Cordelia Schmid
Zeynep Akata
14
0
0
16 May 2025
Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data
Poli A. Nemkova
S. Ubani
Mark V. Albert
AILaw
35
0
0
15 May 2025
CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning
S. Wang
L. Zhang
Zheren Fu
Zhendong Mao
27
0
0
15 May 2025
AI-enhanced semantic feature norms for 786 concepts
Siddharth Suresh
Kushin Mukherjee
Tyler Giallanza
Xizheng Yu
Mia Patil
J. Cohen
Timothy Rogers
19
0
0
15 May 2025
Superposition Yields Robust Neural Scaling
Yizhou Liu
Ziming Liu
Jeff Gore
MILM
24
0
0
15 May 2025
Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Tuan Dung Nguyen
Duncan J. Watts
Mark E. Whiting
ELM
26
0
0
15 May 2025
MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models
Yuncheng Guo
Xiaodong Gu
OffRL
VLM
32
0
0
15 May 2025
Interpretable Risk Mitigation in LLM Agent Systems
Jan Chojnacki
LLMAG
14
0
0
15 May 2025
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Annie Wong
Thomas Bäck
Aske Plaat
Niki van Stein
Anna V. Kononova
ReLM
ELM
LRM
45
0
0
15 May 2025
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Tianyu Huai
Jie Zhou
Yuxuan Cai
Qin Chen
Wen Wu
Xingjiao Wu
Xipeng Qiu
Liang He
CLL
33
0
0
15 May 2025
Cross-Image Contrastive Decoding: Precise, Lossless Suppression of Language Priors in Large Vision-Language Models
Jianfei Zhao
Feng Zhang
Xingwu Sun
Chong Feng
MLLM
28
0
0
15 May 2025
Demystifying AI Agents: The Final Generation of Intelligence
Kevin J McNamara
Rhea Pritham Marpu
29
0
0
15 May 2025
Evaluations at Work: Measuring the Capabilities of GenAI in Use
Brandon Lepine
Gawesha Weerantunga
Juho Kim
Pamela Mishkin
Matthew Beane
9
0
0
15 May 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Ranjan Sapkota
Konstantinos I Roumeliotis
Manoj Karkee
AI4TS
24
0
0
15 May 2025
Parallel Scaling Law for Language Models
Mouxiang Chen
Binyuan Hui
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
Jianling Sun
Junyang Lin
Zhongxin Liu
MoE
LRM
37
0
0
15 May 2025
Towards Fair In-Context Learning with Tabular Foundation Models
Patrik Kenfack
Samira Ebrahimi Kahou
Ulrich Aïvodji
19
0
0
14 May 2025
A Multi-Task Foundation Model for Wireless Channel Representation Using Contrastive and Masked Autoencoder Learning
Berkay Guler
Giovanni Geraci
Hamid Jafarkhani
33
0
0
14 May 2025
Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment
Paul Tschisgale
Holger Maus
Fabian Kieser
Ben Kroehs
Stefan Petersen
Peter Wulff
ELM
LRM
41
0
0
14 May 2025
ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor
Seungbeom Choi
Jeonghoe Goo
Eunjoo Jeon
Mingyu Yang
Minsung Jang
21
0
0
14 May 2025
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
Jingcheng Niu
Xingdi Yuan
Tong Wang
Hamidreza Saghir
Amir H. Abdi
27
0
0
14 May 2025
Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?
Anthony GX-Chen
Dongyan Lin
Mandana Samiei
Doina Precup
Blake A. Richards
Rob Fergus
Kenneth Marino
CML
LRM
34
0
0
14 May 2025
SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures
Julian Kranz
Davide Gallon
Steffen Dereich
Arnulf Jentzen
24
0
0
14 May 2025