Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation

26 May 2023

Papers citing "Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation"

36 / 36 papers shown

Title
Retrieval Augmented Generation Evaluation for Health Documents Mario Ceresa Lorenzo Bertolini Valentin Comte Nicholas Spadaro Barbara Raffael ... Sergio Consoli Amalia Muñoz Piñeiro Alex Patak Maddalena Querci Tobias Wiesenthal RALM 3DV 39 0 1 07 May 2025
Synthline: A Product Line Approach for Synthetic Requirements Engineering Data Generation using Large Language Models Abdelkarim El-Hajjami Camille Salinesi SyDa 39 0 0 06 May 2025
On the generalization of language models from in-context learning and finetuning: a controlled study Andrew Kyle Lampinen Arslan Chaudhry Stephanie Chan Cody Wild Diane Wan Alex Ku Jorg Bornschein Razvan Pascanu Murray Shanahan James L. McClelland 57 0 0 01 May 2025
Memorization and Knowledge Injection in Gated LLMs Xu Pan Ely Hahami Zechen Zhang H. Sompolinsky KELM CLL RALM 108 1 0 30 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs Jiliang Ni Jiachen Pu Zhongyi Yang Kun Zhou Hui Wang Xiaoliang Xiao Dakui Wang Xin Li Jingfeng Luo Conggang Hu 39 0 0 18 Apr 2025
Mimic In-Context Learning for Multimodal Tasks Yuchu Jiang Jiale Fu Chenduo Hao Xinting Hu Yingzhe Peng Xin Geng Xu Yang 32 0 0 11 Apr 2025
System Log Parsing with Large Language Models: A Review Viktor Beck Max Landauer Markus Wurzenberger Florian Skopik Andreas Rauber 38 0 0 07 Apr 2025
Shh, don't say that! Domain Certification in LLMs Cornelius Emde Alasdair Paren Preetham Arvind Maxime Kayser Tom Rainforth Thomas Lukasiewicz Guohao Li Philip Torr Adel Bibi 53 1 0 26 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks Jing Yang Max Glockner Anderson de Rezende Rocha Iryna Gurevych LRM 73 1 0 07 Feb 2025
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction Moreno La Quatra Valerio Mario Salerno Yu Tsao Sabato Marco Siniscalchi 99 0 0 22 Jan 2025
In-Context Learning Distillation for Efficient Few-Shot Fine-Tuning Yifei Duan Liu Li Zirui Zhai Jinxia Yao 80 0 0 17 Dec 2024
The Lou Dataset -- Exploring the Impact of Gender-Fair Language in German Text Classification Andreas Waldis Joel Birrer Anne Lauscher Iryna Gurevych 41 1 0 26 Sep 2024
Efficient LLM Context Distillation Rajesh Upadhayayaya Zachary Smith Chritopher Kottmyer Manish Raj Osti 50 1 0 03 Sep 2024
LLM-based MOFs Synthesis Condition Extraction using Few-Shot Demonstrations Lei Shi Zhimeng Liu Yi Yang Weize Wu Yuyang Zhang ... Zipeng Liu Huobin Tan Hongyi Gao Yue Zhang Ge Wang 42 0 0 06 Aug 2024
Dialogue Ontology Relation Extraction via Constrained Chain-of-Thought Decoding Renato Vukovic David Arps Carel van Niekerk Benjamin Matthias Ruppik Hsien-chin Lin Michael Heck Milica Gašić 52 1 0 05 Aug 2024
Stress-Testing Capability Elicitation With Password-Locked Models Ryan Greenblatt Fabien Roger Dmitrii Krasheninnikov David M. Krueger 38 14 0 29 May 2024
In-Context Learning with Long-Context Models: An In-Depth Exploration Amanda Bertsch Maor Ivgi Uri Alon Jonathan Berant Matthew R. Gormley Matthew R. Gormley Graham Neubig ReLM AIMat 93 64 0 30 Apr 2024
LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing Zeyang Ma A. Chen Dong Jae Kim Tse-Husn Chen Shaowei Wang 35 47 0 27 Apr 2024
Privacy Preserving Prompt Engineering: A Survey Kennedy Edemacu Xintao Wu 60 18 0 09 Apr 2024
Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries Seanie Lee Jianpeng Cheng Joris Driesen Alexandru Coca Anders Johannsen RALM 48 1 0 20 Feb 2024
NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning Yufeng Zhao Yoshihiro Sakai Naoya Inoue 33 5 0 08 Feb 2024
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning Lucas Weber Elia Bruni Dieuwke Hupkes 32 25 0 20 Oct 2023
FTFT: Efficient and Robust Fine-Tuning by Transferring Training Dynamics Yupei Du Albert Gatt Dong Nguyen 31 1 0 10 Oct 2023
Measuring the Robustness of NLP Models to Domain Shifts Nitay Calderon Naveh Porat Eyal Ben-David Alexander Chapanin Zorik Gekhman Nadav Oved Vitaly Shalumov Roi Reichart 21 7 0 31 May 2023
Lexical Generalization Improves with Larger Models and Longer Training Elron Bandel Yoav Goldberg Yanai Elazar 64 6 0 23 Oct 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models Anas Awadalla Mitchell Wortsman Gabriel Ilharco Sewon Min Ian H. Magnusson Hannaneh Hajishirzi Ludwig Schmidt ELM OOD KELM 72 19 0 22 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review Dieuwke Hupkes Mario Giulianelli Verna Dankers Mikel Artetxe Yanai Elazar ... Leila Khalatbari Maria Ryskina Rita Frieske Ryan Cotterell Zhijing Jin 129 95 0 06 Oct 2022
Linear Connectivity Reveals Generalization Strategies Jeevesh Juneja Rachit Bansal Kyunghyun Cho João Sedoc Naomi Saphra 244 45 0 24 May 2022
Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors Ravid Shwartz-Ziv Micah Goldblum Hossein Souri Sanyam Kapoor Chen Zhu Yann LeCun A. Wilson UQCV BDL 64 43 0 20 May 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization Victor Sanh Albert Webson Colin Raffel Stephen H. Bach Lintang Sutawika ... T. Bers Stella Biderman Leo Gao Thomas Wolf Alexander M. Rush LRM 215 1,663 0 15 Oct 2021
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning Prasetya Ajie Utama N. Moosavi Victor Sanh Iryna Gurevych AAML 63 35 0 09 Sep 2021
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Ofir Press Noah A. Smith M. Lewis 253 701 0 27 Aug 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity Yao Lu Max Bartolo Alastair Moore Sebastian Riedel Pontus Stenetorp AILaw LRM 281 1,125 0 18 Apr 2021
Making Pre-trained Language Models Better Few-shot Learners Tianyu Gao Adam Fisch Danqi Chen 243 1,927 0 31 Dec 2020
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference Timo Schick Hinrich Schütze 258 1,591 0 21 Jan 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 299 6,996 0 20 Apr 2018