Retrieval-Augmented LLM Agents: Learning to Learn from Experience

18 March 2026

Thomas Palmeira Ferraz

Romain Deffayet

Vassilina Nikoulina

Hervé Déjean

Stéphane Clinchant

RALM

AIFin


Main:10 Pages

9 Figures

Bibliography:5 Pages

21 Tables

Appendix:16 Pages

Abstract

While large language models (LLMs) have advanced the development of general-purpose agents, achieving robust generalization to unseen tasks remains a significant challenge. Current approaches typically rely on either fine-tuning or training-free memory-augmented generation using retrieved experience; yet both have limitations: fine-tuning often fails to extrapolate to new tasks, while experience retrieval often underperforms compared to supervised baselines. In this work, we propose to combine these approaches and systematically study how to train retrieval-augmented LLM agents to effectively leverage retrieved trajectories in-context. First, we establish a robust supervised fine-tuning (SFT) recipe using LoRA that outperforms several state-of-the-art agent training pipelines. Second, we provide a detailed analysis of key design choices for experience retrieval, identifying optimal strategies for storage, querying, and trajectory selection. Finally, we propose a pipeline that integrates experience retrieval into the fine-tuning process. Our results demonstrate that this combined approach significantly improves generalization to unseen tasks, providing a scalable and effective framework for building agents that learn to learn from experience.
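As a rough illustration of what "integrating experience retrieval into the fine-tuning process" could look like, the sketch below builds one supervised training example whose prompt contains retrieved past trajectories. This is not the authors' pipeline: the `Trajectory` type, the bag-of-words similarity, the prompt layout, and the names `retrieve` and `build_sft_example` are all assumptions made for illustration. A real system would more plausibly use a learned embedding model for retrieval.

```python
import math
from collections import Counter
from dataclasses import dataclass


@dataclass
class Trajectory:
    """Hypothetical record of one stored experience."""
    task: str          # natural-language task description
    steps: list[str]   # observation/action turns, flattened to text


def _bow(text: str) -> Counter:
    # Toy bag-of-words representation (assumption: stands in for an embedder).
    return Counter(text.lower().split())


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query: str, memory: list[Trajectory], k: int = 1) -> list[Trajectory]:
    # Rank stored trajectories by similarity between the query and their task text.
    q = _bow(query)
    ranked = sorted(memory, key=lambda t: _cosine(q, _bow(t.task)), reverse=True)
    return ranked[:k]


def build_sft_example(
    task: str, target_steps: list[str], memory: list[Trajectory], k: int = 1
) -> dict:
    # Put the retrieved trajectories in the prompt; the gold trajectory for the
    # current task becomes the completion the model is trained to produce.
    retrieved = retrieve(task, memory, k)
    context = "\n\n".join(
        f"Past task: {t.task}\n" + "\n".join(t.steps) for t in retrieved
    )
    prompt = f"{context}\n\nCurrent task: {task}\n"
    return {"prompt": prompt, "completion": "\n".join(target_steps)}
```

Under these assumptions, the same `retrieve` call would be used at inference time, so the model sees retrieved trajectories in the same format during training and deployment.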
