ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.00789
41
0

RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems

1 June 2025
Yixiao Zeng
Tianyu Cao
Danqing Wang
Xinran Zhao
Zimeng Qiu
Morteza Ziyadi
Tongshuang Wu
Lei Li
    RALM
ArXiv (abs)PDFHTML
Main:12 Pages
10 Figures
Bibliography:1 Pages
4 Tables
Appendix:12 Pages
Abstract

Retrieval-Augmented Generation (RAG) enhances recency and factuality in answers. However, existing evaluations rarely test how well these systems cope with real-world noise, conflicting between internal and external retrieved contexts, or fast-changing facts. We introduce Retrieval-Aware Robustness Evaluation (RARE), a unified framework and large-scale benchmark that jointly stress-tests query and document perturbations over dynamic, time-sensitive corpora. One of the central features of RARE is a knowledge-graph-driven synthesis pipeline (RARE-Get) that automatically extracts single and multi-hop relations from the customized corpus and generates multi-level question sets without manual intervention. Leveraging this pipeline, we construct a dataset (RARE-Set) spanning 400 expert-level time-sensitive finance, economics, and policy documents and 48,322 questions whose distribution evolves as the underlying sources change. To quantify resilience, we formalize retrieval-conditioned robustness metrics (RARE-Met) that capture a model's ability to remain correct or recover when queries, documents, or real-world retrieval results are systematically altered. Our results show that RAG systems exhibit surprising vulnerability to perturbations, with document robustness consistently being the weakest point regardless of generator size or architecture. RAG systems consistently show lower robustness on multi-hop queries than single-hop queries across all domains.

View on arXiv
@article{zeng2025_2506.00789,
  title={ RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems },
  author={ Yixiao Zeng and Tianyu Cao and Danqing Wang and Xinran Zhao and Zimeng Qiu and Morteza Ziyadi and Tongshuang Wu and Lei Li },
  journal={arXiv preprint arXiv:2506.00789},
  year={ 2025 }
}
Comments on this paper