Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.17347
Cited By
InspectorRAGet: An Introspection Platform for RAG Evaluation
26 April 2024
Kshitij P. Fadnis
Siva Sankalp Patel
O. Boni
Yannis Katsis
Sara Rosenthal
Benjamin Sznajder
Marina Danilevsky
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InspectorRAGet: An Introspection Platform for RAG Evaluation"
6 / 6 papers shown
Title
Ragas: Automated Evaluation of Retrieval Augmented Generation
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
120
190
0
26 Sep 2023
Benchmarking Large Language Models in Retrieval-Augmented Generation
Jiawei Chen
Hongyu Lin
Xianpei Han
Le Sun
3DV
RALM
62
296
0
04 Sep 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
351
4,312
0
09 Jun 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu
Yushi Hu
Weijia Shi
Nouha Dziri
Alane Suhr
Prithviraj Ammanabrolu
Noah A. Smith
Mari Ostendorf
Hannaneh Hajishirzi
ALM
139
329
0
02 Jun 2023
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann
Abhik Bhattacharjee
Abinaya Mahendiran
Alex Jinpeng Wang
Alexandros Papangelis
...
Yacine Jernite
Yi Xu
Yisi Sang
Yixin Liu
Yufang Hou
89
38
0
22 Jun 2022
A critical analysis of metrics used for measuring progress in artificial intelligence
Kathrin Blagec
Georg Dorffner
M. Moradi
Matthias Samwald
64
34
0
06 Aug 2020
1