arXiv: 2209.02317
Layer or Representation Space: What makes BERT-based Evaluation Metrics Robust?
6 September 2022
Doan Nam Long Vu
N. Moosavi
Steffen Eger
Papers citing "Layer or Representation Space: What makes BERT-based Evaluation Metrics Robust?"
10 / 10 papers shown
MEANT: Multimodal Encoder for Antecedent Information
Benjamin Iyoya Irving
Annika Marie Schoene
AIFin
34
0
0
10 Nov 2024
LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores
Yiqi Liu
N. Moosavi
Chenghua Lin
ELM
30
48
0
16 Nov 2023
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics
Christoph Leiter
Juri Opitz
Daniel Deutsch
Yang Gao
Rotem Dror
Steffen Eger
ALM
LRM
ELM
40
31
0
30 Oct 2023
Towards Explainable Evaluation Metrics for Machine Translation
Christoph Leiter
Piyawat Lertvittayakumjorn
M. Fomicheva
Wei-Ye Zhao
Yang Gao
Steffen Eger
ELM
30
13
0
22 Jun 2023
Evaluating Machine Translation Quality with Conformal Predictive Distributions
Patrizio Giovannotti
UQLM
19
7
0
02 Jun 2023
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist
Iftitahu Ni'mah
Meng Fang
Vlado Menkovski
Mykola Pechenizkiy
30
13
0
15 May 2023
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
Tianxing He
Jingyu Zhang
Tianle Wang
Sachin Kumar
Kyunghyun Cho
James R. Glass
Yulia Tsvetkov
40
44
0
20 Dec 2022
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics
Daniil Larionov
Jens Grunwald
Christoph Leiter
Steffen Eger
22
5
0
20 Sep 2022
MENLI: Robust Evaluation Metrics from Natural Language Inference
Yanran Chen
Steffen Eger
32
15
0
15 Aug 2022
Perturbation CheckLists for Evaluating NLG Evaluation Metrics
Ananya B. Sai
Tanay Dixit
D. Y. Sheth
S. Mohan
Mitesh M. Khapra
AAML
113
57
0
13 Sep 2021