1909.02622
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
5 September 2019
Wei Zhao
Maxime Peyrard
Fei Liu
Yang Gao
Christian M. Meyer
Steffen Eger
Papers citing
"MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance"
50 / 165 papers shown
Title
QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance
Xiaoqiang Wang
Bang Liu
Siliang Tang
Lingfei Wu
35
9
0
29 Apr 2022
Repro: An Open-Source Library for Improving the Reproducibility and Usability of Publicly Available Research Code
Daniel Deutsch
Dan Roth
AI4CE
47
2
0
29 Apr 2022
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Daniel Deutsch
Rotem Dror
Dan Roth
22
44
0
21 Apr 2022
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation
Pei Ke
Hao Zhou
Yankai Lin
Peng Li
Jie Zhou
Xiaoyan Zhu
Minlie Huang
29
38
0
02 Apr 2022
Investigating Data Variance in Evaluations of Automatic Machine Translation Metrics
Jiannan Xiang
Huayang Li
Yahui Liu
Lemao Liu
Guoping Huang
Defu Lian
Shuming Shi
12
4
0
29 Mar 2022
Entailment Relation Aware Paraphrase Generation
Abhilasha Sancheti
Balaji Vasan Srinivasan
Rachel Rudinger
47
4
0
20 Mar 2022
RoMe: A Robust Metric for Evaluating Natural Language Generation
Md. Rony
Liubov Kovriguina
Debanjan Chaudhuri
Ricardo Usbeck
Jens Lehmann
22
12
0
17 Mar 2022
E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning
Jiangjie Chen
Rui Xu
Ziquan Fu
Wei Shi
Zhongqiao Li
Xinbo Zhang
Changzhi Sun
Lei Li
Yanghua Xiao
Hao Zhou
ELM
28
35
0
16 Mar 2022
SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization
Mathieu Ravaut
Chenyu You
Nancy F. Chen
MoE
19
93
0
13 Mar 2022
A Variational Hierarchical Model for Neural Cross-Lingual Summarization
Yunlong Liang
Fandong Meng
Chulun Zhou
Jinan Xu
Yufeng Chen
Jinsong Su
Jie Zhou
BDL
27
34
0
08 Mar 2022
Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models
Shengnan An
Yifei Li
Zeqi Lin
Qian Liu
Bei Chen
Qiang Fu
Weizhu Chen
Nanning Zheng
Jian-Guang Lou
VLM
AAML
52
40
0
07 Mar 2022
Feeding What You Need by Understanding What You Learned
Xiaoqiang Wang
Bang Liu
Fangli Xu
Bowei Long
Siliang Tang
Lingfei Wu
65
6
0
05 Mar 2022
Moving Other Way: Exploring Word Mover Distance Extensions
Ilya Smirnov
Ivan P. Yamshchikov
29
1
0
07 Feb 2022
DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
Wei Zhao
Michael Strube
Steffen Eger
29
37
0
26 Jan 2022
Discourse-Aware Soft Prompting for Text Generation
Marjan Ghazvininejad
Vladimir Karpukhin
Vera Gor
Asli Celikyilmaz
36
6
0
10 Dec 2021
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation
Pierre Colombo
Chloé Clavel
Pablo Piantanida
40
41
0
02 Dec 2021
SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation
Hong Chen
Hiroya Takamura
Hideki Nakayama
18
18
0
20 Oct 2021
Better than Average: Paired Evaluation of NLP Systems
Maxime Peyrard
Wei Zhao
Steffen Eger
Robert West
ELM
21
24
0
20 Oct 2021
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation
Moussa Kamal Eddine
Guokan Shang
A. Tixier
Michalis Vazirgiannis
26
25
0
16 Oct 2021
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors
Marvin Kaster
Wei Zhao
Steffen Eger
35
24
0
08 Oct 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
47
30
0
28 Sep 2021
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation
Mingkai Deng
Bowen Tan
Zhengzhong Liu
Eric Xing
Zhiting Hu
21
73
0
14 Sep 2021
Perturbation CheckLists for Evaluating NLG Evaluation Metrics
Ananya B. Sai
Tanay Dixit
D. Y. Sheth
S. Mohan
Mitesh M. Khapra
AAML
116
58
0
13 Sep 2021
Biomedical Data-to-Text Generation via Fine-Tuning Transformers
Ruslan Yermakov
Nicholas Drago
Angelo Ziletti
MedIm
32
13
0
03 Sep 2021
CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization
Haitao Lin
Liqun Ma
Junnan Zhu
Lu Xiang
Yu Zhou
Jiajun Zhang
Chengqing Zong
35
46
0
30 Aug 2021
Automatic Text Evaluation through the Lens of Wasserstein Barycenters
Pierre Colombo
Guillaume Staerman
Chloé Clavel
Pablo Piantanida
27
41
0
27 Aug 2021
Controllable Summarization with Constrained Markov Decision Process
Hou Pong Chan
Lu Wang
Irwin King
207
21
0
07 Aug 2021
How to Evaluate Your Dialogue Models: A Review of Approaches
Xinmeng Li
Wansen Wu
Long Qin
Quanjun Yin
ELM
30
8
0
03 Aug 2021
Generative Pretraining for Paraphrase Evaluation
J. Weston
R. Lenain
U. Meepegama
E. Fristed
AIMat
27
10
0
17 Jul 2021
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
Xiachong Feng
Xiaocheng Feng
Bing Qin
46
100
0
07 Jul 2021
Evaluation of Thematic Coherence in Microblogs
I. Bilal
Bo Wang
M. Liakata
Rob Procter
Adam Tsakalidis
33
5
0
30 Jun 2021
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis
Linyi Yang
Jiazheng Li
Padraig Cunningham
Yue Zhang
Barry Smyth
Ruihai Dong
27
47
0
29 Jun 2021
BARTScore: Evaluating Generated Text as Text Generation
Weizhe Yuan
Graham Neubig
Pengfei Liu
57
811
0
22 Jun 2021
How well do you know your summarization datasets?
Priyam Tejaswin
Dhruv Naik
Peng Liu
35
26
0
21 Jun 2021
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation
Xiang Lin
Simeng Han
Chenyu You
20
24
0
14 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
26
32
0
04 Jun 2021
Evaluating the Efficacy of Summarization Evaluation across Languages
Fajri Koto
Jey Han Lau
Timothy Baldwin
52
19
0
02 Jun 2021
Re-evaluating Word Mover's Distance
Ryoma Sato
M. Yamada
H. Kashima
38
23
0
30 May 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
Jian Guan
Xiaoxi Mao
Changjie Fan
Zitao Liu
Wenbiao Ding
Minlie Huang
AuLLM
29
79
0
19 May 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Jian Guan
Zhexin Zhang
Zhuoer Feng
Zitao Liu
Wenbiao Ding
Xiaoxi Mao
Changjie Fan
Minlie Huang
20
61
0
19 May 2021
Towards Human-Free Automatic Quality Evaluation of German Summarization
Neslihan Iskender
Oleg V. Vasilyev
Tim Polzehl
John Bohannon
Sebastian Möller
34
1
0
13 May 2021
Learning to Reason for Text Generation from Scientific Tables
N. Moosavi
Andreas Rucklé
Dan Roth
Iryna Gurevych
LMTD
LRM
24
20
0
16 Apr 2021
A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions
Guillaume Staerman
Pavlo Mozharovskyi
Pierre Colombo
Stéphan Clémençon
Florence d'Alché-Buc
OOD
69
17
0
23 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
27
197
0
22 Mar 2021
MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers
Krishna Pillutla
Swabha Swayamdipta
Rowan Zellers
John Thickstun
Sean Welleck
Yejin Choi
Zaïd Harchaoui
56
343
0
02 Feb 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
85
4,103
0
01 Jan 2021
Multi-document Summarization via Deep Learning Techniques: A Survey
Congbo Ma
W. Zhang
Mingyu Guo
Hu Wang
Quan Z. Sheng
18
126
0
10 Nov 2020
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze
Ece Takmaz
Sandro Pezzelle
Lisa Beinborn
Raquel Fernández
40
22
0
09 Nov 2020
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
Daniel Deutsch
Tania Bedrax-Weiss
Dan Roth
24
109
0
01 Oct 2020
Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning
Yuning Mao
Yanru Qu
Yiqing Xie
Xiang Ren
Jiawei Han
AI4TS
23
46
0
30 Sep 2020