Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,522 papers shown
Title
High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics
Markus Freitag
David Grangier
Qijun Tan
Bowen Liang
139
98
0
17 Nov 2021
Transparent Human Evaluation for Image Captioning
Jungo Kasai
Keisuke Sakaguchi
Lavinia Dunagan
Jacob Morrison
Ronan Le Bras
Yejin Choi
Noah A. Smith
84
49
0
17 Nov 2021
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
Yaya Shi
Xu Yang
Haiyang Xu
Chunfen Yuan
Bing Li
Weiming Hu
Zhengjun Zha
82
33
0
17 Nov 2021
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasović
Iz Beltagy
Doug Downey
Matthew E. Peters
LRM
93
110
0
16 Nov 2021
Triggerless Backdoor Attack for NLP Tasks with Clean Labels
Leilei Gan
Jiwei Li
Tianwei Zhang
Xiaoya Li
Yuxian Meng
Leilei Gan
Yi Yang
Shangwei Guo
Chun Fan
AAML
SILM
110
80
0
15 Nov 2021
Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection
Daniel Deutsch
Dan Roth
97
7
0
15 Nov 2021
Understanding Jargon: Combining Extraction and Generation for Definition Modeling
Jie Huang
Hanyin Shao
Kevin Chen-Chuan Chang
Jinjun Xiong
Wen-mei W. Hwu
67
17
0
14 Nov 2021
Variance-Aware Machine Translation Test Sets
Runzhe Zhan
Xuebo Liu
Derek F. Wong
Lidia S. Chao
DRL
62
4
0
07 Nov 2021
Dialogue Inspectional Summarization with Factual Inconsistency Awareness
Leilei Gan
Yating Zhang
Kun Kuang
Lin Yuan
Shuo Li
Changlong Sun
Xiaozhong Liu
Leilei Gan
HILM
50
5
0
05 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Wei Ping
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Yangqiu Song
VLM
ELM
AAML
97
227
0
04 Nov 2021
Automatic Evaluation and Moderation of Open-domain Dialogue Systems
Chen Zhang
João Sedoc
L. F. D’Haro
Rafael E. Banchs
Alexander I. Rudnicky
78
38
0
03 Nov 2021
Template Filling for Controllable Commonsense Reasoning
Dheeraj Rajagopal
Vivek Khetan
Bogdan Sacaleanu
A. Gershman
Andy E. Fano
Eduard H. Hovy
BDL
LRM
69
7
0
31 Oct 2021
EventNarrative: A large-scale Event-centric Dataset for Knowledge Graph-to-Text Generation
Anthony Colas
A. Sadeghian
Yue Wang
D. Wang
83
22
0
30 Oct 2021
Unsupervised Full Constituency Parsing with Neighboring Distribution Divergence
Letian Peng
Zuchao Li
Hai Zhao
37
0
0
29 Oct 2021
From Theories on Styles to their Transfer in Text: Bridging the Gap with a Hierarchical Survey
Enrica Troiano
Aswathy Velutharambath
Roman Klinger
131
9
0
29 Oct 2021
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework
Lifan Yuan
Yichi Zhang
Yangyi Chen
Wei Wei
AAML
126
34
0
28 Oct 2021
FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference
Alejandro Martín
Javier Huertas-Tato
Álvaro Huertas-García
Guillermo Villar-Rodríguez
David Camacho
HILM
122
31
0
27 Oct 2021
Assessing the Sufficiency of Arguments through Conclusion Generation
Timon Ziegenbein
Milad Alshomary
Henning Wachsmuth
ELM
78
26
0
26 Oct 2021
Better than Average: Paired Evaluation of NLP Systems
Maxime Peyrard
Wei Zhao
Steffen Eger
Robert West
ELM
122
26
0
20 Oct 2021
Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer
Eleftheria Briakou
Sweta Agrawal
Joel R. Tetreault
Marine Carpuat
81
31
0
20 Oct 2021
Multilingual Unsupervised Neural Machine Translation with Denoising Adapters
Ahmet Üstün
Alexandre Berard
Laurent Besacier
Matthias Gallé
84
47
0
20 Oct 2021
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement
HyoJung Han
Seokchan Ahn
Yoonjung Choi
Insoo Chung
Sangha Kim
Kyunghyun Cho
65
6
0
18 Oct 2021
Protecting Anonymous Speech: A Generative Adversarial Network Methodology for Removing Stylistic Indicators in Text
Rishi Balakrishnan
S. Sloan
A. Aswani
30
1
0
18 Oct 2021
BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation
Thomas Scialom
Felix Hill
78
7
0
18 Oct 2021
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metricsfor Automatic Text Generation
Moussa Kamal Eddine
Guokan Shang
A. Tixier
Michalis Vazirgiannis
77
28
0
16 Oct 2021
Analyzing Dynamic Adversarial Training Data in the Limit
Eric Wallace
Adina Williams
Robin Jia
Douwe Kiela
282
31
0
16 Oct 2021
Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER
Dong-Ho Lee
Akshen Kadakia
Kangmin Tan
Mahak Agarwal
Xinyu Feng
Takashi Shibuya
Ryosuke Mitani
Toshiyuki Sekiya
Jay Pujara
Xiang Ren
129
87
0
16 Oct 2021
Open Domain Question Answering with A Unified Knowledge Interface
Kaixin Ma
Hao Cheng
Xiaodong Liu
Eric Nyberg
Jianfeng Gao
RALM
213
40
0
16 Oct 2021
Training Conversational Agents with Generative Conversational Networks
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Dilek Z. Hakkani-Tür
72
0
0
15 Oct 2021
ASPECTNEWS: Aspect-Oriented Summarization of News Documents
Ojas Ahuja
Jiacheng Xu
A. Gupta
Kevin Horecka
Greg Durrett
107
46
0
15 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
140
22
0
15 Oct 2021
MixQG: Neural Question Generation with Mixed Answer Types
Lidiya Murakhovs'ka
Chien-Sheng Wu
Philippe Laban
Tong Niu
Wenhao Liu
Caiming Xiong
86
48
0
15 Oct 2021
Modeling Endorsement for Multi-Document Abstractive Summarization
Logan Lebanoff
Bingqing Wang
Z. Feng
Fei Liu
343
4
0
15 Oct 2021
Comparative Opinion Summarization via Collaborative Decoding
Hayate Iso
Xiaolan Wang
Stefanos Angelidis
Yoshihiko Suhara
92
25
0
14 Oct 2021
CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization
Prafulla Kumar Choubey
Alexander R. Fabbri
Jesse Vig
Chien-Sheng Wu
Wenhao Liu
Nazneen Rajani
HILM
80
19
0
14 Oct 2021
Exploring Dense Retrieval for Dialogue Response Selection
Tian Lan
Deng Cai
Yan Wang
Yixuan Su
Heyan Huang
Xian-Ling Mao
180
19
0
13 Oct 2021
Learning Compact Metrics for MT
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
94
101
0
12 Oct 2021
Speech Summarization using Restricted Self-Attention
Roshan S. Sharma
Shruti Palaskar
A. Black
Florian Metze
65
34
0
12 Oct 2021
SportsSum2.0: Generating High-Quality Sports News from Live Text Commentary
Jiaan Wang
Zhixu Li
Qiang Yang
Jianfeng Qu
Zhigang Chen
Qingsheng Liu
Guoping Hu
AI4TS
63
7
0
12 Oct 2021
Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation
Weiting Tan
Shuoyang Ding
Huda Khayrallah
Philipp Koehn
SILM
AAML
99
1
0
12 Oct 2021
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Suyoun Kim
Duc Le
Weiyi Zheng
Tarun Singh
Abhinav Arora
Xiaoyu Zhai
Christian Fuegen
Ozlem Kalinli
M. Seltzer
64
16
0
11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
154
171
0
11 Oct 2021
Can Audio Captions Be Evaluated with Image Caption Metrics?
Zelin Zhou
Zhiling Zhang
Xuenan Xu
Zeyu Xie
Mengyue Wu
Kenny Q. Zhu
68
46
0
10 Oct 2021
Natural Language for Human-Robot Collaboration: Problems Beyond Language Grounding
Seth Pate
Wei Xu
Ziyi Yang
Maxwell Love
Siddarth Ganguri
Lawson L. S. Wong
80
7
0
09 Oct 2021
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors
Marvin Kaster
Wei Zhao
Steffen Eger
111
26
0
08 Oct 2021
The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results
M. Fomicheva
Piyawat Lertvittayakumjorn
Wei Zhao
Steffen Eger
Yang Gao
ELM
97
41
0
08 Oct 2021
Toward a Visual Concept Vocabulary for GAN Latent Space
Sarah Schwettmann
Evan Hernandez
David Bau
Samuel J. Klein
Jacob Andreas
Antonio Torralba
68
15
0
08 Oct 2021
GeSERA: General-domain Summary Evaluation by Relevance Analysis
Jessica Nayeli López Espejel
Gaël de Chalendar
J. G. Flores
Thierry Charnois
Ivan Vladimir Meza Ruiz
36
0
0
07 Oct 2021
Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer
Wenda Xu
Michael Stephen Saxon
Misha Sra
Wenjie Wang
MedIm
83
13
0
06 Oct 2021
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Chen Zhang
L. F. D’Haro
Yiming Chen
Thomas Friedrichs
Haizhou Li
66
5
0
05 Oct 2021
Previous
1
2
3
...
61
62
63
...
69
70
71
Next