1904.09675
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Papers citing "BERTScore: Evaluating Text Generation with BERT"
50 / 1,232 papers shown
Title
To Augment or Not to Augment? A Comparative Study on Text Augmentation Techniques for Low-Resource NLP
Gözde Gül Sahin
42
33
0
18 Nov 2021
How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN
R. Thomas McCoy
P. Smolensky
Tal Linzen
Jianfeng Gao
Asli Celikyilmaz
SyDa
25
119
0
18 Nov 2021
High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics
Markus Freitag
David Grangier
Qijun Tan
Bowen Liang
33
92
0
17 Nov 2021
Transparent Human Evaluation for Image Captioning
Jungo Kasai
Keisuke Sakaguchi
Lavinia Dunagan
Jacob Morrison
Ronan Le Bras
Yejin Choi
Noah A. Smith
33
47
0
17 Nov 2021
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
Yaya Shi
Xu Yang
Haiyang Xu
Chunfeng Yuan
Bing Li
Weiming Hu
Zhengjun Zha
39
33
0
17 Nov 2021
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasović
Iz Beltagy
Doug Downey
Matthew E. Peters
LRM
30
106
0
16 Nov 2021
Triggerless Backdoor Attack for NLP Tasks with Clean Labels
Leilei Gan
Jiwei Li
Tianwei Zhang
Xiaoya Li
Yuxian Meng
Fei Wu
Yi Yang
Shangwei Guo
Chun Fan
AAML
SILM
27
74
0
15 Nov 2021
Dialogue Inspectional Summarization with Factual Inconsistency Awareness
Leilei Gan
Yating Zhang
Kun Kuang
Lin Yuan
Shuo Li
Changlong Sun
Xiaozhong Liu
Fei Wu
HILM
24
4
0
05 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Boxin Wang
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Bo Li
VLM
ELM
AAML
33
216
0
04 Nov 2021
Automatic Evaluation and Moderation of Open-domain Dialogue Systems
Chen Zhang
João Sedoc
L. F. D’Haro
Rafael E. Banchs
Alexander I. Rudnicky
22
36
0
03 Nov 2021
FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference
Alejandro Martín
Javier Huertas-Tato
Álvaro Huertas-García
Guillermo Villar-Rodríguez
David Camacho
HILM
30
31
0
27 Oct 2021
Assessing the Sufficiency of Arguments through Conclusion Generation
Timon Ziegenbein
Milad Alshomary
Henning Wachsmuth
ELM
21
25
0
26 Oct 2021
Better than Average: Paired Evaluation of NLP Systems
Maxime Peyrard
Wei Zhao
Steffen Eger
Robert West
ELM
19
24
0
20 Oct 2021
Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer
Eleftheria Briakou
Sweta Agrawal
Joel R. Tetreault
Marine Carpuat
23
30
0
20 Oct 2021
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement
HyoJung Han
Seokchan Ahn
Yoonjung Choi
Insoo Chung
Sangha Kim
Kyunghyun Cho
33
6
0
18 Oct 2021
BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation
Thomas Scialom
Felix Hill
28
7
0
18 Oct 2021
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation
Moussa Kamal Eddine
Guokan Shang
A. Tixier
Michalis Vazirgiannis
26
25
0
16 Oct 2021
ASPECTNEWS: Aspect-Oriented Summarization of News Documents
Ojas Ahuja
Jiacheng Xu
A. Gupta
Kevin Horecka
Greg Durrett
84
46
0
15 Oct 2021
Guiding Visual Question Generation
Nihir Vedd
Zixu Wang
Marek Rei
Yishu Miao
Lucia Specia
89
23
0
15 Oct 2021
Learning Compact Metrics for MT
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
38
99
0
12 Oct 2021
Speech Summarization using Restricted Self-Attention
Roshan S. Sharma
Shruti Palaskar
A. Black
Florian Metze
30
33
0
12 Oct 2021
Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation
Weiting Tan
Shuoyang Ding
Huda Khayrallah
Philipp Koehn
SILM
AAML
37
1
0
12 Oct 2021
Can Audio Captions Be Evaluated with Image Caption Metrics?
Zelin Zhou
Zhiling Zhang
Xuenan Xu
Zeyu Xie
Mengyue Wu
Kenny Q. Zhu
30
43
0
10 Oct 2021
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors
Marvin Kaster
Wei Zhao
Steffen Eger
33
24
0
08 Oct 2021
The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results
M. Fomicheva
Piyawat Lertvittayakumjorn
Wei Zhao
Steffen Eger
Yang Gao
ELM
24
39
0
08 Oct 2021
Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer
Wenda Xu
Michael Stephen Saxon
Misha Sra
Wenjie Wang
MedIm
19
13
0
06 Oct 2021
Truth-Conditional Captioning of Time Series Data
Harsh Jhamtani
Taylor Berg-Kirkpatrick
AI4TS
43
7
0
05 Oct 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
48
43
0
30 Sep 2021
Key Point Analysis via Contrastive Learning and Extractive Argument Summarization
Milad Alshomary
Timon Ziegenbein
S. Syed
Philipp Heinisch
Maximilian Spliethöver
Philipp Cimiano
Martin Potthast
Henning Wachsmuth
59
15
0
30 Sep 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
47
30
0
28 Sep 2021
More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering
Yang Bai
D. Wang
96
10
0
25 Sep 2021
Rethinking Crowd Sourcing for Semantic Similarity
Shaul Solomon
Adam Cohn
Hernan Rosenblum
Chezi Hershkovitz
Ivan P. Yamshchikov
21
2
0
24 Sep 2021
Scalable Fact-checking with Human-in-the-Loop
Jing Yang
D. Vega-Oliveros
Taís Seibt
Anderson de Rezende Rocha
24
10
0
22 Sep 2021
Recursively Summarizing Books with Human Feedback
Jeff Wu
Long Ouyang
Daniel M. Ziegler
Nisan Stiennon
Ryan J. Lowe
Jan Leike
Paul Christiano
ALM
40
296
0
22 Sep 2021
MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization
Xinnuo Xu
Ondrej Dusek
Shashi Narayan
Verena Rieser
Ioannis Konstas
HILM
28
6
0
22 Sep 2021
MOVER: Mask, Over-generate and Rank for Hyperbole Generation
Yunxiang Zhang
Xiaojun Wan
27
15
0
16 Sep 2021
Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering
Zhe-nan Lin
Yitao Cai
Xiaojun Wan
45
13
0
15 Sep 2021
Assisting the Human Fact-Checkers: Detecting All Previously Fact-Checked Claims in a Document
Shaden Shaar
Nikola Georgiev
Firoj Alam
Giovanni Da San Martino
Aisha Mohamed
Preslav Nakov
HILM
75
26
0
14 Sep 2021
MMCoVaR: Multimodal COVID-19 Vaccine Focused Data Repository for Fake News Detection and a Baseline Architecture for Classification
Mingxuan Chen
Xinqiao Chu
K. P. Subbalakshmi
46
26
0
14 Sep 2021
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation
Mingkai Deng
Bowen Tan
Zhengzhong Liu
Eric Xing
Zhiting Hu
16
73
0
14 Sep 2021
Fine Grained Human Evaluation for English-to-Chinese Machine Translation: A Case Study on Scientific Text
Ming Liu
Heng Zhang
Guanhao Wu
37
1
0
13 Sep 2021
Perturbation CheckLists for Evaluating NLG Evaluation Metrics
Ananya B. Sai
Tanay Dixit
D. Y. Sheth
S. Mohan
Mitesh M. Khapra
AAML
116
58
0
13 Sep 2021
"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding
Faeze Brahman
Meng Huang
Oyvind Tafjord
Chao Zhao
Mrinmaya Sachan
Snigdha Chaturvedi
27
53
0
12 Sep 2021
StreamHover: Livestream Transcript Summarization and Annotation
Sangwoo Cho
Franck Dernoncourt
Timothy Jeewun Ganter
Trung Bui
Nedim Lipka
Walter Chang
Hailin Jin
Jonathan Brandt
H. Foroosh
Fei Liu
3DGS
AI4TS
24
29
0
11 Sep 2021
Towards Zero-shot Commonsense Reasoning with Self-supervised Refinement of Language Models
T. Klein
Moin Nabi
ReLM
LRM
35
8
0
10 Sep 2021
HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge
Yufei Tian
A. Sridhar
Nanyun Peng
33
27
0
10 Sep 2021
BiSECT: Learning to Split and Rephrase Sentences with Bitexts
Joongwon Kim
Mounica Maddela
Reno Kriz
Wei Xu
Chris Callison-Burch
58
25
0
10 Sep 2021
A Large-Scale Study of Machine Translation in the Turkic Languages
Jamshidbek Mirzakhalov
A. Babu
Duygu Ataman
S. Kariev
Francis M. Tyers
...
Esra Onal
Shaxnoza Pulatova
Ahsan Wahab
Orhan Firat
Sriram Chellappan
26
28
0
09 Sep 2021
Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models
Steven Y. Feng
Kevin Lu
Zhuofu Tao
Malihe Alikhani
Teruko Mitamura
Eduard H. Hovy
Varun Gangal
LRM
43
13
0
08 Sep 2021
Mixup Decoding for Diverse Machine Translation
Jicheng Li
Pengzhi Gao
Xuanfu Wu
Yang Feng
Zhongjun He
Hua Wu
Haifeng Wang
33
14
0
08 Sep 2021