BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,519 papers shown
Title
Mark-Evaluate: Assessing Language Generation using Population Estimation Methods
Gonçalo Mordido
Christoph Meinel
25
7
0
09 Oct 2020
What Have We Achieved on Text Summarization?
Dandan Huang
Leyang Cui
Sen Yang
Guangsheng Bao
Kun Wang
Jun Xie
Yue Zhang
116
109
0
09 Oct 2020
Online Back-Parsing for AMR-to-Text Generation
Xuefeng Bai
Linfeng Song
Yue Zhang
71
17
0
09 Oct 2020
Learning to Evaluate Translation Beyond English: BLEURT Submissions to the WMT Metrics 2020 Shared Task
Thibault Sellam
Amy Pu
Hyung Won Chung
Sebastian Gehrmann
Qijun Tan
Markus Freitag
Dipanjan Das
Ankur P. Parikh
VLM
76
37
0
08 Oct 2020
GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems
Lishan Huang
Zheng Ye
Jinghui Qin
Liang Lin
Xiaodan Liang
70
103
0
08 Oct 2020
Leveraging Discourse Rewards for Document-Level Neural Machine Translation
Inigo Jauregi Unanue
Nazanin Esmaili
Gholamreza Haffari
Massimo Piccardi
51
4
0
08 Oct 2020
Learning to Fuse Sentences with Transformers for Summarization
Logan Lebanoff
Franck Dernoncourt
Doo Soon Kim
Lidan Wang
W. Chang
Fei Liu
48
22
0
08 Oct 2020
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations
Wanrong Zhu
Xinze Wang
P. Narayana
Kazoo Sone
Sugato Basu
William Yang Wang
38
8
0
07 Oct 2020
MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics
Anthony Chen
Gabriel Stanovsky
Sameer Singh
Matt Gardner
99
51
0
07 Oct 2020
Like hiking? You probably enjoy nature: Persona-grounded Dialog with Commonsense Expansions
Bodhisattwa Prasad Majumder
Harsh Jhamtani
Taylor Berg-Kirkpatrick
Julian McAuley
88
85
0
07 Oct 2020
VCDM: Leveraging Variational Bi-encoding and Deep Contextualized Word Representations for Improved Definition Modeling
Machel Reid
Edison Marrese-Taylor
Y. Matsuo
69
18
0
07 Oct 2020
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng Chen
Chia-Jung Lee
10
1
0
06 Oct 2020
Semantically Driven Sentence Fusion: Modeling and Evaluation
Eyal Ben-David
Orgad Keller
Eric Malmi
Idan Szpektor
Roi Reichart
41
5
0
06 Oct 2020
GRUEN for Evaluating Linguistic Quality of Generated Text
Wanzheng Zhu
S. Bhat
120
61
0
06 Oct 2020
Multi-Fact Correction in Abstractive Text Summarization
Yue Dong
Shuohang Wang
Zhe Gan
Yu Cheng
Jackie C.K. Cheung
Jingjing Liu
KELM
HILM
112
119
0
06 Oct 2020
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding
Yu-An Chung
Chenguang Zhu
Michael Zeng
VLM
70
8
0
05 Oct 2020
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel Daza
Anette Frank
55
30
0
05 Oct 2020
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. Feng
Varun Gangal
Dongyeop Kang
Teruko Mitamura
Eduard H. Hovy
72
73
0
05 Oct 2020
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
Hanlu Wu
Tengfei Ma
Lingfei Wu
Tariro Manyumwa
S. Ji
SSL
73
57
0
05 Oct 2020
Second-Order NLP Adversarial Examples
John X. Morris
AAML
47
0
0
05 Oct 2020
Adversarial Attack and Defense of Structured Prediction Models
Wenjuan Han
Liwen Zhang
Yong Jiang
Kewei Tu
AAML
66
39
0
04 Oct 2020
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
Daniel Deutsch
Tania Bedrax-Weiss
Dan Roth
85
113
0
01 Oct 2020
Learning to Plan and Realize Separately for Open-Ended Dialogue Systems
Sashank Santhanam
Zhuo Cheng
Brodie Mather
Bonnie J. Dorr
Archna Bhatia
Bryanna Hebenstreit
Alan Zemel
Adam Dalton
T. Strzalkowski
Samira Shaikh
56
6
0
26 Sep 2020
Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining
Ananya B. Sai
Akash Kumar Mohankumar
Siddharth Arora
Mitesh M. Khapra
87
76
0
23 Sep 2020
Can questions summarize a corpus? Using question generation for characterizing COVID-19 research
Gabriela Surita
Rodrigo Nogueira
R. Lotufo
29
7
0
19 Sep 2020
COMET: A Neural Framework for MT Evaluation
Ricardo Rei
Craig Alan Stewart
Ana C. Farinha
A. Lavie
167
1,099
0
18 Sep 2020
Evaluating Interactive Summarization: an Expansion-Based Framework
Ori Shapira
Ramakanth Pasunuru
H. Ronen
Joey Tianyi Zhou
Yael Amsterdamer
Ido Dagan
129
2
0
17 Sep 2020
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
Jian Guan
Minlie Huang
66
70
0
16 Sep 2020
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
Xiang Gao
Yizhe Zhang
Michel Galley
Chris Brockett
Bill Dolan
ALM
87
106
0
15 Sep 2020
Dialogue-adaptive Language Model Pre-training From Quality Estimation
Junlong Li
Zhuosheng Zhang
Hai Zhao
OffRL
61
12
0
10 Sep 2020
Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples
Jin Yong Yoo
John X. Morris
Eli Lifland
Yanjun Qi
AAML
121
53
0
09 Sep 2020
Summary-Source Proposition-level Alignment: Task, Datasets and Supervised Baseline
Ori Ernst
Ori Shapira
Ramakanth Pasunuru
Michael Lepioshkin
Jacob Goldberger
Joey Tianyi Zhou
Ido Dagan
111
28
0
01 Sep 2020
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
99
237
0
27 Aug 2020
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh Davoodijam
Nasser Ghadiri
Maryam Lotfi Shahreza
Fabio Rinaldi
34
16
0
27 Aug 2020
Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR
Juri Opitz
Anette Frank
155
35
0
20 Aug 2020
Perception Score, A Learned Metric for Open-ended Text Generation Evaluation
Jing Gu
Qingyang Wu
Zhou Yu
75
12
0
07 Aug 2020
Which Kind Is Better in Open-domain Multi-turn Dialog,Hierarchical or Non-hierarchical Models? An Empirical Study
Tian Lan
Xian-Ling Mao
Wei Wei
Heyan Huang
66
3
0
07 Aug 2020
A critical analysis of metrics used for measuring progress in artificial intelligence
Kathrin Blagec
Georg Dorffner
M. Moradi
Matthias Samwald
74
34
0
06 Aug 2020
Neural Language Generation: Formulation, Methods, and Evaluation
Cristina Garbacea
Qiaozhu Mei
153
30
0
31 Jul 2020
SummEval: Re-evaluating Summarization Evaluation
Alexander R. Fabbri
Wojciech Kryściński
Bryan McCann
Caiming Xiong
R. Socher
Dragomir R. Radev
HILM
131
724
0
24 Jul 2020
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. Ribeiro
Martin Schmitt
Hinrich Schütze
Iryna Gurevych
97
218
0
16 Jul 2020
SacreROUGE: An Open-Source Library for Using and Developing Summarization Evaluation Metrics
Daniel Deutsch
Dan Roth
97
26
0
10 Jul 2020
DART: Open-Domain Structured Data Record to Text Generation
Linyong Nan
Dragomir R. Radev
Rui Zhang
Amrit Rau
Abhinand Sivaprasad
...
Y. Tan
Xi Lin
Caiming Xiong
R. Socher
Nazneen Rajani
60
201
0
06 Jul 2020
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards
Zi-Yi Dou
Sachin Kumar
Yulia Tsvetkov
82
10
0
27 Jun 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM
LM&MA
150
389
0
26 Jun 2020
Speaker Sensitive Response Evaluation Model
Jinyeong Bak
Alice Oh
55
10
0
12 Jun 2020
Revisiting Few-sample BERT Fine-tuning
Tianyi Zhang
Felix Wu
Arzoo Katiyar
Kilian Q. Weinberger
Yoav Artzi
180
446
0
10 Jun 2020
Understanding Points of Correspondence between Sentences for Abstractive Summarization
Logan Lebanoff
John Muchovej
Franck Dernoncourt
Doo Soon Kim
Lidan Wang
Walter Chang
Fei Liu
65
28
0
10 Jun 2020
CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training
Qipeng Guo
Zhijing Jin
Xipeng Qiu
Weinan Zhang
David Wipf
Zheng Zhang
126
61
0
08 Jun 2020
Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
Maha Elbayad
M. Ustaszewski
Emmanuelle Esperança-Rodier
Francis Brunet Manquat
Jakob Verbeek
Laurent Besacier
OffRL
75
10
0
01 Jun 2020