Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,522 papers shown
Title
Is MultiWOZ a Solved Task? An Interactive TOD Evaluation Framework with User Simulator
Qinyuan Cheng
Linyang Li
Guofeng Quan
Feng Gao
Xiaofeng Mou
Xipeng Qiu
82
13
0
26 Oct 2022
SentBS: Sentence-level Beam Search for Controllable Summarization
Chenhui Shen
Liying Cheng
Lidong Bing
Yang You
Luo Si
122
11
0
26 Oct 2022
Leveraging Affirmative Interpretations from Negation Improves Natural Language Understanding
Md Mosharaf Hossain
Eduardo Blanco
73
5
0
26 Oct 2022
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
85
1
0
25 Oct 2022
Revision for Concision: A Constrained Paraphrase Generation Task
Wenchuan Mu
Kwanin Lim
70
3
0
25 Oct 2022
Towards Interpretable Summary Evaluation via Allocation of Contextual Embeddings to Reference Text Topics
Ben Schaper
Christopher Lohse
Marcell Streile
Andrea Giovannini
Richard Osuala
52
1
0
25 Oct 2022
Contrastive Search Is What You Need For Neural Text Generation
Yixuan Su
Nigel Collier
91
53
0
25 Oct 2022
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation
Melanie Sclar
Peter West
Sachin Kumar
Yulia Tsvetkov
Yejin Choi
64
20
0
25 Oct 2022
DEMETR: Diagnosing Evaluation Metrics for Translation
Marzena Karpinska
N. Raj
Katherine Thai
Yixiao Song
Ankita Gupta
Mohit Iyyer
87
39
0
25 Oct 2022
Mutual Information Alleviates Hallucinations in Abstractive Summarization
Liam van der Poel
Ryan Cotterell
Clara Meister
HILM
109
61
0
24 Oct 2022
"Covid vaccine is against Covid but Oxford vaccine is made at Oxford!" Semantic Interpretation of Proper Noun Compounds
Keshav Kolluru
Gabriel Stanovsky
Mausam
NAI
43
1
0
24 Oct 2022
Subspace Representations for Soft Set Operations and Sentence Similarities
Yoichi Ishibashi
Sho Yokoi
Katsuhito Sudoh
Satoshi Nakamura
NAI
66
1
0
24 Oct 2022
Knowledge Transfer from Answer Ranking to Answer Generation
Matteo Gabburo
Rik Koncel-Kedziorski
Siddhant Garg
Luca Soldaini
Alessandro Moschitti
67
8
0
23 Oct 2022
EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation
Sedrick Scott Keh
Rohit K Bharadwaj
Emmy Liu
Simone Tedeschi
Varun Gangal
Roberto Navigli
64
7
0
23 Oct 2022
TAPE: Assessing Few-shot Russian Language Understanding
Ekaterina Taktasheva
Tatiana Shavrina
Alena Fenogenova
Denis Shevelev
Nadezhda Katricheva
...
Svetlana Iordanskaia
Alena Spiridonova
Valentina Kurenshchikova
Ekaterina Artemova
Vladislav Mikhailov
AAML
79
10
0
23 Oct 2022
On the Limitations of Reference-Free Evaluations of Generated Text
Daniel Deutsch
Rotem Dror
Dan Roth
127
48
0
22 Oct 2022
Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts
Hongli Zhan
Tiberiu Sosea
Cornelia Caragea
Junjie Li
64
6
0
22 Oct 2022
ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts
Rajdeep Mukherjee
Abhinav Bohra
Akash Banerjee
Soumya Sharma
Manjunath Hegde
...
Shivani Shrivastava
Koustuv Dasgupta
Niloy Ganguly
Saptarshi Ghosh
Pawan Goyal
RALM
114
49
0
22 Oct 2022
Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation
Jinyi Hu
Xiaoyuan Yi
Wenhao Li
Maosong Sun
Xingxu Xie
DRL
129
0
0
22 Oct 2022
EnDex: Evaluation of Dialogue Engagingness at Scale
Guangxuan Xu
Ruibo Liu
Fabrice Harel-Canada
Nischal Reddy Chandra
Nanyun Peng
63
5
0
22 Oct 2022
A Dataset for Plain Language Adaptation of Biomedical Abstracts
Kush Attal
Brian D. Ondov
Dina Demner-Fushman
77
26
0
21 Oct 2022
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards
Jean-Benoit Delbrouck
Pierre J. Chambon
Christian Blüthgen
E. Tsai
Omar Almusa
C. Langlotz
MedIm
116
81
0
21 Oct 2022
WikiWhy: Answering and Explaining Cause-and-Effect Questions
Matthew Ho
Aditya Sharma
Justin Chang
Michael Stephen Saxon
Sharon Levy
Yujie Lu
William Yang Wang
ReLM
KELM
LRM
163
19
0
21 Oct 2022
A Textless Metric for Speech-to-Speech Comparison
Laurent Besacier
S. Ribeiro
Olivier Galibert
Ioan Calapodescu
103
5
0
21 Oct 2022
Analyzing and Evaluating Faithfulness in Dialogue Summarization
Bin Wang
Chen Zhang
Yan Zhang
Yiming Chen
Haizhou Li
HILM
89
16
0
21 Oct 2022
Dense Paraphrasing for Textual Enrichment
Jingxuan Tu
Kyeongmin Rim
E. Holderness
James Pustejovsky
76
6
0
20 Oct 2022
CONSISTENT: Open-Ended Question Generation From News Articles
Tuhin Chakrabarty
Justin Lewis
Smaranda Muresan
72
7
0
20 Oct 2022
Counterfactual Recipe Generation: Exploring Compositional Generalization in a Realistic Scenario
Xiao Liu
Yansong Feng
Jizhi Tang
ChenGang Hu
Dongyan Zhao
43
9
0
20 Oct 2022
Self-supervised Graph Masking Pre-training for Graph-to-Text Generation
Paul Burgess
Ehsan Shareghi
64
14
0
19 Oct 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
Xuehai He
Diji Yang
Weixi Feng
Tsu-Jui Fu
Arjun Reddy Akula
Varun Jampani
P. Narayana
Sugato Basu
William Yang Wang
Xinze Wang
VPVLM
VLM
100
15
0
19 Oct 2022
Revision Transformers: Instructing Language Models to Change their Values
Felix Friedrich
Wolfgang Stammer
P. Schramowski
Kristian Kersting
KELM
77
8
0
19 Oct 2022
A Data-Driven Investigation of Noise-Adaptive Utterance Generation with Linguistic Modification
Anupama Chingacham
Vera Demberg
Dietrich Klakow
45
1
0
19 Oct 2022
Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Jialin Wu
Raymond J. Mooney
RALM
140
11
0
18 Oct 2022
SafeText: A Benchmark for Exploring Physical Safety in Language Models
Sharon Levy
Emily Allaway
Melanie Subbiah
Lydia B. Chilton
D. Patton
Kathleen McKeown
William Yang Wang
100
45
0
18 Oct 2022
Taxonomy of Abstractive Dialogue Summarization: Scenarios, Approaches and Future Directions
Qi Jia
Yizhu Liu
Siyu Ren
Kenny Q. Zhu
83
8
0
18 Oct 2022
Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task
Boyi Deng
Keqin Bao
Dayiheng Liu
Baosong Yang
Derek F. Wong
Lidia S. Chao
Wenqiang Lei
Jun Xie
72
9
0
18 Oct 2022
Summary Workbench: Unifying Application and Evaluation of Text Summarization Models
S. Syed
Dominik Schwabe
Martin Potthast
49
0
0
18 Oct 2022
Mitigating Covertly Unsafe Text within Natural Language Systems
Alex Mei
Anisha Kabir
Sharon Levy
Melanie Subbiah
Emily Allaway
J. Judge
D. Patton
Bruce Bimber
Kathleen McKeown
William Yang Wang
127
13
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
96
335
0
17 Oct 2022
Beyond Model Interpretability: On the Faithfulness and Adversarial Robustness of Contrastive Textual Explanations
Julia El Zini
M. Awad
AAML
72
2
0
17 Oct 2022
Social Biases in Automatic Evaluation Metrics for NLG
Mingqi Gao
Xiaojun Wan
59
3
0
17 Oct 2022
StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning
Hong Chen
D. Vo
Hiroya Takamura
Yusuke Miyao
Hideki Nakayama
108
20
0
16 Oct 2022
Model Criticism for Long-Form Text Generation
Yuntian Deng
Volodymyr Kuleshov
Alexander M. Rush
117
19
0
16 Oct 2022
LEATHER: A Framework for Learning to Generate Human-like Text in Dialogue
Anthony Sicilia
Malihe Alikhani
118
4
0
14 Oct 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
Tianxiang Sun
Junliang He
Xipeng Qiu
Xuanjing Huang
89
47
0
14 Oct 2022
Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation
A. Shukla
Paheli Bhattacharya
Soham Poddar
Rajdeep Mukherjee
Kripabandhu Ghosh
Pawan Goyal
Saptarshi Ghosh
ELM
AILaw
64
51
0
14 Oct 2022
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang
Qifan Wang
Yi-Chia Wang
Maziar Sanjabi
Jingzhou Liu
Hamed Firooz
Hongning Wang
Shaoliang Nie
107
6
0
14 Oct 2022
Controlling Bias Exposure for Fair Interpretable Predictions
Zexue He
Yu Wang
Julian McAuley
Bodhisattwa Prasad Majumder
60
19
0
14 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
118
276
0
13 Oct 2022
Language Models of Code are Few-Shot Commonsense Learners
Aman Madaan
Shuyan Zhou
Uri Alon
Yiming Yang
Graham Neubig
ReLM
LRM
148
223
0
13 Oct 2022
Previous
1
2
3
...
53
54
55
...
69
70
71
Next