BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown
Title
Rethinking Model Evaluation as Narrowing the Socio-Technical Gap
Q. V. Liao
Ziang Xiao
ALM ELM
149
32
0
01 Jun 2023
Measuring the Robustness of NLP Models to Domain Shifts
Nitay Calderon
Naveh Porat
Eyal Ben-David
Alexander Chapanin
Zorik Gekhman
Nadav Oved
Vitaly Shalumov
Roi Reichart
126
8
0
31 May 2023
Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind
Ece Takmaz
Nicolò Brandizzi
Mario Giulianelli
Sandro Pezzelle
Raquel Fernández
70
7
0
31 May 2023
Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation
Josef Jon
Ondrej Bojar
59
10
0
30 May 2023
Concise Answers to Complex Questions: Summarization of Long-form Answers
Abhilash Potluri
Fangyuan Xu
Eunsol Choi
ELM
72
11
0
30 May 2023
The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code
Xiao Liu
Da Yin
Chen Zhang
Yansong Feng
Dongyan Zhao
ELM ReLM ReCod LRM
94
22
0
30 May 2023
Event-Centric Query Expansion in Web Search
Yanan Zhang
Weijie Cui
Yangfan Zhang
Xiaoling Bai
Zhe Zhang
Jin Ma
Xinyu Chen
Tianhua Zhou
69
2
0
30 May 2023
DEPLAIN: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification
Regina Stodden
Omar Momen
Laura Kallmeyer
57
15
0
30 May 2023
KEYword based Sampling (KEYS) for Large Language Models
V. JyothirS
Zuhaib Akhtar
56
1
0
30 May 2023
A Critical Evaluation of Evaluations for Long-form Question Answering
Fangyuan Xu
Yixiao Song
Mohit Iyyer
Eunsol Choi
ELM
100
104
0
29 May 2023
Large Language Models are not Fair Evaluators
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
166
575
0
29 May 2023
Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning in Goal-Oriented Dialogue Models
Qiang Zhang
Jason Naradowsky
Yusuke Miyao
ELM
94
35
0
29 May 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
84
11
0
28 May 2023
Decoding the Underlying Meaning of Multimodal Hateful Memes
Ming Shan Hee
Wen-Haw Chong
Roy Ka-wei Lee
91
43
0
28 May 2023
MeetingBank: A Benchmark Dataset for Meeting Summarization
Yebowen Hu
Timothy Jeewun Ganter
Hanieh Deilamsalehy
Franck Dernoncourt
H. Foroosh
Fei Liu
AI4TS
82
50
0
27 May 2023
FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing
Zhuang Li
Yuyang Chai
Terry Yue Zhuo
Lizhen Qu
Gholamreza Haffari
Fei Li
Donghong Ji
Quan Hung Tran
115
33
0
27 May 2023
A Practical Toolkit for Multilingual Question and Answer Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
SyDa
85
14
0
27 May 2023
An Investigation of Evaluation Metrics for Automated Medical Note Generation
Asma Ben Abacha
Wen-wai Yim
George Michalopoulos
Thomas Lin
71
23
0
27 May 2023
Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information
Kun Zhao
Bohao Yang
Chenghua Lin
Wenge Rong
Aline Villavicencio
Xiaohui Cui
DRL
69
4
0
26 May 2023
Inter-connection: Effective Connection between Pre-trained Encoder and Decoder for Speech Translation
Yuta Nishikawa
Satoshi Nakamura
74
4
0
26 May 2023
UMSE: Unified Multi-scenario Summarization Evaluation
Shen Gao
Zhitao Yao
Chongyang Tao
Preslav Nakov
Fajie Yuan
Zhaochun Ren
Zhumin Chen
88
5
0
26 May 2023
Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization
Dongqi Pu
Yifa Wang
Vera Demberg
93
23
0
26 May 2023
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function
Yuheng Zha
Yichi Yang
Ruichen Li
Zhiting Hu
HILM
122
208
0
26 May 2023
People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts
Vít Novotný
Kristýna Luger
Michal Štefánik
Tereza Vrabcová
Ales Horak
74
1
0
26 May 2023
Evaluation of Question Generation Needs More References
Shinhyeok Oh
Hyojun Go
Hyeongdon Moon
Yunsung Lee
Myeongho Jeong
Hyun Seung Lee
Seungtaek Choi
ELM
68
8
0
26 May 2023
Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao
Hongcheng Gao
Hao Zhang
Zhijie Deng
DeLMO
84
20
0
26 May 2023
The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering
Sabrina Chiesurin
Dimitris Dimakopoulos
Marco Antonio Sobrevilla Cabezudo
Arash Eshghi
Ioannis V. Papaioannou
Verena Rieser
Ioannis Konstas
HILM
69
28
0
25 May 2023
Do You Hear The People Sing? Key Point Analysis via Iterative Clustering and Abstractive Summarisation
Hao Li
Viktor Schlegel
Riza Batista-Navarro
Goran Nenadic
61
7
0
25 May 2023
Private Meeting Summarization Without Performance Loss
Seolhwa Lee
Anders Søgaard
63
3
0
25 May 2023
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
Niels Mündler
Jingxuan He
Slobodan Jenko
Martin Vechev
HILM
74
119
0
25 May 2023
MERGE: Fast Private Text Generation
Zi Liang
Pinghui Wang
Ruofei Zhang
Nuo Xu
Lifeng Xing
Shuo Zhang
61
8
0
25 May 2023
Learning Answer Generation using Supervision from Automatic Question Answering Evaluators
Matteo Gabburo
Siddhant Garg
Rik Koncel-Kedziorski
Alessandro Moschitti
78
6
0
24 May 2023
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Geewook Kim
Hodong Lee
D. Kim
Haeji Jung
S. Park
Yoon Kim
Sangdoo Yun
Taeho Kil
Bado Lee
Seunghyun Park
VLM
105
4
0
24 May 2023
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References
Tianyi Tang
Hongyuan Lu
Yuchen Eleanor Jiang
Haoyang Huang
Dongdong Zhang
Wayne Xin Zhao
Tom Kocmi
Furu Wei
58
7
0
24 May 2023
Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks
Xiao Pu
Mingqi Gao
Xiaojun Wan
ELM
93
4
0
24 May 2023
Active Learning for Natural Language Generation
Yotam Perlitz
Ariel Gera
Michal Shmueli-Scheuer
D. Sheinwald
Noam Slonim
L. Ein-Dor
96
3
0
24 May 2023
Transferring Visual Attributes from Natural Language to Verified Image Generation
Rodrigo Valerio
João Bordalo
Michal Yarom
Yonattan Bitton
Idan Szpektor
João Magalhães
69
5
0
24 May 2023
Controlling Pre-trained Language Models for Grade-Specific Text Simplification
Sweta Agrawal
Marine Carpuat
67
15
0
24 May 2023
MuLER: Detailed and Scalable Reference-based Evaluation
Taelin Karidi
Leshem Choshen
Gal Patel
Omri Abend
74
0
0
24 May 2023
Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality
Tanay Dixit
Fei Wang
Muhao Chen
HILM
63
10
0
24 May 2023
Coverage-based Example Selection for In-Context Learning
Shivanshu Gupta
Matt Gardner
Sameer Singh
111
49
0
24 May 2023
Text encoders bottleneck compositionality in contrastive vision-language models
Amita Kamath
Jack Hessel
Kai-Wei Chang
CoGe CLIP VLM
92
21
0
24 May 2023
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory
Ziang Xiao
Susu Zhang
Vivian Lai
Q. V. Liao
ELM
115
30
0
24 May 2023
CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Weiqi Wang
Tianqing Fang
Wenxuan Ding
Baixuan Xu
Xin Liu
Yangqiu Song
Antoine Bosselut
ReLM LRM
73
43
0
24 May 2023
Drafting Event Schemas using Language Models
Anisha Gunjal
Greg Durrett
AI4TS
114
6
0
24 May 2023
SummIt: Iterative Text Summarization via ChatGPT
Haopeng Zhang
Xiao Liu
Jiawei Zhang
113
72
0
24 May 2023
Faithful Low-Resource Data-to-Text Generation through Cycle Training
Zhuoer Wang
Marcus D. Collins
Nikhita Vedula
Simone Filice
S. Malmasi
Oleg Rokhlenko
99
10
0
24 May 2023
Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark
Feng Jiang
Weihao Liu
Xiaomin Chu
Peifeng Li
Qiaoming Zhu
Haizhou Li
72
1
0
24 May 2023
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents
Benjamin Newman
Luca Soldaini
Raymond Fok
Arman Cohan
Kyle Lo
RALM
55
18
0
24 May 2023
Psychological Metrics for Dialog System Evaluation
Salvatore Giorgi
Shreya Havaldar
Farhan S. Ahmed
Zuhaib Akhtar
Shalaka Vaidya
Gary Pan
Pallavi V. Kulkarni
H. Andrew Schwartz
Joao Sedoc
94
2
0
24 May 2023