ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.09675
  4. Cited By
BERTScore: Evaluating Text Generation with BERT
v1v2v3 (latest)

BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
ArXiv (abs)PDFHTML

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown
Title
Are LLM-based Evaluators Confusing NLG Quality Criteria?
Are LLM-based Evaluators Confusing NLG Quality Criteria?
Xinyu Hu
Mingqi Gao
Sen Hu
Yang Zhang
Yicheng Chen
Teng Xu
Xiaojun Wan
AAMLELM
127
22
0
19 Feb 2024
Analysis of Multidomain Abstractive Summarization Using Salience
  Allocation
Analysis of Multidomain Abstractive Summarization Using Salience Allocation
Tohida Rehman
Raghubir Bose
Soumik Dey
S. Chattopadhyay
76
2
0
19 Feb 2024
NOTE: Notable generation Of patient Text summaries through Efficient
  approach based on direct preference optimization
NOTE: Notable generation Of patient Text summaries through Efficient approach based on direct preference optimization
Imjin Ahn
Hansle Gwon
Young-Hak Kim
Tae Joon Jun
Sanghyun Park
72
3
0
19 Feb 2024
MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement
  Learning for Discrete Prompt Optimization
MORL-Prompt: An Empirical Analysis of Multi-Objective Reinforcement Learning for Discrete Prompt Optimization
Yasaman Jafari
Dheeraj Mekala
Rose Yu
Taylor Berg-Kirkpatrick
108
8
0
18 Feb 2024
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
Tejpalsingh Siledar
Swaroop Nath
Sankara Sri Raghava Ravindra Muddu
Rupasai Rangaraju
Swaprava Nath
...
Suman Banerjee
Amey Patil
Sudhanshu Singh
M. Chelliah
Nikesh Garera
ALMLRM
72
7
0
18 Feb 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large
  Language Models
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Jaylen Jones
Lingbo Mo
Eric Fosler-Lussier
Huan Sun
102
4
0
18 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated
  Text Detectors Under Attacks
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
119
20
0
18 Feb 2024
FactPICO: Factuality Evaluation for Plain Language Summarization of
  Medical Evidence
FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence
Sebastian Antony Joseph
Lily Chen
Jan Trienes
Hannah Louisa Göke
Monika Coers
Wei Xu
Byron C. Wallace
Junyi Jessy Li
LM&MAHILM
77
11
0
18 Feb 2024
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of
  Machine-Generated Text
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
Abe Bohan Hou
Jingyu Zhang
Yichen Wang
Daniel Khashabi
Tianxing He
WaLM
168
21
0
17 Feb 2024
Centroid-Based Efficient Minimum Bayes Risk Decoding
Centroid-Based Efficient Minimum Bayes Risk Decoding
Hiroyuki Deguchi
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
Hideki Tanaka
Masao Utiyama
66
9
0
17 Feb 2024
RENOVI: A Benchmark Towards Remediating Norm Violations in
  Socio-Cultural Conversations
RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations
Haolan Zhan
Zhuang Li
Xiaoxi Kang
Tao Feng
Yuncheng Hua
...
Linhao Luo
Lay-Ki Soon
Zhaleh Semnani Azad
Ingrid Zukerman
Gholamreza Haffari
114
9
0
17 Feb 2024
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
Yougang Lyu
Lingyong Yan
Shuaiqiang Wang
Haibo Shi
D. Yin
Fajie Yuan
Zhumin Chen
Maarten de Rijke
Zhaochun Ren
82
7
0
17 Feb 2024
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned
  LLMs?
How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
Ehsan Doostmohammadi
Oskar Holmstrom
Marco Kuhlmann
76
12
0
16 Feb 2024
GenRES: Rethinking Evaluation for Generative Relation Extraction in the
  Era of Large Language Models
GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models
Pengcheng Jiang
Jiacheng Lin
Zifeng Wang
Jimeng Sun
Jiawei Han
64
6
0
16 Feb 2024
Exploring Precision and Recall to assess the quality and diversity of
  LLMs
Exploring Precision and Recall to assess the quality and diversity of LLMs
Florian Le Bronnec
Alexandre Verine
Benjamin Négrevergne
Y. Chevaleyre
Alexandre Allauzen
96
16
0
16 Feb 2024
Humans or LLMs as the Judge? A Study on Judgement Biases
Humans or LLMs as the Judge? A Study on Judgement Biases
Guiming Hardy Chen
Shunian Chen
Ziche Liu
Feng Jiang
Benyou Wang
208
113
0
16 Feb 2024
`Keep it Together': Enforcing Cohesion in Extractive Summaries by
  Simulating Human Memory
`Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory
Ronald Cardenas
Matthias Shen
Shay B. Cohen
74
0
0
16 Feb 2024
Enhancing Role-playing Systems through Aggressive Queries: Evaluation
  and Improvement
Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement
Yihong Tang
Jiao Ou
Che Liu
Fuzheng Zhang
Di Zhang
Kun Gai
101
5
0
16 Feb 2024
Disordered-DABS: A Benchmark for Dynamic Aspect-Based Summarization in
  Disordered Texts
Disordered-DABS: A Benchmark for Dynamic Aspect-Based Summarization in Disordered Texts
Xiaobo Guo
Soroush Vosoughi
70
1
0
16 Feb 2024
Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for
  Positional Discourse Coherence
Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence
Yinhong Liu
Yixuan Su
Ehsan Shareghi
Nigel Collier
77
4
0
15 Feb 2024
X-lifecycle Learning for Cloud Incident Management using LLMs
X-lifecycle Learning for Cloud Incident Management using LLMs
Drishti Goel
Fiza Husain
Aditya Singh
Supriyo Ghosh
Anjaly Parayil
Chetan Bansal
Xuchao Zhang
Saravan Rajmohan
124
18
0
15 Feb 2024
ProtChatGPT: Towards Understanding Proteins with Large Language Models
ProtChatGPT: Towards Understanding Proteins with Large Language Models
Chao Wang
Hehe Fan
Ruijie Quan
Yi Yang
108
16
0
15 Feb 2024
LLMAuditor: A Framework for Auditing Large Language Models Using
  Human-in-the-Loop
LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop
Maryam Amirizaniani
Jihan Yao
Adrian Lavergne
Elizabeth Snell Okada
Aman Chadha
Tanya Roosta
Chirag Shah
HILM
68
4
0
14 Feb 2024
Generating Diverse Translation with Perturbed kNN-MT
Generating Diverse Translation with Perturbed kNN-MT
Yuto Nishida
Makoto Morishita
Hidetaka Kamigaito
Taro Watanabe
59
1
0
14 Feb 2024
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe
  Approach
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
Maryam Amirizaniani
Elias Martin
Tanya Roosta
Aman Chadha
Chirag Shah
75
3
0
14 Feb 2024
AutoTutor meets Large Language Models: A Language Model Tutor with Rich
  Pedagogy and Guardrails
AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails
Sankalan Pal Chowdhury
Vilém Zouhar
Mrinmaya Sachan
AI4EdLRM
68
18
0
14 Feb 2024
eCeLLM: Generalizing Large Language Models for E-commerce from
  Large-scale, High-quality Instruction Data
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
Bo Peng
Xinyi Ling
Ziru Chen
Huan Sun
Xia Ning
ELM
81
21
0
13 Feb 2024
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
Xing-ming Guo
Fangxu Yu
Huan Zhang
Lianhui Qin
Bin Hu
AAML
180
92
0
13 Feb 2024
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13
  Languages
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages
N. Ousidhoum
Shamsuddeen Hassan Muhammad
Mohamed Abdalla
Idris Abdulmumin
Ibrahim Said Ahmad
...
Hailegnaw Getaneh Tilaye
Krishnapriya Vishnubhotla
Genta Indra Winata
Seid Muhie Yimam
Saif M. Mohammad
150
41
0
13 Feb 2024
A Systematic Review of Data-to-Text NLG
A Systematic Review of Data-to-Text NLG
Chinonso Osuji
Thiago Castro Ferreira
Brian Davis
69
2
0
13 Feb 2024
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness
Miltiadis Allamanis
Sheena Panthaplackel
Pengcheng Yin
ALMOffRLLRM
96
10
0
13 Feb 2024
Open-ended VQA benchmarking of Vision-Language models by exploiting
  Classification datasets and their semantic hierarchy
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
Simon Ging
M. A. Bravo
Thomas Brox
VLM
158
12
0
11 Feb 2024
Low-Resource Counterspeech Generation for Indic Languages: The Case of
  Bengali and Hindi
Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi
Mithun Das
Saurabh Kumar Pandey
Shivansh Sethi
Punyajoy Saha
Animesh Mukherjee
77
2
0
11 Feb 2024
Event-Keyed Summarization
Event-Keyed Summarization
William Gantt
Alexander Martin
Pavlo Kuchmiichuk
Aaron Steven White
58
1
0
10 Feb 2024
GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph
  Alignment via Neighborhood Partitioning and Generative Subgraph Encoding
GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding
Stefan Dernbach
Khushbu Agarwal
Alejandro Zuniga
Michael Henry
Sutanay Choudhury
101
10
0
09 Feb 2024
GPTs Are Multilingual Annotators for Sequence Generation Tasks
GPTs Are Multilingual Annotators for Sequence Generation Tasks
Juhwan Choi
Eunju Lee
Kyohoon Jin
Youngbin Kim
68
11
0
08 Feb 2024
Improving Cross-Domain Low-Resource Text Generation through LLM
  Post-Editing: A Programmer-Interpreter Approach
Improving Cross-Domain Low-Resource Text Generation through LLM Post-Editing: A Programmer-Interpreter Approach
Zhuang Li
Levon Haroutunian
Raj Tumuluri
Philip R. Cohen
Gholamreza Haffari
31
3
0
07 Feb 2024
Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue
Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue
Kun Ouyang
Liqiang Jing
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
192
3
0
06 Feb 2024
Partially Recentralization Softmax Loss for Vision-Language Models
  Robustness
Partially Recentralization Softmax Loss for Vision-Language Models Robustness
Hao Wang
Xin Zhang
Jinzhe Jiang
Yaqian Zhao
Chen Li
AAML
57
0
0
06 Feb 2024
Psychological Assessments with Large Language Models: A Privacy-Focused
  and Cost-Effective Approach
Psychological Assessments with Large Language Models: A Privacy-Focused and Cost-Effective Approach
Sergi Blanco-Cuaresma
55
1
0
05 Feb 2024
Exploiting Class Probabilities for Black-box Sentence-level Attacks
Exploiting Class Probabilities for Black-box Sentence-level Attacks
Raha Moraffah
Huan Liu
60
1
0
05 Feb 2024
VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based
  Machine Reading Comprehension
VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension
Thinh P. Ngo
Khoa Tran Anh Dang
Son T. Luu
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
134
0
0
05 Feb 2024
Analyzing Sentiment Polarity Reduction in News Presentation through
  Contextual Perturbation and Large Language Models
Analyzing Sentiment Polarity Reduction in News Presentation through Contextual Perturbation and Large Language Models
Alapan Kuila
Somnath Jena
Sudeshna Sarkar
P. Chakrabarti
AAML
44
2
0
03 Feb 2024
Explaining latent representations of generative models with large
  multimodal models
Explaining latent representations of generative models with large multimodal models
Mengdan Zhu
Zhenke Liu
Bo Pan
Abhinav Angirekula
Liang Zhao
60
2
0
02 Feb 2024
An Empirical Analysis of Diversity in Argument Summarization
An Empirical Analysis of Diversity in Argument Summarization
Michiel van der Meer
Piek T. J. M. Vossen
Catholijn M. Jonker
P. Murukannaiah
58
8
0
02 Feb 2024
A Comparative Analysis of Conversational Large Language Models in
  Knowledge-Based Text Generation
A Comparative Analysis of Conversational Large Language Models in Knowledge-Based Text Generation
Phillip Schneider
Manuel Klettner
Elena Simperl
Florian Matthes
41
7
0
02 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELMLM&MA
215
41
0
02 Feb 2024
Plan-Grounded Large Language Models for Dual Goal Conversational
  Settings
Plan-Grounded Large Language Models for Dual Goal Conversational Settings
Diogo Glória-Silva
Rafael Ferreira
Diogo Tavares
David Semedo
João Magalhães
LLMAG
75
4
0
01 Feb 2024
An Information-Theoretic Approach to Analyze NLP Classification Tasks
An Information-Theoretic Approach to Analyze NLP Classification Tasks
Luran Wang
Mark Gales
Vatsal Raina
50
1
0
01 Feb 2024
ALISON: Fast and Effective Stylometric Authorship Obfuscation
ALISON: Fast and Effective Stylometric Authorship Obfuscation
Eric Xing
Saranya Venkatraman
Thai V. Le
Dongwon Lee
DeLMO
55
2
0
01 Feb 2024
Previous
123...313233...697071
Next