ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.09675
  4. Cited By
BERTScore: Evaluating Text Generation with BERT
v1v2v3 (latest)

BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
ArXiv (abs)PDFHTML

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,519 papers shown
Title
MrRank: Improving Question Answering Retrieval System through
  Multi-Result Ranking Model
MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model
Danupat Khamnuansin
Tawunrat Chalothorn
Ekapol Chuangsuwanich
84
2
0
09 Jun 2024
Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs
  for Open-Ended Responses
Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses
Maryam Amirizaniani
Elias Martin
Maryna Sivachenko
A. Mashhadi
Chirag Shah
LRM
91
18
0
09 Jun 2024
ATLAS: Improving Lay Summarisation with Attribute-based Control
ATLAS: Improving Lay Summarisation with Attribute-based Control
Zhihao Zhang
Tomas Goldsack
Carolina Scarton
Chenghua Lin
41
0
0
09 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELMALMLM&MA
206
44
0
09 Jun 2024
CERET: Cost-Effective Extrinsic Refinement for Text Generation
CERET: Cost-Effective Extrinsic Refinement for Text Generation
Jason (Jinglun) Cai
Hang Su
Monica Sunkara
Igor Shalyminov
Saab Mansour
82
1
0
08 Jun 2024
Write Summary Step-by-Step: A Pilot Study of Stepwise Summarization
Write Summary Step-by-Step: A Pilot Study of Stepwise Summarization
Preslav Nakov
Shen Gao
Mingzhe Li
Qingqing Zhu
Xin Gao
Xiangliang Zhang
70
0
0
08 Jun 2024
Flexible and Adaptable Summarization via Expertise Separation
Flexible and Adaptable Summarization via Expertise Separation
Preslav Nakov
Mingzhe Li
Shen Gao
Xin Cheng
Qingqing Zhu
Rui Yan
Xin Gao
Xiangliang Zhang
MoE
70
5
0
08 Jun 2024
MemeGuard: An LLM and VLM-based Framework for Advancing Content
  Moderation via Meme Intervention
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
Prince Jha
Raghav Jain
Konika Mandal
Aman Chadha
Sriparna Saha
P. Bhattacharyya
60
8
0
08 Jun 2024
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
Hao Fang
Jiawei Kong
Wenbo Yu
Bin Chen
Jiawei Li
Hao Wu
Ke Xu
Ke Xu
AAMLVLM
133
14
0
08 Jun 2024
Seeing the Unseen: Visual Metaphor Captioning for Videos
Seeing the Unseen: Visual Metaphor Captioning for Videos
Abisek Rajakumar Kalarani
Pushpak Bhattacharyya
Sumit Shekhar
VLM
71
1
0
07 Jun 2024
Key-Element-Informed sLLM Tuning for Document Summarization
Key-Element-Informed sLLM Tuning for Document Summarization
Sangwon Ryu
Heejin Do
Yunsu Kim
G. G. Lee
Jungseul Ok
103
6
0
07 Jun 2024
SC2: Towards Enhancing Content Preservation and Style Consistency in
  Long Text Style Transfer
SC2: Towards Enhancing Content Preservation and Style Consistency in Long Text Style Transfer
Jie Zhao
Ziyu Guan
Cai Xu
Wei Zhao
Yue Jiang
67
2
0
07 Jun 2024
StackSight: Unveiling WebAssembly through Large Language Models and
  Neurosymbolic Chain-of-Thought Decompilation
StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought Decompilation
Weike Fang
Zhejian Zhou
Junzhou He
Weihang Wang
LRM
42
3
0
07 Jun 2024
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI
  Interpretation in Indian Courts
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
S. Nigam
Anurag Sharma
Danush Khanna
Noel Shallum
Kripabandhu Ghosh
Arnab Bhattacharya
ELMAILaw
77
9
0
06 Jun 2024
On The Persona-based Summarization of Domain-Specific Documents
On The Persona-based Summarization of Domain-Specific Documents
Ankan Mullick
Sombit Bose
Rounak Saha
Ayan Kumar Bhowmick
Pawan Goyal
Niloy Ganguly
Prasenjit Dey
Ravi Kokku
57
3
0
06 Jun 2024
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of
  Implicit Hate Speech
Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech
Neemesh Yadav
Sarah Masud
Vikram Goyal
Vikram Goyal
Md. Shad Akhtar
Tanmoy Chakraborty
75
8
0
06 Jun 2024
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
Anushka Singh
Ananya B. Sai
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
40
1
0
06 Jun 2024
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the
  Multilingual Generation of News Headlines and Tags
XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
Faisal Tareque Shohan
Mir Tafseer Nayeem
Samsul Islam
Abu Ubaida Akash
Shafiq Joty
76
4
0
06 Jun 2024
M-QALM: A Benchmark to Assess Clinical Reading Comprehension and
  Knowledge Recall in Large Language Models via Question Answering
M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering
Anand Subramanian
Viktor Schlegel
Abhinav Ramesh Kashyap
Thanh-Tung Nguyen
Vijay Prakash Dwivedi
Stefan Winkler
ELMLM&MAAI4MH
66
3
0
06 Jun 2024
Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing
Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing
Hadi Askari
Anshuman Chhabra
Muhao Chen
Prasant Mohapatra
67
5
0
06 Jun 2024
What is the Best Way for ChatGPT to Translate Poetry?
What is the Best Way for ChatGPT to Translate Poetry?
Shanshan Wang
Derek F. Wong
Jingming Yao
Lidia S. Chao
57
5
0
05 Jun 2024
LLM-based Rewriting of Inappropriate Argumentation using Reinforcement
  Learning from Machine Feedback
LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback
Timon Ziegenbein
Gabriella Skitalinskaya
Alireza Bayat Makou
Henning Wachsmuth
LLMAGKELM
98
8
0
05 Jun 2024
The Challenges of Evaluating LLM Applications: An Analysis of Automated,
  Human, and LLM-Based Approaches
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Bhashithe Abeysinghe
Ruhan Circi
ELM
108
23
0
05 Jun 2024
Document-level Claim Extraction and Decontextualisation for
  Fact-Checking
Document-level Claim Extraction and Decontextualisation for Fact-Checking
Zhenyun Deng
Michael Schlichtkrull
Andreas Vlachos
HILM
88
3
0
05 Jun 2024
Missci: Reconstructing Fallacies in Misrepresented Science
Missci: Reconstructing Fallacies in Misrepresented Science
Max Glockner
Yufang Hou
Preslav Nakov
Iryna Gurevych
92
6
0
05 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence
  Models for Abstractive Radiology Report Summarization
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
33
1
0
05 Jun 2024
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and
  Social Experiences
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences
Yidong Huang
Jacob Sansom
Ziqiao Ma
Felix Gervits
Joyce Chai
116
18
0
05 Jun 2024
Readability-guided Idiom-aware Sentence Simplification (RISS) for
  Chinese
Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese
Jingshen Zhang
Xinglu Chen
Xinying Qiu
Zhimin Wang
Wenhe Feng
63
0
0
05 Jun 2024
Improving In-Context Learning with Prediction Feedback for Sentiment
  Analysis
Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
Hongling Xu
Qianlong Wang
Yice Zhang
Min Yang
Xi Zeng
Bing Qin
Ruifeng Xu
51
6
0
05 Jun 2024
Exploring Robustness in Doctor-Patient Conversation Summarization: An
  Analysis of Out-of-Domain SOAP Notes
Exploring Robustness in Doctor-Patient Conversation Summarization: An Analysis of Out-of-Domain SOAP Notes
Yu-Wen Chen
Julia Hirschberg
54
4
0
05 Jun 2024
BIPED: Pedagogically Informed Tutoring System for ESL Education
BIPED: Pedagogically Informed Tutoring System for ESL Education
Soonwoo Kwon
Sojung Kim
Minju Park
Seunghyun Lee
Kyuseok Kim
101
3
0
05 Jun 2024
Story Generation from Visual Inputs: Techniques, Related Tasks, and
  Challenges
Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges
Daniel A. P. Oliveira
Eugénio Ribeiro
David Martins de Matos
VGen
53
3
0
04 Jun 2024
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Maciej Besta
Lorenzo Paleari
Marcin Copik
Robert Gerstenberger
Aleš Kubíček
...
Eric Schreiber
Torsten Hoefler
Tomasz Lehmann
H. Niewiadomski
Torsten Hoefler
171
7
0
04 Jun 2024
MAD: Multi-Alignment MEG-to-Text Decoding
MAD: Multi-Alignment MEG-to-Text Decoding
Yiqian Yang
Hyejeong Jo
Yiqun Duan
Qiang Zhang
Jinni Zhou
Won Hee Lee
Renjing Xu
Hui Xiong
88
11
0
03 Jun 2024
Two Tales of Persona in LLMs: A Survey of Role-Playing and
  Personalization
Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization
Yu-Min Tseng
Yu-Chao Huang
Teng-Yun Hsiao
Yu-Ching Hsu
Chao-Wei Huang
Jia-Yin Foo
Yun-Nung Chen
LLMAG
422
92
0
03 Jun 2024
Presence or Absence: Are Unknown Word Usages in Dictionaries?
Presence or Absence: Are Unknown Word Usages in Dictionaries?
Xianghe Ma
Dominik Schlechtweg
Wei Zhao
98
4
0
02 Jun 2024
Distortion-free Watermarks are not Truly Distortion-free under Watermark
  Key Collisions
Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions
Yihan Wu
Ruibo Chen
Zhengmian Hu
Yanshuo Chen
Junfeng Guo
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
108
5
0
02 Jun 2024
DS@BioMed at ImageCLEFmedical Caption 2024: Enhanced Attention
  Mechanisms in Medical Caption Generation through Concept Detection
  Integration
DS@BioMed at ImageCLEFmedical Caption 2024: Enhanced Attention Mechanisms in Medical Caption Generation through Concept Detection Integration
Nhi Ngoc-Yen Nguyen
Le-Huy Tu
Dieu-Phuong Nguyen
Nhat-Tan Do
Minh Triet Thai
Bao-Thien Nguyen-Tat
MedIm
83
2
0
01 Jun 2024
Multi-Dimensional Optimization for Text Summarization via Reinforcement
  Learning
Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning
Sangwon Ryu
Heejin Do
Yunsu Kim
Gary Geunbae Lee
Jungseul Ok
97
3
0
01 Jun 2024
Artemis: Towards Referential Understanding in Complex Videos
Artemis: Towards Referential Understanding in Complex Videos
Jihao Qiu
Yuan Zhang
Xi Tang
Lingxi Xie
Tianren Ma
Pengyu Yan
David Doermann
Qixiang Ye
Yunjie Tian
VLMVGen
85
10
0
01 Jun 2024
Amortizing intractable inference in diffusion models for vision, language, and control
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
191
32
0
31 May 2024
FinGen: A Dataset for Argument Generation in Finance
FinGen: A Dataset for Argument Generation in Finance
Chung-Chi Chen
Hiroya Takamura
Ichiro Kobayashi
Yusuke Miyao
58
0
0
31 May 2024
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization
  for Prompt Enhancement
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement
Pengwei Zhan
Zhen Xu
Qian Tan
Jie Song
Ru Xie
81
7
0
31 May 2024
FineRadScore: A Radiology Report Line-by-Line Evaluation Technique
  Generating Corrections with Severity Scores
FineRadScore: A Radiology Report Line-by-Line Evaluation Technique Generating Corrections with Severity Scores
Alyssa Huang
Oishi Banerjee
Kay Wu
Eduardo Pontes Reis
Pranav Rajpurkar
MedImLM&MA
61
8
0
31 May 2024
OR-Bench: An Over-Refusal Benchmark for Large Language Models
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Justin Cui
Wei-Lin Chiang
Ion Stoica
Cho-Jui Hsieh
ALM
161
55
0
31 May 2024
Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Mingda Li
Xinyu Li
Yifan Chen
Wenfeng Xuan
Weinan Zhang
RALM
88
2
0
31 May 2024
Facilitating Human-LLM Collaboration through Factuality Scores and
  Source Attributions
Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
Hyo Jin Do
Rachel Ostrand
Justin D. Weisz
Casey Dugan
P. Sattigeri
Dennis L. Wei
K. Murugesan
Werner Geyer
HILM
98
10
0
30 May 2024
CoSy: Evaluating Textual Explanations of Neurons
CoSy: Evaluating Textual Explanations of Neurons
Laura Kopf
P. Bommer
Anna Hedström
Sebastian Lapuschkin
Marina M.-C. Höhne
Kirill Bykov
66
13
0
30 May 2024
ANAH: Analytical Annotation of Hallucinations in Large Language Models
ANAH: Analytical Annotation of Hallucinations in Large Language Models
Ziwei Ji
Yuzhe Gu
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai-xiang Chen
HILM
95
3
0
30 May 2024
WRDScore: New Metric for Evaluation of Natural Language Generation
  Models
WRDScore: New Metric for Evaluation of Natural Language Generation Models
Ravil Mussabayev
41
0
0
29 May 2024
Previous
123...232425...697071
Next