ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.09675
  4. Cited By
BERTScore: Evaluating Text Generation with BERT
v1v2v3 (latest)

BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
ArXiv (abs)PDFHTML

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,519 papers shown
Title
A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization
A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization
Sarvesh Soni
Dina Demner-Fushman
121
3
0
04 Jun 2025
QQSUM: A Novel Task and Model of Quantitative Query-Focused Summarization for Review-based Product Question Answering
QQSUM: A Novel Task and Model of Quantitative Query-Focused Summarization for Review-based Product Question Answering
A. Tang
Xiuzhen Zhang
M. Dinh
Zhuang Li
RALM
62
0
0
04 Jun 2025
DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation
Kun Zhao
Bohao Yang
Chen Tang
Siyuan Dai
Haoteng Tang
Chenghua Lin
Liang Zhan
25
0
0
04 Jun 2025
Measuring Human Involvement in AI-Generated Text: A Case Study on Academic Writing
Measuring Human Involvement in AI-Generated Text: A Case Study on Academic Writing
Yuchen Guo
Zhicheng Dou
H. Nguyen
Ching-Chun Chang
Saku Sugawara
Isao Echizen
DeLMO
107
0
0
04 Jun 2025
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths
Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths
Inderjeet Nair
Lu Wang
50
0
0
03 Jun 2025
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Chunkit Chan
Yauwai Yim
Hongchuan Zeng
Zhiying Zou
Xinyuan Cheng
...
Ginny Wong
Helmut Schmid
Hinrich Schütze
Simon See
Yangqiu Song
LRM
54
0
0
03 Jun 2025
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations
Jinyuan Luo
Zhen Fang
Yixuan Li
Seongheon Park
Ling Chen
AAMLHILM
57
0
0
03 Jun 2025
Go Beyond Earth: Understanding Human Actions and Scenes in Microgravity Environments
Go Beyond Earth: Understanding Human Actions and Scenes in Microgravity Environments
Di Wen
Lei Qi
Kunyu Peng
Kailun Yang
Fei Teng
...
Yufan Chen
R. Liu
Yitian Shi
M. Sarfraz
Rainer Stiefelhagen
59
0
0
03 Jun 2025
CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG
CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG
Yang Tian
Fan Liu
Jingyuan Zhang
Victoria A. Webster-Wood
Yupeng Hu
Liqiang Nie
VLM
55
0
0
03 Jun 2025
Should LLM Safety Be More Than Refusing Harmful Instructions?
Should LLM Safety Be More Than Refusing Harmful Instructions?
Utsav Maskey
Mark Dras
Usman Naseem
57
0
0
03 Jun 2025
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
Mingzhe Li
Gehao Zhang
Zhenting Wang
Shiqing Ma
Siqi Pan
Richard Cartwright
Juan Zhai
DiffM
52
0
0
03 Jun 2025
Exploring Explanations Improves the Robustness of In-Context Learning
Exploring Explanations Improves the Robustness of In-Context Learning
Ukyo Honda
Tatsushi Oka
LRM
63
0
0
03 Jun 2025
FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning
Zhuohan Xie
Dhruv Sahnan
Debopriyo Banerjee
Georgi Georgiev
Rushil Thareja
...
Ivan Koychev
Tanmoy Chakraborty
Salem Lahlou
Veselin Stoyanov
Preslav Nakov
ReLMLRM
67
0
0
03 Jun 2025
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences
Hyojin Bahng
Caroline Chan
F. Durand
Phillip Isola
EGVM
25
0
0
02 Jun 2025
Multilingual Definition Modeling
Multilingual Definition Modeling
Edison Marrese-Taylor
Erica K. Shimomoto
Alfredo Solano
Enrique Reid
50
0
0
02 Jun 2025
Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach
Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach
Heng Er Metilda Chee
Jiayin Wang
Zhiqiang Guo
Weizhi Ma
Min Zhang
53
0
0
02 Jun 2025
How do Transformer Embeddings Represent Compositions? A Functional Analysis
How do Transformer Embeddings Represent Compositions? A Functional Analysis
Aishik Nagar
Ishaan Singh Rawal
Mansi Dhanania
Cheston Tan
CoGe
62
0
0
01 Jun 2025
Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues
Speaking Beyond Language: A Large-Scale Multimodal Dataset for Learning Nonverbal Cues from Video-Grounded Dialogues
Youngmin Kim
Jiwan Chung
Jisoo Kim
Sunghyun Lee
Sangkyu Lee
Junhyeok Kim
Cheoljong Yang
Youngjae Yu
VGen
28
0
0
01 Jun 2025
From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses
From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses
Manoj Balaji Jagadeeshan
S. Bhatia
Pretam Ray
Harshul Raj Surana
A. Prathosh
Priya Mishra
Annarao Kulkarni
Ganesh Ramakrishnan
Prathosh AP
Pawan Goyal
45
0
0
01 Jun 2025
From Objectives to Questions: A Planning-based Framework for Educational Mathematical Question Generation
From Objectives to Questions: A Planning-based Framework for Educational Mathematical Question Generation
Cheng Cheng
Z. Huang
Guanhao Zhao
Yuxiang Guo
Xin Lin
J. Wu
Xin Li
Shijin Wang
42
0
0
01 Jun 2025
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation
ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation
Xinyi Liu
Lipeng Ma
Yixuan Li
Weidong Yang
Qingyuan Zhou
Jiayi Song
Shuhao Li
Ben Fei
LRM
43
0
0
01 Jun 2025
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Shaoxiong Ji
Zihao Li
Jaakko Paavola
Indraneil Paul
Hengyu Luo
Jörg Tiedemann
CLL
49
0
0
31 May 2025
AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation
AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation
Ming Wang
Peidong Wang
Lin Wu
Xiaocui Yang
Daling Wang
Shi Feng
Yuxin Chen
B. Wang
Yifei Zhang
28
0
0
31 May 2025
Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages
Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages
Hyangsuk Min
Yuho Lee
Minjeong Ban
Jiaqi Deng
Nicole Hee-Yeon Kim
Taewon Yun
Hang Su
Jason (Jinglun) Cai
Hwanjun Song
ELM
25
0
0
31 May 2025
Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments
Measuring Faithfulness and Abstention: An Automated Pipeline for Evaluating LLM-Generated 3-ply Case-Based Legal Arguments
Li Zhang
Morgan A. Gray
Jaromír Šavelka
Kevin D. Ashley
22
0
0
31 May 2025
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Emilio Villa-Cueva
Sholpan Bolatzhanova
Diana Turmakhan
Kareem Elzeky
H. Ademtew
...
Tiago Timponi Torrent
Debela Desalegn Yadeta
Injy Hamed
A. Tonja
Thamar Solorio
VLM
89
0
0
30 May 2025
DLM-One: Diffusion Language Models for One-Step Sequence Generation
DLM-One: Diffusion Language Models for One-Step Sequence Generation
Tianqi Chen
Shujian Zhang
Mingyuan Zhou
28
0
0
30 May 2025
TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
Hyundong Jin
Sicheol Sung
Shinwoo Park
SeungYeop Baik
Yo-Sub Han
22
0
0
30 May 2025
Adaptive LoRA Merge with Parameter Pruning for Low-Resource Generation
Adaptive LoRA Merge with Parameter Pruning for Low-Resource Generation
Ryota Miyano
Yuki Arase
MoMe
20
0
0
30 May 2025
Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity
Dang Nguyen
Ali Payani
Baharan Mirzasoleiman
20
0
0
30 May 2025
Structuring Radiology Reports: Challenging LLMs with Lightweight Models
Structuring Radiology Reports: Challenging LLMs with Lightweight Models
Johannes Moll
Louisa Fay
Asfandyar Azhar
Sophie Ostmeier
Tim Lueth
S. Gatidis
Curtis P. Langlotz
Jean-Benoit Delbrouck
7
0
0
30 May 2025
Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization
Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization
Utsav Maskey
Chencheng Zhu
Usman Naseem
AAMLELM
17
1
0
30 May 2025
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou
Yin Xu
Yingtao Zhu
Chuxi Xiao
Haiyang Bian
Lei Wei
Xuegong Zhang
LM&MAVLM
25
0
0
30 May 2025
Automated Structured Radiology Report Generation
Automated Structured Radiology Report Generation
Jean-Benoit Delbrouck
Justin Xu
Johannes Moll
Alois Thomas
Zhihong Chen
...
Christian Bluethgen
E. Reis
Mohamed Muneer
Maya Varma
Curtis P. Langlotz
MedIm
22
0
0
30 May 2025
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization
NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization
Hyuntak Kim
Byung-Hak Kim
28
0
0
30 May 2025
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
Li yunhan
Wu gengshen
AILawELMALM
15
0
0
30 May 2025
Detecting Stealthy Backdoor Samples based on Intra-class Distance for Large Language Models
Detecting Stealthy Backdoor Samples based on Intra-class Distance for Large Language Models
Jinwen Chen
Hainan Zhang
Fei Sun
Qinnan Zhang
Sijia Wen
Ziwei Wang
Zhiming Zheng
AAML
17
0
0
29 May 2025
AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora
AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora
Jiaxin Bai
Wei Fan
Qi Hu
Qing Zong
Chunyang Li
...
Leijie Wu
Yi Ji
Gong Zhang
Renhai Chen
Yangqiu Song
55
0
0
29 May 2025
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
Yuu Jinnai
OT
48
0
0
29 May 2025
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
Haohan Yuan
Sukhwa Hong
Haopeng Zhang
RALMReLMLRM
43
0
0
29 May 2025
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning
Jinquan Guan
Qi Chen
Lizhou Liang
Yuhang Liu
Vu Minh Hieu Phan
Minh-Son To
Jian Chen
Yutong Xie
LM&MALRM
43
0
0
29 May 2025
TabXEval: Why this is a Bad Table? An eXhaustive Rubric for Table Evaluation
TabXEval: Why this is a Bad Table? An eXhaustive Rubric for Table Evaluation
Vihang Pancholi
J. Bafna
Tejas Anvekar
Manish Shrivastava
Vivek Gupta
LMTD
27
0
0
28 May 2025
CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction
CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction
Jiali Chen
Xusen Hei
HongFei Liu
Yuancheng Wei
Zikun Deng
Jiayuan Xie
Yi Cai
Li Qing
48
0
0
28 May 2025
Principled Content Selection to Generate Diverse and Personalized Multi-Document Summaries
Principled Content Selection to Generate Diverse and Personalized Multi-Document Summaries
Vishakh Padmakumar
Zichao Wang
David Arbour
Jennifer Healey
25
0
0
28 May 2025
Text2Grad: Reinforcement Learning from Natural Language Feedback
Text2Grad: Reinforcement Learning from Natural Language Feedback
Hanyang Wang
Lu Wang
Chaoyun Zhang
Tianjun Mao
Si Qin
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
74
0
0
28 May 2025
Does Johnny Get the Message? Evaluating Cybersecurity Notifications for Everyday Users
Does Johnny Get the Message? Evaluating Cybersecurity Notifications for Everyday Users
V. Jüttner
Erik Buchmann
31
0
0
28 May 2025
Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation
Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation
Yunsoo Kim
Jinge Wu
Su-Hwan Kim
Pardeep Vasudev
Jiashu Shen
Honghan Wu
37
0
0
28 May 2025
Privacy-Preserving Chest X-ray Report Generation via Multimodal Federated Learning with ViT and GPT-2
Privacy-Preserving Chest X-ray Report Generation via Multimodal Federated Learning with ViT and GPT-2
Md. Zahid Hossain
Mustofa Ahmed
Most. Sharmin Sultana Samu
Md. Rakibul Islam
MedIm
59
0
0
27 May 2025
The Feasibility of Topic-Based Watermarking on Academic Peer Reviews
The Feasibility of Topic-Based Watermarking on Academic Peer Reviews
Alexander Nemecek
Yuzhou Jiang
Erman Ayday
WaLM
31
0
0
27 May 2025
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation
Jong Hak Moon
Geon Choi
Paloma Rabaey
Min Gwan Kim
Hyuk Gi Hong
...
J. Kim
Harshita Sharma
Daniel Coelho De Castro
Javier Alvarez-Valle
Edward Choi
LM&MA
40
0
0
27 May 2025
Previous
12345...697071
Next