arXiv:1904.09675
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Papers citing "BERTScore: Evaluating Text Generation with BERT"
50 / 3,519 papers shown
Title
Libra: Leveraging Temporal Images for Biomedical Radiology Analysis
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
MedIm
195
1
0
28 Nov 2024
Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator
Frederic Kirstein
Terry Ruas
Bela Gipp
173
2
0
27 Nov 2024
AMPS: ASR with Multimodal Paraphrase Supervision
Amruta Parulekar
Abhishek Gupta
Sameep Chattopadhyay
Preethi Jyothi
134
0
0
27 Nov 2024
DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model
JiHwan Moon
Jihoon Park
Jungeun Kim
Jongseong Bae
Hyeongwoo Jeon
Ha Young Kim
140
1
0
26 Nov 2024
Socio-Emotional Response Generation: A Human Evaluation Protocol for LLM-Based Conversational Systems
Lorraine Vanel
Ariel R. Ramos Vela
Alya Yacoubi
Chloé Clavel
106
0
0
26 Nov 2024
Safe to Serve: Aligning Instruction-Tuned Models for Safety and Helpfulness
Avinash Amballa
Durga Sandeep Saluru
Gayathri Akkinapalli
Abhishek Sureddy
Akshay Kumar Sureddy
ALM
109
0
0
26 Nov 2024
TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
Yuan-Ming Li
An-Lan Wang
Kun-Yu Lin
Yu-Ming Tang
Ling-an Zeng
Jian-Fang Hu
Wei-Shi Zheng
181
6
0
26 Nov 2024
From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge
Dawei Li
Bohan Jiang
Liangjie Huang
Alimohammad Beigi
Chengshuai Zhao
...
Canyu Chen
Tianhao Wu
Kai Shu
Lu Cheng
Huan Liu
ELM
AILaw
362
112
0
25 Nov 2024
LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation
Steven Song
Anirudh Subramanyam
Irene Madejski
Robert L. Grossman
MedIm
VLM
173
0
0
25 Nov 2024
LLM Augmentations to support Analytical Reasoning over Multiple Documents
Raquib Bin Yousuf
Nicholas Defelice
Mandar Sharma
Shengzhe Xu
Naren Ramakrishnan
89
2
0
25 Nov 2024
CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning
Duo Wu
Jiangming Wang
Yuan Meng
Yanning Zhang
Le Sun
Zhi Wang
524
0
0
25 Nov 2024
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
Bo Liu
K. Zou
Liming Zhan
Zexin Lu
Xiaoyu Dong
Yidi Chen
Chengqiang Xie
Jiannong Cao
Xiao-Ming Wu
Huazhu Fu
191
2
0
25 Nov 2024
AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset
Tobi Olatunji
Charles Nimo
A. Owodunni
Tassallah Abdullahi
Emmanuel Ayodele
...
Michael Best
Irfan Essa
Stephen E. Moore
Chris Fourie
Mercy Nyamewaa Asiedu
LM&MA
148
3
0
23 Nov 2024
ReXrank: A Public Leaderboard for AI-Powered Radiology Report Generation
Xiaoman Zhang
Hong-Yu Zhou
Xiaoli Yang
Oishi Banerjee
J. N. Acosta
Josh Miller
Ouwen Huang
Pranav Rajpurkar
LM&MA
173
5
0
22 Nov 2024
Evaluating LLM Prompts for Data Augmentation in Multi-label Classification of Ecological Texts
Anna Glazkova
Olga Zakharova
112
2
0
22 Nov 2024
Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains
Yurii Paniv
Artur Kiulian
Dmytro Chaplynskyi
M. Khandoga
Anton Polishko
Tetiana Bas
Guillermo Gabrielli
103
1
0
22 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
201
1
0
22 Nov 2024
PatentEdits: Framing Patent Novelty as Textual Entailment
Ryan Lee
Alexander Spangher
Xuezhe Ma
140
1
0
20 Nov 2024
Watermark under Fire: A Robustness Evaluation of LLM Watermarking
Jiacheng Liang
Zian Wang
Lauren Hong
Shouling Ji
Ting Wang
AAML
211
0
0
20 Nov 2024
Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control
Yunkee Chae
Eunsik Shin
Hwang Suntae
Seungryeol Paik
Kyogu Lee
113
1
0
20 Nov 2024
Human-In-the-Loop Software Development Agents
Wannita Takerngsaksiri
Jirat Pasuksmit
Patanamon Thongtanunam
Chakkrit Tantithamthavorn
Ruixiong Zhang
Fan Jiang
Jing Li
Evan Cook
Kun Chen
Ming Wu
LLMAG
169
6
0
19 Nov 2024
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
115
0
0
19 Nov 2024
CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Dongyoung Go
Taesun Whang
Chanhee Lee
Hwayeon Kim
Sunghoon Park
Seunghwan Ji
Dongchan Kim
Young-Bum Kim
LRM
527
1
0
19 Nov 2024
Membership Inference Attack against Long-Context Large Language Models
Zixiong Wang
Gaoyang Liu
Yang Yang
Chen Wang
147
1
0
18 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
89
8
0
18 Nov 2024
Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Md. Asif Haider
Ayesha Binte Mostofa
Sk. Sabit Bin Mosaddek
Anindya Iqbal
Toufique Ahmed
ALM
87
3
0
15 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
124
16
0
14 Nov 2024
Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data
Anum Afzal
Juraj Vladika
Gentrit Fazlija
Andrei Staradubets
Florian Matthes
RALM
66
0
0
13 Nov 2024
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Moran Yanuka
Assaf Ben-Kish
Yonatan Bitton
Idan Szpektor
Raja Giryes
VLM
152
3
0
13 Nov 2024
Ethical Concern Identification in NLP: A Corpus of ACL Anthology Ethics Statements
Antonia Karamolegkou
Sandrine Schiller Hansen
Ariadni Christopoulou
Filippos Stamatiou
Anne Lauscher
Anders Søgaard
54
0
0
12 Nov 2024
Knowledge-Augmented Multimodal Clinical Rationale Generation for Disease Diagnosis with Small Language Models
Shuai Niu
Jing Ma
Hongzhan Lin
Liang Bai
Zhihua Wang
Yida Xu
Yunya Song
Xian Yang
28
1
0
12 Nov 2024
The Inherent Adversarial Robustness of Analog In-Memory Computing
Corey Lammie
Julian Büchel
A. Vasilopoulos
Manuel Le Gallo
Abu Sebastian
AAML
117
2
0
11 Nov 2024
Evaluating Large Language Models on Financial Report Summarization: An Empirical Study
Xinqi Yang
Scott Zang
Yong Ren
Dingjie Peng
Zheng Wen
66
1
0
11 Nov 2024
Benchmarking LLMs' Judgments with No Gold Standard
Shengwei Xu
Yuxuan Lu
Grant Schoenebeck
Yuqing Kong
84
4
0
11 Nov 2024
Does This Summary Answer My Question? Modeling Query-Focused Summary Readers with Rational Speech Acts
Cesare Spinoso-Di Piano
Jackie Chi Kit Cheung
81
0
0
10 Nov 2024
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
Shruti Singh
Nandan Sarkar
Arman Cohan
94
1
0
08 Nov 2024
Tibyan Corpus: Balanced and Comprehensive Error Coverage Corpus Using ChatGPT for Arabic Grammatical Error Correction
Ahlam Alrehili
Areej Alhothali
51
0
0
07 Nov 2024
LLM-R: A Framework for Domain-Adaptive Maintenance Scheme Generation Combining Hierarchical Agents and RAG
Laifa Tao
Qixuan Huang
Xianjun Wu
Weiwei Zhang
Yunlong Wu
Bin Li
Chen Lu
Xingshuo Hai
83
0
0
07 Nov 2024
Bayesian Calibration of Win Rate Estimation with LLM Evaluators
Yicheng Gao
G. Xu
Zhe Wang
Arman Cohan
99
6
0
07 Nov 2024
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
Ming Cheng
Jiaying Gong
Chenhan Yuan
William A. Ingram
Edward A. Fox
Hoda Eldardiry
235
1
0
07 Nov 2024
Summarization of Opinionated Political Documents with Varied Perspectives
Nicholas Deas
Kathleen McKeown
60
1
0
06 Nov 2024
M3SciQA: A Multi-Modal Multi-Document Scientific QA Benchmark for Evaluating Foundation Models
Chuhan Li
Ziyao Shangguan
Yilun Zhao
Deyuan Li
Yongxu Liu
Arman Cohan
80
3
0
06 Nov 2024
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Ekaterina Artemova
Jason Samuel Lucas
Saranya Venkatraman
Jooyoung Lee
Sergei Tilga
Adaku Uchendu
Vladislav Mikhailov
DeLMO
MoE
156
8
0
06 Nov 2024
Examining Human-AI Collaboration for Co-Writing Constructive Comments Online
Farhana Shahid
Maximilian Dittgen
Mor Naaman
Aditya Vashistha
93
1
0
05 Nov 2024
Leveraging Large Language Models in Code Question Answering: Baselines and Issues
Georgy Andryushchenko
Vladimir Ivanov
Vladimir Makharev
Elizaveta Tukhtina
Aidar Valeev
ELM
63
2
0
05 Nov 2024
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
David Mayo
Christopher Wang
Asa Harbin
Abdulrahman Alabdulkareem
Albert Eaton Shaw
Boris Katz
Andrei Barbu
DiffM
105
2
0
05 Nov 2024
Context-Informed Machine Translation of Manga using Multimodal Large Language Models
Philip Lippmann
Konrad Skublicki
Joshua Tanner
Shonosuke Ishiwatari
Jie Yang
106
0
0
04 Nov 2024
Ontology Population using LLMs
Sanaz Saki Norouzi
Adrita Barua
Antrea Christou
Nikita Gautam
Andrew Eells
Pascal Hitzler
C. Shimizu
57
3
0
03 Nov 2024
One Arrow, Many Targets: Probing LLMs for Multi-Attribute Controllable Text Summarization
Tathagato Roy
Rahul Mishra
26
0
0
02 Nov 2024
Do LLMs Know to Respect Copyright Notice?
Jialiang Xu
Shenglan Li
Zhaozhuo Xu
Denghui Zhang
92
5
0
02 Nov 2024