1904.09675
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,519 papers shown
Title
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
90
3
0
29 May 2024
Can Automatic Metrics Assess High-Quality Translations?
Sweta Agrawal
António Farinhas
Ricardo Rei
André F. T. Martins
82
9
0
28 May 2024
Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints
Aryo Pradipta Gema
Chaeeun Lee
Pasquale Minervini
Luke Daines
T. I. Simpson
Beatrice Alex
80
1
0
28 May 2024
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations
Yi-Pei Chen
Noriki Nishida
Hideki Nakayama
Yuji Matsumoto
LLMAG
93
15
0
28 May 2024
Video Enriched Retrieval Augmented Generation Using Aligned Video Captions
Kevin Dela Rosa
60
5
0
27 May 2024
QUB-Cirdan at "Discharge Me!": Zero shot discharge letter generation by open-source LLM
Rui Guo
Greg Farnan
Niall McLaughlin
Barry Devereux
40
4
0
27 May 2024
UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models
Quan Van Nguyen
Huy Quang Pham
Dan Quang Tran
Thang Kien-Bao Nguyen
Nhat-Hao Nguyen-Dang
Bao-Thien Nguyen-Tat
MedIm
67
2
0
27 May 2024
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Robert Wolfe
Isaac Slaughter
Bin Han
Bingbing Wen
Yiwei Yang
...
Bernease Herman
E. Brown
Zening Qu
Nicholas Weber
Bill Howe
107
8
0
27 May 2024
Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
Peiran Yao
Denilson Barbosa
ELM
87
7
0
26 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
79
4
0
26 May 2024
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text
Han Yu
Peikun Guo
Akane Sano
80
19
0
26 May 2024
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling
Chenhao Zhang
Renhao Li
Minghuan Tan
Min Yang
Jingwei Zhu
Di Yang
Jiahao Zhao
Guancheng Ye
Chengming Li
Xiping Hu
133
29
0
26 May 2024
Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data
Yuhao Chen
Zhimu Wang
Bo Wen
F. Zulkernine
ELM
LM&MA
AI4MH
25
4
0
25 May 2024
Generating clickbait spoilers with an ensemble of large language models
M. Woźny
Mateusz Lango
61
1
0
25 May 2024
SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
Kun Zhao
Bohao Yang
Chen Tang
Chenghua Lin
Liang Zhan
79
5
0
24 May 2024
Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
Pranab Sahoo
Ayush Kumar Singh
Sriparna Saha
Aman Chadha
S. Mondal
59
3
0
24 May 2024
Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Jonas Becker
Jan Philip Wahle
Bela Gipp
Terry Ruas
122
11
0
24 May 2024
Detection and Positive Reconstruction of Cognitive Distortion sentences: Mandarin Dataset and Evaluation
Shuya Lin
Yuxiong Wang
Jonathan Dong
Shiguang Ni
58
2
0
24 May 2024
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Abbas Ghaddar
David Alfonso-Hermelo
Philippe Langlais
Mehdi Rezagholizadeh
Boxing Chen
Prasanna Parthasarathi
71
0
0
24 May 2024
A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns
Asaf Yehudai
Taelin Karidi
Gabriel Stanovsky
Ariel Goldstein
Omri Abend
86
1
0
23 May 2024
Unveiling the Achilles' Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
Yiming Chen
Chen Zhang
Danqing Luo
L. F. D’Haro
R. Tan
Haizhou Li
AAML
ELM
89
3
0
23 May 2024
Towards Transferable Attacks Against Vision-LLMs in Autonomous Driving with Typography
N. Chung
Sensen Gao
Tuan-Anh Vu
Jie M. Zhang
Aishan Liu
Yun Lin
Jin Song Dong
Qi Guo
AAML
103
11
0
23 May 2024
How Many Bytes Can You Take Out Of Brain-To-Text Decoding?
Richard Antonello
Nihita Sarma
Jerry Tang
Jiaru Song
Alexander G. Huth
75
1
0
22 May 2024
Trajectory Volatility for Out-of-Distribution Detection in Mathematical Reasoning
Yiming Wang
Pei Zhang
Baosong Yang
Derek F. Wong
Zhuosheng Zhang
Rui Wang
OODD
44
8
0
22 May 2024
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Cyril Chhun
Fabian M. Suchanek
Chloé Clavel
LRM
114
18
0
22 May 2024
DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning
Zijian Zhou
Xiaoqiang Lin
Xinyi Xu
Alok Prakash
Daniela Rus
K. H. Low
72
4
0
22 May 2024
RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts
Yuelyu Ji
Zhuochun Li
Rui Meng
Sonish Sivarajkumar
Yanshan Wang
Zeshui Yu
Hui Ji
Yushui Han
Hanyu Zeng
Daqing He
70
24
0
21 May 2024
OLAPH: Improving Factuality in Biomedical Long-form Question Answering
Minbyul Jeong
Hyeon Hwang
Chanwoong Yoon
Taewhoo Lee
Jaewoo Kang
MedIm
HILM
LM&MA
123
12
0
21 May 2024
The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG)
Yucheng Cai
Si Chen
Yi Huang
Junlan Feng
Zhijian Ou
105
2
0
21 May 2024
A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data
Xinyi Wang
Grazziela Figueredo
Ruizhe Li
Wei Emma Zhang
Weitong Chen
Xin Chen
MedIm
ViT
112
2
0
21 May 2024
Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging
Xiaobo Liang
Haoke Zhang
Helan hu
Juntao Li
Jun Xu
Min Zhang
ALM
77
3
0
20 May 2024
Multiple-Choice Questions are Efficient and Robust LLM Evaluators
Ziyin Zhang
Zhaokun Jiang
Lizhen Xu
Hong-ping Hao
Rui Wang
100
19
0
20 May 2024
WisPerMed at BioLaySumm: Adapting Autoregressive Large Language Models for Lay Summarization of Scientific Articles
T. M. G. Pakull
Hendrik Damm
Ahmad Idrissi-Yaghir
Henning Schafer
Peter A. Horn
Christoph M. Friedrich
61
2
0
20 May 2024
Cyber Risks of Machine Translation Critical Errors: Arabic Mental Health Tweets as a Case Study
Hadeel Saadany
Ashraf Tantawy
Constantin Orasan
68
1
0
19 May 2024
MICap: A Unified Model for Identity-aware Movie Descriptions
Haran Raajesh
Naveen Reddy Desanur
Zeeshan Khan
Makarand Tapaswi
74
4
0
19 May 2024
WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert Selection, and Priming Techniques on MIMIC-IV
Hendrik Damm
T. M. G. Pakull
Bahadir Eryilmaz
Helmut Becker
Ahmad Idrissi-Yaghir
Henning Schafer
Sergej Schultenkämper
Christoph M. Friedrich
69
3
0
18 May 2024
Prompt Exploration with Prompt Regression
Michael Feffer
Ronald Xu
Yuekai Sun
Mikhail Yurochkin
82
0
0
17 May 2024
Leveraging Discourse Structure for Extractive Meeting Summarization
Virgile Rennard
Guokan Shang
Michalis Vazirgiannis
Julie Hunter
124
1
0
17 May 2024
Automated Radiology Report Generation: A Review of Recent Advances
Phillip Sloan
Philip Clatworthy
Edwin Simpson
Majid Mirmehdi
81
21
0
17 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
111
79
0
17 May 2024
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi
Zeming Liu
Li Du
Yuxuan Wang
Hongru Wang
Yuhang Guo
Tong Ruan
Jie Xu
Shaoting Zhang
LM&MA
ELM
100
2
0
17 May 2024
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset
Jie Zhu
Junhui Li
Yalong Wen
Lifan Guo
ELM
ALM
77
8
0
17 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
130
21
0
16 May 2024
DocuMint: Docstring Generation for Python using Small Language Models
Bibek Poudel
Adam Cook
Sekou Traore
Shelah Ameli
ALM
64
1
0
16 May 2024
Red Teaming Language Models for Contradictory Dialogues
Xiaofei Wen
Bangzheng Li
Tenghao Huang
Muhao Chen
35
0
0
16 May 2024
ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset
Johannes Ruckert
Louise Bloch
Raphael Brüngel
Ahmad Idrissi-Yaghir
Henning Schafer
...
A. G. S. D. Herrera
Henning Müller
Peter A. Horn
F. Nensa
Christoph M. Friedrich
84
33
0
16 May 2024
SciQAG: A Framework for Auto-Generated Science Question Answering Dataset with Fine-grained Evaluation
Yuwei Wan
Yixuan Liu
Aswathy Ajith
Clara Grazian
B. Hoex
Wenjie Zhang
Chunyu Kit
Tong Xie
Ian Foster
95
10
0
16 May 2024
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation
Alex G. Kim
Keonwoo Kim
Sangwon Yoon
ELM
57
7
0
16 May 2024
Mitigating Text Toxicity with Counterfactual Generation
Milan Bhan
Jean-Noel Vittaut
Nina Achache
Victor Legrand
Nicolas Chesneau
A. Blangero
Juliette Murris
Marie-Jeanne Lesot
MedIm
215
0
0
16 May 2024
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge
Andong Wang
Bo Wu
Sunli Chen
Zhenfang Chen
Haotian Guan
Wei-Ning Lee
Li Erran Li
Chuang Gan
LRM
RALM
103
19
0
15 May 2024