Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,519 papers shown
Title
AV-EmoDialog: Chat with Audio-Visual Users Leveraging Emotional Cues
Se Jin Park
Yeonju Kim
Hyeongseop Rha
Bella Godiva
Y. Ro
71
1
0
23 Dec 2024
LegalAgentBench: Evaluating LLM Agents in Legal Domain
Haoyang Li
Junjie Chen
Jingli Yang
Qingyao Ai
Wei Jia
...
Guozhi Yuan
Yiran Hu
Wuyue Wang
Yang Liu
Minlie Huang
LLMAG
AILaw
ELM
117
17
0
23 Dec 2024
Investigating Length Issues in Document-level Machine Translation
Ziqian Peng
Rachel Bawden
François Yvon
106
2
0
23 Dec 2024
From General to Specific: Tailoring Large Language Models for Personalized Healthcare
Ruize Shi
Hong Huang
Wei Zhou
Kehan Yin
Kai Zhao
Yun Zhao
LM&MA
AI4MH
108
0
0
20 Dec 2024
Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization
Sahil Wadhwa
Chengtian Xu
Haoming Chen
Aakash Mahalingam
Akankshya Kar
Divya Chaudhary
95
1
0
19 Dec 2024
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimization
Yue Zhang
Liqiang Jing
Vibhav Gogate
205
5
0
19 Dec 2024
On the Compression of Language Models for Code: An Empirical Study on CodeBERT
Giordano dÁloisio
Luca Traini
Federica Sarro
A. Marco
87
1
0
18 Dec 2024
Mitigating Adversarial Attacks in LLMs through Defensive Suffix Generation
Minkyoung Kim
Yunha Kim
Hyeram Seo
Heejung Choi
Jiye Han
...
Hyoje Jung
Byeolhee Kim
Young-Hak Kim
Sanghyun Park
Tae Joon Jun
AAML
126
0
0
18 Dec 2024
PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological Counseling
Haojie Xie
Yirong Chen
Xiaofen Xing
Jingkai Lin
Xiangmin Xu
OffRL
144
5
0
18 Dec 2024
G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o
Tony Cheng Tong
Sirui He
Z. Shao
Dit-Yan Yeung
106
3
0
18 Dec 2024
LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning
Yansheng Mao
Jiaqi Li
Fanxu Meng
Jing Xiong
Zilong Zheng
Muhan Zhang
LLMAG
RALM
169
1
0
18 Dec 2024
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling
William Jongwon Han
Chaojing Duan
M. Rosenberg
Emerson Liu
Ding Zhao
138
1
0
18 Dec 2024
Towards Automatic Evaluation for Image Transcreation
Simran Khanuja
Vivek Iyer
Claire He
Graham Neubig
ViT
144
2
0
18 Dec 2024
A Survey of Calibration Process for Black-Box LLMs
Liangru Xie
Hui Liu
Jingying Zeng
Xianfeng Tang
Yan Han
Chen Luo
Jing Huang
Zhen Li
Suhang Wang
Qi He
142
4
0
17 Dec 2024
PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization
Yun Luo
Yingjie Li
Xiangkun Hu
Qinglin Qi
Fang Guo
Qipeng Guo
Zheng Zhang
Yue Zhang
123
0
0
17 Dec 2024
Precise Length Control in Large Language Models
Bradley Butcher
Michael O'Keefe
James Titchener
KELM
110
6
0
16 Dec 2024
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Mengna Zhu
Kaisheng Zeng
Mao Wang
Kaiming Xiao
Lei Hou
Hongbin Huang
Juanzi Li
513
1
0
16 Dec 2024
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
Mohammad Aflah Khan
Neemesh Yadav
Sarah Masud
Md. Shad Akhtar
169
0
0
16 Dec 2024
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
Xuanliang Zhang
Dingzirui Wang
Baoxin Wang
Longxu Dou
Xinyuan Lu
Keyan Xu
Dayong Wu
Qingfu Zhu
Wanxiang Che
LMTD
509
2
0
16 Dec 2024
ACE-
M
3
M^3
M
3
: Automatic Capability Evaluator for Multimodal Medical Models
Xiechi Zhang
Shunfan Zheng
Linlin Wang
Gerard de Melo
Zhu Cao
Xiaoling Wang
Liang He
ELM
149
0
0
16 Dec 2024
Beyond Discrete Personas: Personality Modeling Through Journal Intensive Conversations
Sayantan Pal
Souvik Das
Rohini Srihari
143
1
0
15 Dec 2024
LAW: Legal Agentic Workflows for Custody and Fund Services Contracts
William Watson
Nicole Cho
Nishan Srishankar
Zhen Zeng
Lucas Cecchi
Daniel Scott
S. Siddagangappa
Rachneet Kaur
T. Balch
Manuela Veloso
AILaw
110
0
0
15 Dec 2024
Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track
D. Gupta
Dina Demner-Fushman
LM&MA
103
1
0
15 Dec 2024
Semantic Steganography: A Framework for Robust and High-Capacity Information Hiding using Large Language Models
Minhao Bai
Jinshuai Yang
Kaiyi Pang
Yongfeng Huang
Yue Gao
81
1
0
15 Dec 2024
An Enhanced Text Compression Approach Using Transformer-based Language Models
C. M. Rahman
Mahbub E Sobhani
Anika Tasnim Rodela
Swakkhar Shatabda
129
1
0
15 Dec 2024
Can LLMs Help Create Grammar?: Automating Grammar Creation for Endangered Languages with In-Context Learning
Piyapath T Spencer
Nanthipat Kongborrirak
107
1
0
14 Dec 2024
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation
Qilong Wu
Xiaoneng Xiang
Hejia Huang
Xuan Wang
Yeo Wei Jie
Ranjan Satapathy
Ricardo Shirota Filho
Bharadwaj Veeravalli
143
3
0
14 Dec 2024
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
Poulami Ghosh
Raj Dabre
Pushpak Bhattacharyya
AAML
120
0
0
14 Dec 2024
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Changan Chen
Juze Zhang
S. K. Lakshmikanth
Yusu Fang
Ruizhi Shao
Gordon Wetzstein
L. Fei-Fei
Ehsan Adeli
VGen
133
5
0
13 Dec 2024
Neptune: The Long Orbit to Benchmarking Long Video Understanding
Arsha Nagrani
Ruotong Wang
Ramin Mehran
Rachel Hornung
N. B. Gundavarapu
...
Boqing Gong
Cordelia Schmid
Mikhail Sirotenko
Yukun Zhu
Tobias Weyand
179
8
0
12 Dec 2024
Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy
Alistair Plum
Tharindu Ranasinghe
Christoph Purschke
118
3
0
12 Dec 2024
SweetieChat: A Strategy-Enhanced Role-playing Framework for Diverse Scenarios Handling Emotional Support Agent
Jing Ye
Lu Xiang
Yaping Zhang
Chengqing Zong
160
6
0
11 Dec 2024
DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization
Phan Phuong Mai Chau
Souhail Bakkali
Antoine Doucet
128
0
0
11 Dec 2024
GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning
Yanjie Wang
Zhikang Zhang
Jue Wang
D. Fan
Zhenlin Xu
Linda Liu
Xiang Hao
Vimal Bhat
Xinyu Li
VLM
117
1
0
10 Dec 2024
CoMA: Compositional Human Motion Generation with Multi-modal Agents
Shanlin Sun
Gabriel De Araujo
Jiaqi Xu
S. Kevin Zhou
Hanwen Zhang
Ziheng Huang
Chenyu You
Xiaohui Xie
161
5
0
10 Dec 2024
QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization
Shiyue Zhang
David Wan
Arie Cattan
Ayal Klein
Ido Dagan
Joey Tianyi Zhou
126
0
0
10 Dec 2024
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation
Eunsu Kim
Juyoung Suk
Seungone Kim
Niklas Muennighoff
Dongkwan Kim
Alice Oh
ELM
188
1
0
10 Dec 2024
MuMu-LLaMA: Multi-modal Music Understanding and Generation via Large Language Models
Shansong Liu
Atin Sakkeer Hussain
Qilong Wu
Chenshuo Sun
Ying Shan
AuLLM
116
4
0
09 Dec 2024
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor
Jiali Chen
Xusen Hei
Yuqi Xue
Yuancheng Wei
Jiayuan Xie
Yi Cai
Qing Li
MLLM
LRM
137
7
0
08 Dec 2024
A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions
Ola Shorinwa
Zhiting Mei
Justin Lidard
Allen Z. Ren
Anirudha Majumdar
HILM
LRM
137
19
0
07 Dec 2024
Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System
Fang Zeng
Zhiliang Lyu
Quanzheng Li
Xiang Li
90
4
0
06 Dec 2024
Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
LM&MA
139
2
0
06 Dec 2024
Speech Recognition-based Feature Extraction for Enhanced Automatic Severity Classification in Dysarthric Speech
Yerin Choi
Jeehyun Lee
M. Koo
69
0
0
05 Dec 2024
Towards Understanding and Quantifying Uncertainty for Text-to-Image Generation
Gianni Franchi
Dat Nguyen Trong
Nacim Belkhir
Guoxuan Xia
Andrea Pilzer
UQLM
122
1
0
04 Dec 2024
Enhancing Trust in Large Language Models with Uncertainty-Aware Fine-Tuning
R. Krishnan
Piyush Khanna
Omesh Tickoo
HILM
116
1
0
03 Dec 2024
MLD-EA: Check and Complete Narrative Coherence by Introducing Emotions and Actions
Jinming Zhang
Yunfei Long
144
1
0
03 Dec 2024
Medchain: Bridging the Gap Between LLM Agents and Clinical Practice through Interactive Sequential Benchmarking
Jie Liu
Wenxuan Wang
Zizhan Ma
Guolin Huang
Yihang Su
Kao-Jung Chang
Wenting Chen
Haoliang Li
Linlin Shen
Michael R. Lyu
127
8
0
02 Dec 2024
SiTSE: Sinhala Text Simplification Dataset and Evaluation
Surangika Ranathunga
Rumesh Sirithunga
Himashi Rathnayake
Lahiru De Silva
Thamindu Aluthwala
Saman Peramuna
Ravi Shekhar
157
1
0
02 Dec 2024
OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?
Zhongfu Chen
Tingzhu Chen
Wenjun Zhang
Guangtao Zhai
175
4
0
02 Dec 2024
Efficient Learning Content Retrieval with Knowledge Injection
Batuhan Sariturk
Rabia Bayraktar
Merve Elmas Erdem
119
0
0
28 Nov 2024
Previous
1
2
3
...
11
12
13
...
69
70
71
Next