Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,519 papers shown
Title
Conditioning LLMs to Generate Code-Switched Text
Maite Heredia
Gorka Labaka
Jeremy Barnes
A. Soroa
13
1
0
18 Feb 2025
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
Bosi Wen
Pei Ke
Yufei Sun
C. Wang
Xiaotao Gu
Jinfeng Zhou
Jie Tang
Hongning Wang
Minlie Huang
19
0
0
18 Feb 2025
Generating Text from Uniform Meaning Representation
Emma Markle
Reihaneh Iranmanesh
Shira Wein
50
0
0
17 Feb 2025
Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
Shintaro Ozaki
Kazuki Hayashi
Yusuke Sakai
Hidetaka Kamigaito
Katsuhiko Hayashi
Taro Watanabe
LRM
150
1
0
17 Feb 2025
A distributional simplicity bias in the learning dynamics of transformers
Riccardo Rende
Federica Gerace
Alessandro Laio
Sebastian Goldt
127
9
0
17 Feb 2025
ReviewEval: An Evaluation Framework for AI-Generated Reviews
Chavvi Kirtani
Madhav Krishan Garg
Tejash Prasad
Tanmay Singhal
Murari Mandal
Dhruv Kumar
117
1
0
17 Feb 2025
Idiosyncrasies in Large Language Models
Mingjie Sun
Yida Yin
Zhiqiu Xu
J. Zico Kolter
Zhuang Liu
119
7
0
17 Feb 2025
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
Chengyan Wu
Bolei Ma
Yang Liu
Zheyu Zhang
Ningyuan Deng
Yongqian Li
Baolan Chen
Yi Zhang
Yun Xue
Yun Xue
143
1
0
17 Feb 2025
Aligning Sentence Simplification with ESL Learner's Proficiency for Language Acquisition
Guanlin Li
Yuki Arase
Noel Crespi
78
1
0
17 Feb 2025
PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent
Jiateng Liu
Lin Ai
Zizhou Liu
Payam Karisani
Zheng Hui
May Fung
Preslav Nakov
Julia Hirschberg
Heng Ji
DiffM
160
5
0
17 Feb 2025
TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking
Shahriar Kabir Nahin
R. N. Nandi
Sagor Sarker
Quazi Sarwar Muhtaseem
Md. Kowsher
Apu Chandraw Shill
Md Ibrahim
Mehadi Hasan Menon
Tareq Al Muntasir
Firoj Alam
185
0
0
16 Feb 2025
Preconditioned Inexact Stochastic ADMM for Deep Model
Shenglong Zhou
Ouya Wang
Ziyan Luo
Yongxu Zhu
Geoffrey Ye Li
86
0
0
15 Feb 2025
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
Zhaoyi Zhou
Yuda Song
Andrea Zanette
ALM
158
0
0
14 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
348
7
0
12 Feb 2025
Bridging Brain Signals and Language: A Deep Learning Approach to EEG-to-Text Decoding
Mostafa El Gedawy
Omnia Nabil
Omar Mamdouh
Mahmoud Nady
Nour Alhuda Adel
Ahmed Fares
78
0
0
11 Feb 2025
A Large-Scale Benchmark for Vietnamese Sentence Paraphrases
Sang Quang Nguyen
Kiet Van Nguyen
147
0
0
11 Feb 2025
Tractable Transformers for Flexible Conditional Generation
Hoang Trung-Dung
Xuejie Liu
Dayuan Zhao
Mathias Niepert
Yitao Liang
Guy Van den Broeck
73
0
0
11 Feb 2025
Unsupervised Translation of Emergent Communication
Ido Levy
Orr Paradise
Boaz Carmeli
Ron Meir
S. Goldwasser
Yonatan Belinkov
450
0
0
11 Feb 2025
Who Taught You That? Tracing Teachers in Model Distillation
Somin Wadhwa
Chantal Shaib
Silvio Amir
Byron C. Wallace
261
2
0
10 Feb 2025
LegalViz: Legal Text Visualization by Text To Diagram Generation
Eri Onami
Taiki Miyanishi
Koki Maeda
Shuhei Kurita
AILaw
117
1
0
10 Feb 2025
Learning to Substitute Words with Model-based Score Ranking
Hongye Liu
Ricardo Henao
166
0
0
09 Feb 2025
On Memory Construction and Retrieval for Personalized Conversational Agents
Zhuoshi Pan
Qianhui Wu
Huiqiang Jiang
Xufang Luo
Hao Cheng
...
Yue Yang
Chin-Yew Lin
H. Vicky Zhao
Lili Qiu
Jianfeng Gao
RALM
156
7
0
08 Feb 2025
AnyEdit: Edit Any Knowledge Encoded in Language Models
Houcheng Jiang
Sihang Li
Ningyu Zhang
Guojun Ma
Mingyang Wan
Xiang Wang
Xiangnan He
Tat-Seng Chua
KELM
135
19
0
08 Feb 2025
Toward Copyright Integrity and Verifiability via Multi-Bit Watermarking for Intelligent Transportation Systems
Yihao Wang
Lingxiao Li
Yifan Tang
Ru Zhang
Jianyi Liu
49
1
0
08 Feb 2025
Enhancing Knowledge Graph Construction: Evaluating with Emphasis on Hallucination, Omission, and Graph Similarity Metrics
Hussam Ghanem
C. Cruz
127
0
0
07 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Jing Yang
Max Glockner
Anderson de Rezende Rocha
Iryna Gurevych
LRM
153
1
0
07 Feb 2025
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot
Xuejiao Zhao
Siyan Liu
Su-Yin Yang
Chunyan Miao
274
14
0
06 Feb 2025
MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation
Qinhan Yu
Zhiyou Xiao
Binghui Li
Zhengren Wang
Chong Chen
Wentao Zhang
RALM
VLM
251
1
0
06 Feb 2025
Should Code Models Learn Pedagogically? A Preliminary Evaluation of Curriculum Learning for Real-World Software Engineering Tasks
Kyi Shin Khant
Hong Yi Lin
Patanamon Thongtanunam
ELM
207
0
0
06 Feb 2025
Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond
Mardhiyah Sanni
Tassallah Abdullahi
Devendra D. Kayande
Emmanuel Ayodele
Naome A. Etori
...
Chibuzor Okocha
L. Ismaila
Folafunmi Omofoye
Boluwatife A. Adewale
Tobi Olatunji
166
1
0
06 Feb 2025
Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales
Zhen Qian
Xiuzhen Zhang
Xiaofei Xu
Xiwei Xu
LRM
60
0
0
05 Feb 2025
The Cake that is Intelligence and Who Gets to Bake it: An AI Analogy and its Implications for Participation
Martin Mundt
Anaelia Ovalle
Felix Friedrich
A Pranav
Subarnaduti Paul
Manuel Brack
Kristian Kersting
William Agnew
713
0
0
05 Feb 2025
Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation
Atharva Mangeshkumar Agrawal
Rutika Pandurang Shinde
Vasanth Kumar Bhukya
Ashmita Chakraborty
Sagar Bharat Shah
Tanmay Shukla
Sree Pradeep Kumar Relangi
Nilesh Mutyam
LM&MA
AI4MH
145
0
0
04 Feb 2025
Preference Leakage: A Contamination Problem in LLM-as-a-judge
Dawei Li
Renliang Sun
Yue Huang
Ming Zhong
Bohan Jiang
Jiawei Han
Wei Wei
Wei Wang
Huan Liu
174
30
0
03 Feb 2025
Using LLM-Based Approaches to Enhance and Automate Topic Labeling
Trishia Khandelwal
42
0
0
03 Feb 2025
Classic4Children: Adapting Chinese Literary Classics for Children with Large Language Model
Jiali Chen
Xusen Hei
Yuqi Xue
Zihan Wu
Jiayuan Xie
Yi Cai
AI4Ed
151
2
0
03 Feb 2025
Evaluating Small Language Models for News Summarization: Implications and Factors Influencing Performance
Borui Xu
Yao Chen
Zeyi Wen
Weiguo Liu
Bingsheng He
186
2
0
02 Feb 2025
Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications
Yixin Wu
Ziqing Yang
Yun Shen
Michael Backes
Yang Zhang
79
1
0
02 Feb 2025
Multilingual State Space Models for Structured Question Answering in Indic Languages
A. Vats
Rahul Raja
Mrinal Mathur
Vinija Jain
Aman Chadha
169
1
0
01 Feb 2025
SoK: Towards Effective Automated Vulnerability Repair
Ying Li
Faysal hossain shezan
Bomin wei
Gang Wang
Yuan Tian
201
2
0
31 Jan 2025
Inkspire: Supporting Design Exploration with Generative AI through Analogical Sketching
David Chuan-En Lin
Hyeonsu B Kang
Nikolas Martelaro
A. Kittur
Yan-Ying Chen
Matthew K. Hong
160
3
0
30 Jan 2025
A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
Wiradee Imrattanatrai
Masaki Asada
Kimihiro Hasegawa
Zhi-Qi Cheng
Ken Fukuda
Teruko Mitamura
VGen
129
0
0
30 Jan 2025
Fake News Detection After LLM Laundering: Measurement and Explanation
Rupak Kumar Das
Jonathan Dodge
187
1
0
29 Jan 2025
Hybrid Graphs for Table-and-Text based Question Answering using LLMs
Ankush Agarwal
Ganesh S
Chaitanya Devaguptapu
LMTD
105
1
0
29 Jan 2025
mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
Nishat Raihan
Antonios Anastasopoulos
Marcos Zampieri
ELM
128
8
0
28 Jan 2025
Learning to Summarize from LLM-generated Feedback
Hwanjun Song
Taewon Yun
Yuho Lee
Jihwan Oh
Gihun Lee
Jason (Jinglun) Cai
Hang Su
225
10
0
28 Jan 2025
Speech Translation Refinement using Large Language Models
Huaixia Dou
Xinyu Tian
Xinglin Lyu
Jie Zhu
Junhui Li
Lifan Guo
466
0
0
28 Jan 2025
DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images
Sami Baral
L. Lucy
Ryan Knight
Alice Ng
Luca Soldaini
Neil T. Heffernan
Kyle Lo
120
4
0
28 Jan 2025
SedarEval: Automated Evaluation using Self-Adaptive Rubrics
Zhiyuan Fan
Weinong Wang
Xing Wu
Debing Zhang
73
2
0
28 Jan 2025
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators
Dingkang Yang
Dongling Xiao
Jinjie Wei
Mingcheng Li
Zhaoyu Chen
Ke Li
Li Zhang
HILM
167
6
0
28 Jan 2025
Previous
1
2
3
...
9
10
11
...
69
70
71
Next