ResearchTrend.AI
BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown
Title
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference
João Monteiro
Étienne Marcotte
Pierre-Andre Noel
Valentina Zantedeschi
David Vázquez
Nicolas Chapados
Christopher Pal
Perouz Taslakian
77
5
0
23 Apr 2024
Aligning LLM Agents by Learning Latent Preference from User Edits
Ge Gao
Alexey Taymanov
Eduardo Salinas
Paul Mineiro
Dipendra Kumar Misra
LLMAG
94
31
0
23 Apr 2024
Advances and Open Challenges in Federated Learning with Foundation Models
Chao Ren
Han Yu
Hongyi Peng
Xiaoli Tang
Anran Li
...
A. Tan
Bo Zhao
Xiaoxiao Li
Zengxiang Li
Qiang Yang
FedML AIFin AI4CE
152
11
0
23 Apr 2024
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
Hang Hua
Jing Shi
Kushal Kafle
Simon Jenni
Daoan Zhang
John Collomosse
Scott D. Cohen
Jiebo Luo
CoGe VLM
92
12
0
23 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
212
61
0
23 Apr 2024
WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction
Barry Rubin
Ronald Xie
Steven Palayew
Patrick R. Lawler
Bo Wang
43
3
0
22 Apr 2024
AutoAD III: The Prequel -- Back to the Pixels
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGen DiffM
133
22
0
22 Apr 2024
Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction
Zheye Deng
Chunkit Chan
Weiqi Wang
Yuxi Sun
Wei Fan
Tianshi Zheng
Yauwai Yim
Yangqiu Song
LMTD RALM
97
15
0
22 Apr 2024
Protecting Your LLMs with Information Bottleneck
Zichuan Liu
Zefan Wang
Linjie Xu
Jinyu Wang
Lei Song
Tianchun Wang
Chunlin Chen
Wei Cheng
Jiang Bian
KELM AAML
116
18
0
22 Apr 2024
E-QGen: Educational Lecture Abstract-based Question Generation System
Mao-Siang Chen
An-Zi Yen
AI4Ed
71
2
0
21 Apr 2024
Movie101v2: Improved Movie Narration Benchmark
Zihao Yue
Yepeng Zhang
Ziheng Wang
Qin Jin
VGen
104
1
0
20 Apr 2024
LLMChain: Blockchain-based Reputation System for Sharing and Evaluating Large Language Models
Mouhamed Amine Bouchiha
Quentin Telnoff
Souhail Bakkali
R. Champagnat
Mourad Rabah
Mickael Coustaty
Y. Ghamri-Doudane
LRM
70
3
0
20 Apr 2024
Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works
Xinfeng Yuan
Siyu Yuan
Yuhan Cui
Tianhe Lin
Xintao Wang
Rui Xu
Jiangjie Chen
Deqing Yang
LLMAG
115
21
0
19 Apr 2024
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes
Asaf Yehudai
Elron Bandel
71
2
0
18 Apr 2024
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
Hang Hua
Yunlong Tang
Chenliang Xu
Jiebo Luo
VGen
114
28
0
18 Apr 2024
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair
Yusuke Sakai
Mana Makinae
Hidetaka Kamigaito
Taro Watanabe
103
5
0
18 Apr 2024
FedEval-LLM: Federated Evaluation of Large Language Models on Downstream Tasks with Collective Wisdom
Yuanqin He
Yan Kang
Lixin Fan
Qiang Yang
62
3
0
18 Apr 2024
RAM: Towards an Ever-Improving Memory System by Learning from Communications
Jiaqi Li
Xiaobo Wang
Wentao Ding
Zihao Wang
Yipeng Kang
Zixia Jia
Zilong Zheng
103
3
0
18 Apr 2024
Sequential Compositional Generalization in Multimodal Models
Semih Yagcioglu
Osman Batur İnce
Aykut Erdem
Erkut Erdem
Desmond Elliott
Deniz Yuret
78
1
0
18 Apr 2024
EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Zhengwei Tao
Xiancai Chen
Zhi Jin
Xiaoying Bai
Haiyan Zhao
Yiwei Lou
106
3
0
18 Apr 2024
Evaluating Span Extraction in Generative Paradigm: A Reflection on Aspect-Based Sentiment Analysis
Soyoung Yang
Won Ik Cho
40
0
0
17 Apr 2024
Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks
Eri Hosonuma
Taku Yamazaki
Takumi Miyoshi
Akihito Taya
Yuuki Nishiyama
K. Sezaki
DiffM
101
1
0
17 Apr 2024
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
Joonho Yang
Seunghyun Yoon
Byeongjeong Kim
Hwanhee Lee
HILM
114
7
0
17 Apr 2024
What's under the hood: Investigating Automatic Metrics on Meeting Summarization
Frederic Kirstein
Jan Philip Wahle
Terry Ruas
Bela Gipp
86
6
0
17 Apr 2024
Consistency Training by Synthetic Question Generation for Conversational Question Answering
Hamed Hematian Hemati
Hamid Beigy
48
2
1
17 Apr 2024
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Yuchi Wang
Shuhuai Ren
Rundong Gao
Linli Yao
Qingyan Guo
Kaikai An
Jianhong Bai
Xu Sun
DiffM VLM
106
9
0
16 Apr 2024
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Shusheng Xu
Wei Fu
Jiaxuan Gao
Wenjie Ye
Weiling Liu
Zhiyu Mei
Guangju Wang
Chao Yu
Yi Wu
162
165
0
16 Apr 2024
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Moshe Berchansky
Daniel Fleischer
Moshe Wasserblat
Peter Izsak
LRM
133
6
0
16 Apr 2024
MEEL: Multi-Modal Event Evolution Learning
Zhengwei Tao
Zhi Jin
Junqiang Huang
Xiancai Chen
Xiaoying Bai
Haiyan Zhao
Yifan Zhang
Chongyang Tao
75
1
0
16 Apr 2024
Disentangling Instructive Information from Ranked Multiple Candidates for Multi-Document Scientific Summarization
Pancheng Wang
Shasha Li
Dong Li
Kehan Long
Jintao Tang
Ting Wang
67
2
0
16 Apr 2024
Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Hengyuan Zhang
Yanru Wu
Dawei Li
Zacc Yang
Rui Zhao
Yong Jiang
Fei Tan
ALM
142
1
0
16 Apr 2024
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text Generation
Yue Zhou
Barbara Di Eugenio
Brian Ziebart
Lisa Sharp
Bing Liu
Nikolaos Agadakos
59
3
0
16 Apr 2024
MAD Speech: Measures of Acoustic Diversity of Speech
Matthieu Futeral
A. Agostinelli
Marco Tagliasacchi
Neil Zeghidour
Eugene Kharitonov
141
1
0
16 Apr 2024
Memory Sharing for Large Language Model based Agents
Hang Gao
Yongfeng Zhang
LLMAG
86
10
0
15 Apr 2024
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation
Juhwan Choi
Jungmin Yun
Kyohoon Jin
Youngbin Kim
90
6
0
15 Apr 2024
Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information
Kyubyung Chae
Jaepill Choi
Yohan Jo
Taesup Kim
HILM
90
2
0
15 Apr 2024
WikiSplit++: Easy Data Refinement for Split and Rephrase
Hayato Tsukagoshi
Tsutomu Hirao
Makoto Morishita
Katsuki Chousa
Ryohei Sasano
Koichi Takeda
81
1
0
13 Apr 2024
Towards Enhancing Health Coaching Dialogue in Low-Resource Settings
Yue Zhou
Barbara Di Eugenio
Brian Ziebart
Lisa Sharp
Bing Liu
Ben S. Gerber
Nikolaos Agadakos
S. Yadav
62
4
0
13 Apr 2024
Latent Guard: a Safety Framework for Text-to-image Generation
Runtao Liu
Ashkan Khakzar
Jindong Gu
Qifeng Chen
Philip Torr
Fabio Pizzati
96
31
0
11 Apr 2024
DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation
Anna C. Doris
Daniele Grandi
Ryan Tomich
Md Ferdous Alam
Hyunmin Cheong
Faez Ahmed
71
20
0
11 Apr 2024
Scalable Language Model with Generalized Continual Learning
Bohao Peng
Zhuotao Tian
Shu Liu
Mingchang Yang
Jiaya Jia
ALM CLL KELM
89
18
0
11 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
Rajarshi Haldar
Julia Hockenmaier
73
19
0
10 Apr 2024
Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition
Kehua Feng
Keyan Ding
Hongzhi Tan
Kede Ma
Zhihua Wang
...
Yuzhou Cheng
Ge Sun
Guozhou Zheng
Qiang Zhang
H. Chen
126
13
0
10 Apr 2024
AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents
Luca Gioacchini
G. Siracusano
D. Sanvito
Kiril Gashteovski
David Friede
Roberto Bifulco
Carolin (Haas) Lawrence
ELM LLMAG
79
14
0
09 Apr 2024
Rethinking How to Evaluate Language Model Jailbreak
Hongyu Cai
Arjun Arunasalam
Leo Y. Lin
Antonio Bianchi
Z. Berkay Celik
ALM
65
8
0
09 Apr 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
Zhuohao Yu
Chang Gao
Wenjin Yao
Yidong Wang
Zhengran Zeng
Wei Ye
Jindong Wang
Yue Zhang
Shikun Zhang
61
3
0
09 Apr 2024
Language-Independent Representations Improve Zero-Shot Summarization
V. Solovyev
Danni Liu
Jan Niehues
80
0
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
169
38
0
07 Apr 2024
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators
Yann Dubois
Balázs Galambosi
Percy Liang
Tatsunori Hashimoto
ALM
171
403
0
06 Apr 2024
EASSE-DE: Easier Automatic Sentence Simplification Evaluation for German
Regina Stodden
82
1
0
04 Apr 2024