ResearchTrend.AI
BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,519 papers shown
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla
Deeparghya Dutta Barua
Md Sakib Ul Rahman Sourove
Md Fahim
Fabiha Haider
Fariha Tanjim Shifat
Md Tasmim Rahman Adib
Anam Borhan Uddin
Md Farhan Ishmam
Md Farhad Alam
71
0
0
19 Oct 2024
Diversity Explains Inference Scaling Laws: Through a Case Study of Minimum Bayes Risk Decoding
Hidetaka Kamigaito
Hiroyuki Deguchi
Yusuke Sakai
Katsuhiko Hayashi
Taro Watanabe
120
0
0
19 Oct 2024
Cross-Document Event-Keyed Summarization
William Walden
Pavlo Kuchmiichuk
Alexander Martin
Chihsheng Jin
Angela Cao
Claire Sun
Curisia Allen
Aaron Steven White
RALM
53
0
0
18 Oct 2024
Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting Summarization
Frederic Kirstein
Terry Ruas
Robert Kratel
Bela Gipp
42
3
0
18 Oct 2024
CAPE: A Chinese Dataset for Appraisal-based Emotional Generation using Large Language Models
June M. Liu
He Cao
Renliang Sun
Rui Wang
Yu Li
Jiaxing Zhang
97
0
0
18 Oct 2024
Electrocardiogram-Language Model for Few-Shot Question Answering with Meta Learning
Jialu Tang
Tong Xia
Yuan Lu
Cecilia Mascolo
Aaqib Saeed
AI4MH
100
3
0
18 Oct 2024
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph
Maitreya Prafulla Chitale
Uday Bindal
Rajakrishnan Rajkumar
Rahul Mishra
106
1
0
18 Oct 2024
Enabling Scalable Evaluation of Bias Patterns in Medical LLMs
Hamed Fayyaz
Raphael Poulain
Rahmatollah Beheshti
104
2
0
18 Oct 2024
Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles
Xiao Pu
Tianxing He
Xiaojun Wan
VLM
84
3
0
17 Oct 2024
Measuring and Modifying the Readability of English Texts with GPT-4
Sean Trott
Pamela D. Rivière
54
3
0
17 Oct 2024
Generating Signed Language Instructions in Large-Scale Dialogue Systems
Mert Inan
Katherine Atwell
Anthony Sicilia
Lorna C. Quandt
Malihe Alikhani
SLR
69
2
0
17 Oct 2024
Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland
Luca Rolshoven
Vishvaksenan Rasiah
Srinanda Brügger Bose
Matthias Stürmer
Joel Niklaus
ELM AILaw
77
2
0
17 Oct 2024
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Aryan Shrivastava
Jessica Hullman
Max Lamparth
80
6
0
17 Oct 2024
Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Yun-Yen Chuang
Hung-Min Hsu
Kevin Lin
Chen-Sheng Gu
Ling Zhen Li
Ray-I Chang
Hung-yi Lee
DiffM VLM
66
1
0
17 Oct 2024
A Little Human Data Goes A Long Way
Dhananjay Ashok
Jonathan May
SyDa
120
4
0
17 Oct 2024
Disentangling Likes and Dislikes in Personalized Generative Explainable Recommendation
Ryotaro Shimizu
Takashi Wada
Yu Wang
Johannes Kruse
Sean O'Brien
...
Yuya Yoshikawa
Yuki Saito
Fugee Tsung
M. Goto
Julian McAuley
60
0
0
17 Oct 2024
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
271
3
0
17 Oct 2024
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
Lingxiao Luo
Bingda Tang
Xuanzhong Chen
Rong Han
Ting Chen
VLM
88
3
0
16 Oct 2024
ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
Jingming Zhuo
Shanghang Zhang
Xinyu Fang
Haodong Duan
Dahua Lin
Kai Chen
80
30
0
16 Oct 2024
Evaluation of Attribution Bias in Retrieval-Augmented Large Language Models
Amin Abolghasemi
Leif Azzopardi
Seyyed Hadi Hashemi
Maarten de Rijke
Suzan Verberne
74
1
0
16 Oct 2024
An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Junjie Chen
Weihang Su
Zhumin Chu
Haitao Li
Qinyao Ai
Yiqun Liu
Min Zhang
Shaoping Ma
63
3
0
16 Oct 2024
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
Xiaonan Jing
Srinivas Billa
Danny Godbout
HILM
125
0
0
16 Oct 2024
Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models
Mengze Hong
Yuanfeng Song
Di Jiang
Lu Wang
Zichang Guo
Yuanqin He
Zhiyang Su
Qing Li
81
2
0
16 Oct 2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Genta Indra Winata
Frederikus Hudi
Patrick Amadeus Irawan
David Anugraha
Rifki Afina Putri
...
Alham Fikri Aji
Taro Watanabe
Derry Wijaya
Alice Oh
Chong-Wah Ngo
CoGe
203
16
0
16 Oct 2024
On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs
Herun Wan
Minnan Luo
Zhixiong Su
Guang Dai
Xiang Zhao
DeLMO
110
1
0
16 Oct 2024
Towards Realistic Evaluation of Commit Message Generation by Matching Online and Offline Settings
Petr Tsvetkov
Aleksandra V. Eliseeva
Danny Dig
A. Bezzubov
Yaroslav Golubev
T. Bryksin
Yaroslav Zharov
56
0
0
15 Oct 2024
Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models
Sahar Iravani
Tim O. F. Conrad
LMTD
71
0
0
15 Oct 2024
Data Quality Control in Federated Instruction-tuning of Large Language Models
Yaxin Du
Guangyi Liu
Fengting Yuchi
W. Zhao
Jingjing Qu
Yanjie Wang
Siheng Chen
ALM FedML
129
2
0
15 Oct 2024
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
Fei Tang
Yongliang Shen
Hang Zhang
Zeqi Tan
Wenqi Zhang
Guiyang Hou
Kaitao Song
Weiming Lu
Yueting Zhuang
124
0
0
15 Oct 2024
BookWorm: A Dataset for Character Description and Analysis
Argyrios Papoudakis
Mirella Lapata
Frank Keller
54
2
0
14 Oct 2024
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Peng Xia
Siwei Han
Shi Qiu
Yiyang Zhou
Zhaoyang Wang
...
Chenhang Cui
Mingyu Ding
Linjie Li
Lijuan Wang
Huaxiu Yao
159
16
0
14 Oct 2024
Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs
Haozhen Zhang
Tao Feng
Jiaxuan You
AI4TS RALM
128
5
0
14 Oct 2024
4-LEGS: 4D Language Embedded Gaussian Splatting
Gal Fiebelman
Tamir Cohen
Ayellet Morgenstern
Peter Hedman
Hadar Averbuch-Elor
3DGS
146
3
0
14 Oct 2024
Can We Predict Performance of Large Models across Vision-Language Tasks?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
128
0
0
14 Oct 2024
MARS: Multilingual Aspect-centric Review Summarisation
Sandeep Sricharan Mukku
Abinesh Kanagarajan
Chetan Aggarwal
Promod Yenigalla
41
0
0
13 Oct 2024
Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning
Pengfei Jin
Peng Shu
Sekeun Kim
Qing Xiao
S. Song
Cheng Chen
Tianming Liu
Xiang Li
Quanzheng Li
102
1
0
13 Oct 2024
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs
Yijie Li
Yuan Sun
ELM
60
1
0
13 Oct 2024
GPTON: Generative Pre-trained Transformers enhanced with Ontology Narration for accurate annotation of biological data
Rongbin Li
Wenbo Chen
Jinbo Li
Hanwen Xing
Hua Xu
Zhao Li
W. Zheng
LM&MA
50
0
0
12 Oct 2024
SciGisPy: a Novel Metric for Biomedical Text Simplification via Gist Inference Score
Chen Lyu
Gabriele Pergola
58
3
0
12 Oct 2024
Quebec Automobile Insurance Question-Answering With Retrieval-Augmented Generation
David Beauchemin
Zachary Gagnon
Richard Khoury
AILaw
47
1
0
12 Oct 2024
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English
T. Y. S. S. Santosh
Cornelius Weiss
Matthias Grabmair
AILaw ELM
99
2
0
12 Oct 2024
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?
HyoJung Han
Akiko Eriguchi
Haoran Xu
Hieu T. Hoang
Marine Carpuat
Huda Khayrallah
VLM
89
3
0
12 Oct 2024
Audio Description Generation in the Era of LLMs and VLMs: A Review of Transferable Generative AI Technologies
Yingqiang Gao
Lukas Fischer
Alexa Lintner
Sarah Ebling
69
1
0
11 Oct 2024
SocialGaze: Improving the Integration of Human Social Norms in Large Language Models
Anvesh Rao Vijjini
Rakesh R Menon
Jiayi Fu
Shashank Srivastava
Snigdha Chaturvedi
ALM
69
0
0
11 Oct 2024
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles
Dom Nasrabadi
87
1
0
11 Oct 2024
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM LRM
115
0
0
11 Oct 2024
SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models
H. Xia
Zhengbang Yang
Junbo Zou
Rhys Tracy
Yuqing Wang
...
Xun Shao
Zhuoqing Xie
Yuan-fang Wang
Weining Shen
Hanjie Chen
ReLM LRM ELM
120
4
0
11 Oct 2024
Private Language Models via Truncated Laplacian Mechanism
Tianhao Huang
Tao Yang
Ivan Habernal
Lijie Hu
Di Wang
64
1
0
10 Oct 2024
Toward Relieving Clinician Burden by Automatically Generating Progress Notes using Interim Hospital Data
Sarvesh Soni
Dina Demner-Fushman
34
1
0
10 Oct 2024
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)
Abhijit Mishra
Shreya Shukla
Jose Torres
Jacek Gwizdka
Shounak Roychowdhury
125
7
0
10 Oct 2024