ResearchTrend.AI
BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown
Title
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation
Tomohito Kasahara
Daisuke Kawahara
82
3
0
17 Oct 2023
Instructive Dialogue Summarization with Query Aggregations
Bin Wang
Zhengyuan Liu
Nancy F. Chen
89
3
0
17 Oct 2023
Towards reducing hallucination in extracting information from financial reports using Large Language Models
Bhaskarjit Sarmah
Tianjie Zhu
Dhagash Mehta
Stefano Pasquali
RALM
32
12
0
16 Oct 2023
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology
Odhran O'Donoghue
Aleksandar Shtedritski
John Ginger
Ralph Abboud
Ali E. Ghareeb
Justin Booth
Samuel G. Rodriques
93
21
0
16 Oct 2023
Generating Summaries with Controllable Readability Levels
Leonardo F. R. Ribeiro
Mohit Bansal
Markus Dreyer
152
19
0
16 Oct 2023
On Context Utilization in Summarization with Large Language Models
Mathieu Ravaut
Aixin Sun
Nancy F. Chen
Shafiq Joty
93
14
0
16 Oct 2023
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling
Jingcheng Deng
Liang Pang
Huawei Shen
Xueqi Cheng
RALM
103
12
0
16 Oct 2023
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning
Anirudh Som
Karan Sikka
Helen Gent
Ajay Divakaran
A. Kathol
D. Vergyri
64
4
0
16 Oct 2023
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Ziniu Li
Tian Xu
Yushun Zhang
Zhihang Lin
Yang Yu
Ruoyu Sun
Zhimin Luo
139
79
0
16 Oct 2023
Contextual Data Augmentation for Task-Oriented Dialog Systems
Dustin Axman
Avik Ray
Shubham Garg
Jing Huang
54
2
0
16 Oct 2023
Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts
Christina Chance
Da Yin
Dakuo Wang
Kai-Wei Chang
100
0
0
16 Oct 2023
Large Language Model Unlearning
Yuanshun Yao
Xiaojun Xu
Yang Liu
MU
137
148
0
14 Oct 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS, AILaw
91
4
0
13 Oct 2023
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Chen Zhang
L. F. D’Haro
Chengguang Tang
Ke Shi
Guohua Tang
Haizhou Li
ELM
72
11
0
13 Oct 2023
Human-in-the-loop Machine Translation with Large Language Model
Xinyi Yang
Runzhe Zhan
Derek F. Wong
Junchao Wu
Lidia S. Chao
82
3
0
13 Oct 2023
PerturbScore: Connecting Discrete and Continuous Perturbations in NLP
Linyang Li
Ke Ren
Yunfan Shao
Pengyu Wang
Xipeng Qiu
66
6
0
13 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models
Polina Zablotskaia
Misha Khalman
Rishabh Joshi
Livio Baldini Soares
Shoshana Jakobovits
Joshua Maynez
Shashi Narayan
47
4
0
12 Oct 2023
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Seungone Kim
Jamin Shin
Yejin Cho
Joel Jang
Shayne Longpre
...
Sangdoo Yun
Seongjin Shin
Sungdong Kim
James Thorne
Minjoon Seo
ALM, LM&MA, ELM
113
240
0
12 Oct 2023
Towards Better Evaluation of Instruction-Following: A Case-Study in Summarization
Ondrej Skopek
Rahul Aralikatte
Sian Gooding
Victor Carbune
ELM
102
19
0
12 Oct 2023
Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment
Boyang Xue
Weichao Wang
Hongru Wang
Fei Mi
Rui Wang
Yasheng Wang
Lifeng Shang
Xin Jiang
Qun Liu
Kam-Fai Wong
KELM, HILM
299
18
0
12 Oct 2023
Simplicity Level Estimate (SLE): A Learned Reference-Less Metric for Sentence Simplification
Liam Cripwell
Joël Legrand
Claire Gardent
58
13
0
12 Oct 2023
Context Compression for Auto-regressive Transformers with Sentinel Tokens
Siyu Ren
Qi Jia
Kenny Q. Zhu
51
11
0
12 Oct 2023
Fine-grained Conversational Decoding via Isotropic and Proximal Search
Yuxuan Yao
Han Wu
Qiling Xu
Linqi Song
76
1
0
12 Oct 2023
Low-Resource Clickbait Spoiling for Indonesian via Question Answering
Ni Putu Intan Maharani
Ayu Purwarianti
Alham Fikri Aji
70
2
0
12 Oct 2023
Training Generative Question-Answering on Synthetic Data Obtained from an Instruct-tuned Model
Kosuke Takahashi
Takahiro Omi
Kosuke Arima
Tatsuya Ishigaki
47
1
0
12 Oct 2023
A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
Yihan Wu
Zhengmian Hu
Junfeng Guo
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
89
23
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng Zhang
Yue Zhang
HILM, KELM
172
202
0
11 Oct 2023
PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model
Tatsuya Ide
Eiki Murata
Daisuke Kawahara
T. Yamazaki
Shengzhe Li
K. Shinzato
Toshinori Sato
LRM
102
2
0
11 Oct 2023
AutoAD II: The Sequel -- Who, When, and What in Movie Audio Description
Tengda Han
Max Bain
Arsha Nagrani
Gül Varol
Weidi Xie
Andrew Zisserman
VGen, DiffM
77
39
0
10 Oct 2023
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
Shiping Yang
Renliang Sun
Xiao-Yi Wan
HILM
109
43
0
10 Oct 2023
MuseChat: A Conversational Music Recommendation System for Videos
Zhikang Dong
Bin Chen
Xiulong Liu
Paweł Polak
Peng Zhang
LRM
121
27
0
10 Oct 2023
Compressing Context to Enhance Inference Efficiency of Large Language Models
Yucheng Li
Bo Dong
Chenghua Lin
Frank Guerin
63
74
0
09 Oct 2023
DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
127
23
0
09 Oct 2023
Aligning Language Models with Human Preferences via a Bayesian Approach
Jiashuo Wang
Haozhao Wang
Shichao Sun
Wenjie Li
ALM
105
25
0
09 Oct 2023
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang
Qianhui Wu
Chin-Yew Lin
Yuqing Yang
Lili Qiu
111
119
0
09 Oct 2023
A Closer Look into Automatic Evaluation Using Large Language Models
Cheng-Han Chiang
Hunghuei Lee
ELM, ALM, LM&MA
90
13
0
09 Oct 2023
Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution
Xinze Li
Yixin Cao
Liangming Pan
Yubo Ma
Aixin Sun
HILM
40
21
0
09 Oct 2023
Generative Judge for Evaluating Alignment
Junlong Li
Shichao Sun
Weizhe Yuan
Run-Ze Fan
Hai Zhao
Pengfei Liu
ELM, ALM
112
91
0
09 Oct 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
Siyang Liu
Naihao Deng
Sahand Sabour
Yilin Jia
Minlie Huang
Rada Mihalcea
91
23
0
09 Oct 2023
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus
Andrea Piergentili
Beatrice Savoldi
Dennis Fucci
Matteo Negri
L. Bentivogli
62
23
0
08 Oct 2023
Factuality Challenges in the Era of Large Language Models
Isabelle Augenstein
Timothy Baldwin
Meeyoung Cha
Tanmoy Chakraborty
Giovanni Luca Ciampaglia
...
Rubén Míguez
Preslav Nakov
Dietram A. Scheufele
Shivam Sharma
Giovanni Zagni
HILM
81
44
0
08 Oct 2023
Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements
Yushan Qian
Weinan Zhang
Ting Liu
AI4MH
84
44
0
08 Oct 2023
WikiIns: A High-Quality Dataset for Controlled Text Editing by Natural Language Instruction
Xiang Chen
Zheng Li
Xiaojun Wan
49
0
0
08 Oct 2023
MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks
Jingyuan Qi
Minqian Liu
Ying Shen
Zhiyang Xu
Lifu Huang
LRM, VGen
89
2
0
08 Oct 2023
Exploring the Usage of Chinese Pinyin in Pretraining
Baojun Wang
Kun Xu
Lifeng Shang
AI4CE
34
0
0
08 Oct 2023
Measuring Information in Text Explanations
Zining Zhu
Frank Rudzicz
FAtt
70
0
0
06 Oct 2023
Amortizing intractable inference in large language models
Marvin Schmitt
Moksh Jain
Daniel Habermann
Younesse Kaddar
Ullrich Kothe
Stefan T. Radev
Nikolay Malkin
AIFin, BDL
130
58
0
06 Oct 2023
A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks
Fangshuo Liao
Md Tahmid Rahman Laskar
Cruz Barnum
Jimmy Xiangji Huang
AI4MH, LM&MA
97
82
0
06 Oct 2023
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Wanyun Cui
Qianle Wang
LRM
92
9
0
06 Oct 2023
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Abe Bohan Hou
Jingyu Zhang
Tianxing He
Yichen Wang
Yung-Sung Chuang
Hongwei Wang
Lingfeng Shen
Benjamin Van Durme
Daniel Khashabi
Yulia Tsvetkov
WaLM
92
0
0
06 Oct 2023