Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,522 papers shown
Title
Multilingual Simplification of Medical Texts
Sebastian Antony Joseph
Kathryn Kazanas
Keziah Reina
Vishnesh J. Ramanathan
Wei Xu
Byron C. Wallace
Junyi Jessy Li
114
14
0
21 May 2023
Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification
Renliang Sun
Wei Xu
Xiaojun Wan
CLL
95
19
0
21 May 2023
Evaluating Open-QA Evaluation
Cunxiang Wang
Sirui Cheng
Qipeng Guo
Yuanhao Yue
Bowen Ding
Zhikun Xu
Yidong Wang
Xiangkun Hu
Zheng Zhang
Yue Zhang
ELM
123
33
0
21 May 2023
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs
Paul Burgess
Nigel Collier
Wray Buntine
Ehsan Shareghi
121
41
0
21 May 2023
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
Yatin Nandwani
Vineet Kumar
Dinesh Raghu
Sachindra Joshi
Luis A. Lastras
82
6
0
20 May 2023
Revisiting Automated Topic Model Evaluation with Large Language Models
Dominik Stammbach
Vilém Zouhar
Alexander Miserlis Hoyle
Mrinmaya Sachan
Elliott Ash
ELM
ALM
78
4
0
20 May 2023
Movie101: A New Movie Understanding Benchmark
Zihao Yue
Qi Zhang
Anwen Hu
Liang Zhang
Ziheng Wang
Qin Jin
VGen
79
17
0
20 May 2023
"What do others think?": Task-Oriented Conversational Modeling with Subjective Knowledge
Chao Zhao
Spandana Gella
Seokhwan Kim
Di Jin
Devamanyu Hazarika
Alexandros Papangelis
Behnam Hedayatnia
Mahdi Namazifar
Yang Liu
Dilek Z. Hakkani-Tür
94
7
0
20 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
René Peinl
Johannes Wirth
ELM
51
7
0
19 May 2023
Reducing Sequence Length by Predicting Edit Operations with Large Language Models
Masahiro Kaneko
Naoaki Okazaki
73
4
0
19 May 2023
Pseudo-Label Training and Model Inertia in Neural Machine Translation
B. Hsu
Anna Currey
Xing Niu
Maria Nuadejde
Georgiana Dinu
ODL
91
2
0
19 May 2023
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach
Masahiro Kaneko
Graham Neubig
Naoaki Okazaki
116
6
0
19 May 2023
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
Junyi Li
Xiaoxue Cheng
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
HILM
VLM
123
254
0
19 May 2023
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Mario Giulianelli
Joris Baan
Wilker Aziz
Raquel Fernández
Barbara Plank
UQLM
126
32
0
19 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
106
51
0
19 May 2023
DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion
Chao-Hong Tan
Jia-Chen Gu
Zhen-Hua Ling
DiffM
67
1
0
19 May 2023
PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning
Chengfeng Dou
Zhi Jin
Wenpin Jiao
Haiyan Zhao
Zhengwei Tao
Yongqiang Zhao
LM&MA
MedIm
106
8
0
19 May 2023
A Topic-aware Summarization Framework with Different Modal Side Information
Preslav Nakov
Mingzhe Li
Shen Gao
Xin Cheng
Qiang Yang
Qishen Zhang
Xin Gao
Xiangliang Zhang
99
14
0
19 May 2023
Recent Trends in Unsupervised Summarization
Mohammad Khosravani
Amine Trabelsi
87
0
0
18 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
70
19
0
18 May 2023
Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus
Yu Jiang
Tianyu Liu
Shuming Ma
Dongdong Zhang
Mrinmaya Sachan
Ryan Cotterell
75
7
0
18 May 2023
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
77
5
0
18 May 2023
Take a Break in the Middle: Investigating Subgoals towards Hierarchical Script Generation
Xinze Li
Yixin Cao
Muhao Chen
Aixin Sun
62
7
0
18 May 2023
Are Large Language Models Fit For Guided Reading?
Peter Ochieng
LM&MA
ELM
AI4Ed
78
2
0
18 May 2023
From chocolate bunny to chocolate crocodile: Do Language Models Understand Noun Compounds?
Jordan Coil
Vered Shwartz
LRM
49
16
0
17 May 2023
Elaborative Simplification as Implicit Questions Under Discussion
Yating Wu
William Sheffield
Kyle Mahowald
Junyi Jessy Li
84
15
0
17 May 2023
FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy
Zuhao Yang
Yingfang Yuan
Yang Xu
Shuo Zhan
Huajun Bai
Kefan Chen
CVBM
54
4
0
17 May 2023
Towards More Robust NLP System Evaluation: Handling Missing Scores in Benchmarks
Anas Himmi
Ekhine Irurozki
Nathan Noiry
Stephan Clémençon
Pierre Colombo
193
9
0
17 May 2023
Boosting Distress Support Dialogue Responses with Motivational Interviewing Strategy
A. Welivita
Pearl Pu
OffRL
100
17
0
17 May 2023
Balancing Lexical and Semantic Quality in Abstractive Summarization
Jeewoo Sul
Y. Choi
81
6
0
17 May 2023
Explaining black box text modules in natural language with language models
Chandan Singh
Aliyah R. Hsu
Richard Antonello
Shailee Jain
Alexander G. Huth
Bin Yu
Jianfeng Gao
MILM
82
58
0
17 May 2023
SGP-TOD: Building Task Bots Effortlessly via Schema-Guided LLM Prompting
Xiaoying Zhang
Baolin Peng
Kun Li
Jingyan Zhou
Helen M. Meng
139
46
0
15 May 2023
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs
Afra Feyza Akyürek
Ekin Akyürek
Aman Madaan
Ashwin Kalyan
Peter Clark
Derry Wijaya
Niket Tandon
ALM
KELM
114
102
0
15 May 2023
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist
Iftitahu Ni'mah
Meng Fang
Vlado Menkovski
Mykola Pechenizkiy
80
14
0
15 May 2023
Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks
Moritz Plenz
Juri Opitz
Philipp Heinisch
Philipp Cimiano
Anette Frank
95
9
0
15 May 2023
TESS: Text-to-Text Self-Conditioned Simplex Diffusion
Rabeeh Karimi Mahabadi
Hamish Ivison
Jaesung Tae
James Henderson
Iz Beltagy
Matthew E. Peters
Arman Cohan
104
28
0
15 May 2023
Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling
Y. Zhu
Xuebing Yang
Yuanyuan Wu
Wensheng Zhang
MedIm
41
2
0
15 May 2023
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback
Shang-ling Hsu
Raj Sanjay Shah
Prathik Senthil
Zahra Ashktorab
Casey Dugan
Werner Geyer
Diyi Yang
119
24
0
15 May 2023
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng
Vidhisha Balachandran
Yuyang Bai
Yulia Tsvetkov
KELM
HILM
79
59
0
14 May 2023
A Cognitive Stimulation Dialogue System with Multi-source Knowledge Fusion for Elders with Cognitive Impairment
Jiyue Jiang
Sheng Wang
Qintong Li
Lingpeng Kong
Chuan Wu
99
8
0
14 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Wen-tau Yih
Ziyu Yao
73
19
0
14 May 2023
STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and Generation
Yulun Du
Lydia B. Chilton
88
8
0
14 May 2023
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems
Sarik Ghazarian
Yijia Shao
Rujun Han
Aram Galstyan
Nanyun Peng
84
7
0
12 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Yingce Xia
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
100
8
0
12 May 2023
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
Wei-Lin Chen
An-Zi Yen
Cheng-Kuang Wu
Hen-Hsen Huang
Hsin-Hsi Chen
ReLM
LRM
52
11
0
12 May 2023
IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images
Varuna Krishna
S. Suryavardan
Shreyash Mishra
Sathyanarayanan Ramamoorthy
Parth Patwa
Megha Chakraborty
Aman Chadha
Amitava Das
Amit P. Sheth
VLM
121
3
0
12 May 2023
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales
Brihi Joshi
Ziyi Liu
Sahana Ramnath
Aaron Chan
Zhewei Tong
Shaoliang Nie
Qifan Wang
Yejin Choi
Xiang Ren
HAI
LRM
85
35
0
11 May 2023
Evaluating Open-Domain Question Answering in the Era of Large Language Models
Ehsan Kamalloo
Nouha Dziri
C. Clarke
Davood Rafiei
ELM
81
110
0
11 May 2023
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
Xinbei Ma
Yeyun Gong
Pengcheng He
Hai Zhao
Nan Duan
63
2
0
11 May 2023
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
Hongyuan Lu
Haoran Yang
Haoyang Huang
Dongdong Zhang
Wai Lam
Furu Wei
LRM
AI4CE
108
18
0
11 May 2023
Previous
1
2
3
...
46
47
48
...
69
70
71
Next