2106.11520
BARTScore: Evaluating Generated Text as Text Generation
22 June 2021
Weizhe Yuan
Graham Neubig
Pengfei Liu
Papers citing
"BARTScore: Evaluating Generated Text as Text Generation"
50 / 535 papers shown
INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback
Wenda Xu
Danqing Wang
Liangming Pan
Zhenqiao Song
Markus Freitag
Luu Anh Tuan
Lei Li
ALM
ELM
38
17
0
23 May 2023
SciMON: Scientific Inspiration Machines Optimized for Novelty
Qingyun Wang
Doug Downey
Heng Ji
Tom Hope
LLMAG
37
62
0
23 May 2023
Towards Graph-hop Retrieval and Reasoning in Complex Question Answering over Textual Database
Minjun Zhu
Yixuan Weng
Shizhu He
Kang Liu
Jun Zhao
RALM
LRM
25
1
0
23 May 2023
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
Lucy Lu Wang
Yulia Otmakhova
Jay DeYoung
Thinh Hung Truong
Bailey Kuehl
Erin Bransom
Byron C. Wallace
113
20
0
23 May 2023
Evaluating Factual Consistency of Texts with Semantic Role Labeling
Jing Fan
Dennis Aumiller
Michael Gertz
HILM
36
4
0
22 May 2023
Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
Chenhui Shen
Liying Cheng
Xuan-Phi Nguyen
Yang You
Lidong Bing
ELM
ALM
47
64
0
22 May 2023
Evaluating Open-QA Evaluation
Cunxiang Wang
Sirui Cheng
Qipeng Guo
Yuanhao Yue
Bowen Ding
Zhikun Xu
Yidong Wang
Xiangkun Hu
Zheng Zhang
Yue Zhang
ELM
34
29
0
21 May 2023
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Mario Giulianelli
Joris Baan
Wilker Aziz
Raquel Fernández
Barbara Plank
UQLM
40
30
0
19 May 2023
Recent Trends in Unsupervised Summarization
Mohammad Khosravani
Amine Trabelsi
37
0
0
18 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Wen-tau Yih
Ziyu Yao
35
17
0
14 May 2023
ParaLS: Lexical Substitution via Pretrained Paraphraser
Jipeng Qiang
Kang Liu
Yun Li
Yunhao Yuan
Yi Zhu
KELM
31
11
0
14 May 2023
Zero-shot Faithful Factual Error Correction
Kung-Hsiang Huang
Hou Pong Chan
Heng Ji
KELM
HILM
26
30
0
13 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Yingce Xia
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
30
8
0
12 May 2023
Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation
Lei Liu
J. Huang
CLL
29
2
0
12 May 2023
Context-Aware Document Simplification
Liam Cripwell
Joël Legrand
Claire Gardent
32
5
0
10 May 2023
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes
Yassir Fathullah
Puria Radmard
Adian Liusie
Mark J. F. Gales
OODD
32
1
0
09 May 2023
The Current State of Summarization
Fabian Retkowski
23
6
0
08 May 2023
SkillQG: Learning to Generate Question for Reading Comprehension Assessment
Xiaoqiang Wang
Bang Liu
Siliang Tang
Lingfei Wu
25
3
0
08 May 2023
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
Yufen Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
...
Zeng Zhao
Zhou Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
VLM
28
21
0
06 May 2023
Repairing Deep Neural Networks Based on Behavior Imitation
Zhen Liang
Taoran Wu
Changyuan Zhao
Wanwei Liu
Bai Xue
Wenjing Yang
J. Wang
AAML
42
5
0
05 May 2023
Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings
Daniel Philip Rose
Vaishnavi Himakunthala
Andy Ouyang
Ryan He
Alex Mei
Yujie Lu
Michael Stephen Saxon
Chinmay Sonar
Diba Mirza
William Yang Wang
LRM
72
38
0
03 May 2023
Background Knowledge Grounding for Readable, Relevant, and Factual Biomedical Lay Summaries
Domenic Rosati
HILM
33
0
0
03 May 2023
Causality-aware Concept Extraction based on Knowledge-guided Prompting
Siyu Yuan
Deqing Yang
Jinxi Liu
Shuyu Tian
Jiaqing Liang
Yanghua Xiao
R. Xie
69
13
0
03 May 2023
Turning Flowchart into Dialog: Augmenting Flowchart-grounded Troubleshooting Dialogs via Synthetic Data Generation
Haolan Zhan
Sameen Maruf
Lizhen Qu
Yufei Wang
Ingrid Zukerman
Gholamreza Haffari
35
1
0
02 May 2023
string2string: A Modern Python Library for String-to-String Algorithms
Mirac Suzgun
Stuart M. Shieber
Dan Jurafsky
39
7
0
27 Apr 2023
Learning Human-Human Interactions in Images from Weak Textual Supervision
Morris Alper
Hadar Averbuch-Elor
VLM
45
2
0
27 Apr 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
32
45
0
21 Apr 2023
MixPro: Simple yet Effective Data Augmentation for Prompt-based Learning
Bohan Li
Longxu Dou
Yutai Hou
Yunlong Feng
Honglin Mu
Qingfu Zhu
Qinghua Sun
Wanxiang Che
VLM
37
3
0
19 Apr 2023
WebBrain: Learning to Generate Factually Correct Articles for Queries by Grounding on Large Web Corpus
Hongjing Qian
Yutao Zhu
Zhicheng Dou
Haoqi Gu
Xinyu Zhang
Zheng Liu
Ruofei Lai
Bo Zhao
J. Nie
Ji-Rong Wen
38
25
0
10 Apr 2023
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang
Shaoxiong Ji
Tianlin Zhang
Qianqian Xie
Zi-Zhou Kuang
Sophia Ananiadou
ELM
AI4MH
LRM
35
59
0
06 Apr 2023
ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about
Aman Rangapur
Haoran Wang
AI4MH
39
3
0
06 Apr 2023
Human-like Summarization Evaluation with ChatGPT
Mingqi Gao
Jie Ruan
Renliang Sun
Xunjian Yin
Shiping Yang
Xiaojun Wan
ALM
AI4MH
29
125
0
05 Apr 2023
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
Yi Chen
Rui Wang
Haiyun Jiang
Shuming Shi
Ruifeng Xu
LM&MA
38
75
0
03 Apr 2023
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Sifan Long
Zhen Zhao
Junkun Yuan
Zichang Tan
Jiangjiang Liu
Luping Zhou
Sheng-sheng Wang
Jingdong Wang
VLM
31
2
0
30 Mar 2023
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
Yang Liu
Dan Iter
Yichong Xu
Shuohang Wang
Ruochen Xu
Chenguang Zhu
ELM
ALM
LM&MA
56
1,082
0
29 Mar 2023
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
Zheheng Luo
Qianqian Xie
Sophia Ananiadou
ELM
HILM
ALM
41
73
0
27 Mar 2023
KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation
Di Wu
Da Yin
Kai-Wei Chang
39
1
0
27 Mar 2023
Data-centric Artificial Intelligence: A Survey
Daochen Zha
Zaid Pervaiz Bhat
Kwei-Herng Lai
Fan Yang
Zhimeng Jiang
Shaochen Zhong
Xia Hu
27
192
0
17 Mar 2023
DeltaScore: Fine-Grained Story Evaluation with Perturbations
Zhuohan Xie
Miao Li
Trevor Cohn
Jey Han Lau
32
7
0
15 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark J. F. Gales
HILM
LRM
152
396
0
15 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
30
37
0
14 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
34
7
0
09 Mar 2023
Is ChatGPT a Good NLG Evaluator? A Preliminary Study
Jiaan Wang
Yunlong Liang
Fandong Meng
Zengkui Sun
Haoxiang Shi
Zhixu Li
Jinan Xu
Jianfeng Qu
Jie Zhou
LM&MA
ELM
ALM
AI4MH
62
446
0
07 Mar 2023
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization
Griffin Adams
Jason Zucker
Noémie Elhadad
54
23
0
07 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Chenyu You
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
15
27
0
07 Mar 2023
Choice Over Control: How Users Write with Large Language Models using Diegetic and Non-Diegetic Prompting
Hai Dang
Sven Goller
Florian Lehmann
Daniel Buschek
AI4CE
96
73
0
06 Mar 2023
Interactive Text Generation
Felix Faltings
Michel Galley
Baolin Peng
Kianté Brantley
Weixin Cai
Yizhe Zhang
Jianfeng Gao
Bill Dolan
33
0
0
02 Mar 2023
Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Kashun Shum
Shizhe Diao
Tong Zhang
ReLM
LRM
28
129
0
24 Feb 2023
Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback
Baolin Peng
Michel Galley
Pengcheng He
Hao Cheng
Yujia Xie
...
Qiuyuan Huang
Lars Liden
Zhou Yu
Weizhu Chen
Jianfeng Gao
KELM
HILM
LRM
25
376
0
24 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLM
KELM
LLMAG
LRM
31
121
0
23 Feb 2023