Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.11520
Cited By
BARTScore: Evaluating Generated Text as Text Generation
22 June 2021
Weizhe Yuan
Graham Neubig
Pengfei Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BARTScore: Evaluating Generated Text as Text Generation"
50 / 537 papers shown
Title
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness
Danna Zheng
Danyang Liu
Mirella Lapata
Jeff Z. Pan
HILM
49
6
0
19 Feb 2024
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema
Junru Lu
Siyu An
Min Zhang
Yulan He
Di Yin
Xing Sun
48
2
0
19 Feb 2024
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
Tejpalsingh Siledar
Swaroop Nath
Sankara Sri Raghava Ravindra Muddu
Rupasai Rangaraju
Swaprava Nath
...
Suman Banerjee
Amey Patil
Sudhanshu Singh
M. Chelliah
Nikesh Garera
ALM
LRM
35
6
0
18 Feb 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Jaylen Jones
Lingbo Mo
Eric Fosler-Lussier
Huan Sun
56
3
0
18 Feb 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
48
17
0
18 Feb 2024
Humans or LLMs as the Judge? A Study on Judgement Biases
Guiming Hardy Chen
Shunian Chen
Ziche Liu
Feng Jiang
Benyou Wang
82
93
0
16 Feb 2024
Can We Verify Step by Step for Incorrect Answer Detection?
Xin Xu
Shizhe Diao
Can Yang
Yang Wang
LRM
130
14
0
16 Feb 2024
Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence
Yinhong Liu
Yixuan Su
Ehsan Shareghi
Nigel Collier
36
4
0
15 Feb 2024
GPTs Are Multilingual Annotators for Sequence Generation Tasks
Juhwan Choi
Eunju Lee
Kyohoon Jin
Youngbin Kim
25
10
0
08 Feb 2024
An Empirical Analysis of Diversity in Argument Summarization
Michiel van der Meer
Piek T. J. M. Vossen
Catholijn M. Jonker
P. Murukannaiah
18
6
0
02 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELM
LM&MA
65
29
0
02 Feb 2024
PAP-REC: Personalized Automatic Prompt for Recommendation Language Model
Zelong Li
Jianchao Ji
Yingqiang Ge
Wenyue Hua
Yongfeng Zhang
19
5
0
01 Feb 2024
MT-Ranker: Reference-free machine translation evaluation by inter-system ranking
Ibraheem Muhammad Moosa
Rui Zhang
Wenpeng Yin
24
5
0
30 Jan 2024
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Ansar Aynetdinov
Alan Akbik
ALM
44
12
0
30 Jan 2024
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
Takaaki Saeki
Soumi Maiti
Shinnosuke Takamichi
Shinji Watanabe
Hiroshi Saruwatari
24
11
0
30 Jan 2024
MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim Verification
Xia Zeng
A. Zubiaga
49
5
0
29 Jan 2024
Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media
Jiayu Song
Jenny Chim
Adam Tsakalidis
Julia Ive
Dana Atzil-Slonim
M. Liakata
27
3
0
29 Jan 2024
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
Haochen Tan
Zhijiang Guo
Zhan Shi
Lu Xu
Zhili Liu
...
Xiaoguang Li
Yasheng Wang
Lifeng Shang
Qun Liu
Linqi Song
43
12
0
26 Jan 2024
UR4NNV: Neural Network Verification, Under-approximation Reachability Works!
Zhen Liang
Taoran Wu
Ran Zhao
Bai Xue
Ji Wang
Wenjing Yang
Shaojun Deng
Wanwei Liu
AAML
30
0
0
23 Jan 2024
Hallucination Detection and Hallucination Mitigation: An Investigation
Junliang Luo
Tianyu Li
Di Wu
Michael R. M. Jenkin
Steve Liu
Gregory Dudek
HILM
LLMAG
44
22
0
16 Jan 2024
PRewrite: Prompt Rewriting with Reinforcement Learning
Weize Kong
Spurthi Amba Hombaiah
Mingyang Zhang
Qiaozhu Mei
Michael Bendersky
LLMAG
13
14
0
16 Jan 2024
Enhancing Robustness of LLM-Synthetic Text Detectors for Academic Writing: A Comprehensive Analysis
Zhicheng Dou
Yuchen Guo
Ching-Chun Chang
H. Nguyen
Isao Echizen
DeLMO
13
2
0
16 Jan 2024
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges
Zhen Li
Xiaohan Xu
Tao Shen
Can Xu
Jia-Chen Gu
Yuxuan Lai
Chongyang Tao
Shuai Ma
LM&MA
ELM
39
9
0
13 Jan 2024
Knowledge-Centric Templatic Views of Documents
Isabel Cachola
Silviu Cucerzan
Allen Herring
Vuksan Mijovic
Erik Oveson
S. Jauhar
17
1
0
13 Jan 2024
Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
Xu Huang
Zhirui Zhang
Xiang Geng
Yichao Du
Jiajun Chen
Shujian Huang
48
7
0
12 Jan 2024
A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer
Yong Ma
Senlin Luo
Yu-Ming Shang
Zhengjun Li
Yong Liu
VLM
28
2
0
10 Jan 2024
LUNA: A Framework for Language Understanding and Naturalness Assessment
Marat Saidov
A. Bakalova
Ekaterina Taktasheva
Vladislav Mikhailov
Ekaterina Artemova
ELM
39
1
0
09 Jan 2024
The Critique of Critique
Shichao Sun
Junlong Li
Weizhe Yuan
Ruifeng Yuan
Wenjie Li
Pengfei Liu
ELM
40
0
0
09 Jan 2024
LightHouse: A Survey of AGI Hallucination
Feng Wang
LRM
HILM
VLM
32
3
0
08 Jan 2024
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Wendi Cui
Jiaxin Zhang
Zhuohang Li
Lopez Damien
Kamalika Das
Bradley Malin
Kumar Sricharan
27
2
0
04 Jan 2024
Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering
Pierre Erbacher
Louis Falissard
Vincent Guigue
Laure Soulier
HILM
RALM
24
4
0
03 Jan 2024
Large Language Models in Mental Health Care: a Scoping Review
Yining Hua
Fenglin Liu
Kailai Yang
Zehan Li
Y. Sheu
Peilin Zhou
Lauren V. Moran
Sophia Ananiadou
Andrew Beam
AI4MH
LM&MA
32
36
0
01 Jan 2024
Large Language Models for Conducting Advanced Text Analytics Information Systems Research
Benjamin Ampel
Chi-Heng Yang
Junjie Hu
Hsinchun Chen
38
7
0
27 Dec 2023
Speech Translation with Large Language Models: An Industrial Practice
Zhichao Huang
Rong Ye
Tom Ko
Qianqian Dong
Shanbo Cheng
Mingxuan Wang
Hang Li
70
15
0
21 Dec 2023
CoAScore: Chain-of-Aspects Prompting for NLG Evaluation
Peiyuan Gong
Jiaxin Mao
ELM
54
10
0
16 Dec 2023
PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching
Zhenting Qi
Xiaoyu Tan
Shaojie Shi
Chao Qu
Yinghui Xu
Yuan Qi
ALM
43
10
0
09 Dec 2023
Towards leveraging LLMs for Conditional QA
Syed-Amad Hussain
Parag Dakle
SaiKrishna Rallabandi
Preethi Raghavan
ELM
21
2
0
02 Dec 2023
KNVQA: A Benchmark for evaluation knowledge-based VQA
Sirui Cheng
Siyu Zhang
Jiayi Wu
Muchen Lan
19
1
0
21 Nov 2023
Countering Misinformation via Emotional Response Generation
Daniel Russo
Shane P. Kaszefski-Yaschuk
Jacopo Staiano
Marco Guerini
OffRL
26
9
0
17 Nov 2023
LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores
Yiqi Liu
N. Moosavi
Chenghua Lin
ELM
30
48
0
16 Nov 2023
Event Causality Is Key to Computational Story Understanding
Yidan Sun
Qin Chao
Boyang Albert Li
26
5
0
16 Nov 2023
Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization
Gaurav Sahu
Olga Vechtomova
I. Laradji
39
1
0
16 Nov 2023
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation
Haoyi Qiu
Kung-Hsiang Huang
Jingnong Qu
Nanyun Peng
HILM
28
6
0
16 Nov 2023
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization
G. Chrysostomou
Zhixue Zhao
Miles Williams
Nikolaos Aletras
HILM
34
10
0
15 Nov 2023
Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects -- A Survey
Ashok Urlana
Pruthwik Mishra
Tathagato Roy
Rahul Mishra
37
8
0
15 Nov 2023
Fusion-Eval: Integrating Assistant Evaluators with LLMs
Lei Shu
Nevan Wichers
Liangchen Luo
Yun Zhu
Yinxiao Liu
Jindong Chen
Lei Meng
ELM
13
3
0
15 Nov 2023
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects
Minqian Liu
Ying Shen
Zhiyang Xu
Yixin Cao
Eunah Cho
Vaibhav Kumar
Reza Ghanadan
Lifu Huang
ELM
LM&MA
ALM
52
25
0
15 Nov 2023
Plum: Prompt Learning using Metaheuristic
Rui Pan
Shuo Xing
Shizhe Diao
Wenhe Sun
Xiang Liu
Kashun Shum
Renjie Pi
Jipeng Zhang
Tong Zhang
VLM
OffRL
LRM
44
6
0
14 Nov 2023
Insights into Classifying and Mitigating LLMs' Hallucinations
Alessandro Bruno
P. Mazzeo
Aladine Chetouani
Marouane Tliba
M. A. Kerkouri
HILM
51
10
0
14 Nov 2023
Fair Abstractive Summarization of Diverse Perspectives
Yusen Zhang
Nan Zhang
Yixin Liu
Alexander R. Fabbri
Junru Liu
...
Caiming Xiong
Jieyu Zhao
Dragomir R. Radev
Kathleen McKeown
Rui Zhang
36
8
0
14 Nov 2023
Previous
1
2
3
4
5
6
...
9
10
11
Next