ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.12693
  4. Cited By
QuestEval: Summarization Asks for Fact-based Evaluation

QuestEval: Summarization Asks for Fact-based Evaluation

23 March 2021
Thomas Scialom
Paul-Alexis Dray
Patrick Gallinari
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
Alex Jinpeng Wang
    HILM
ArXivPDFHTML

Papers citing "QuestEval: Summarization Asks for Fact-based Evaluation"

50 / 181 papers shown
Title
Interpretable Automatic Fine-grained Inconsistency Detection in Text
  Summarization
Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization
Hou Pong Chan
Qi Zeng
Chenhui Xu
HILM
29
12
0
23 May 2023
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Philippe Laban
Wojciech Kry'sciñski
Divyansh Agarwal
Alexander R. Fabbri
Caiming Xiong
Shafiq Joty
Chien-Sheng Wu
ALM
HILM
35
33
0
23 May 2023
USB: A Unified Summarization Benchmark Across Tasks and Domains
USB: A Unified Summarization Benchmark Across Tasks and Domains
Kundan Krishna
Prakhar Gupta
S. Ramprasad
Byron C. Wallace
Jeffrey P. Bigham
Zachary Chase Lipton
HILM
43
8
0
23 May 2023
Evaluating Factual Consistency of Summaries with Large Language Models
Evaluating Factual Consistency of Summaries with Large Language Models
Shiqi Chen
Siyang Gao
Junxian He
ELM
LRM
HILM
37
6
0
23 May 2023
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Yifu Qiu
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
HILM
61
43
0
23 May 2023
LM vs LM: Detecting Factual Errors via Cross Examination
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen
May Hamri
Mor Geva
Amir Globerson
HILM
41
120
0
22 May 2023
TrueTeacher: Learning Factual Consistency Evaluation with Large Language
  Models
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models
Zorik Gekhman
Jonathan Herzig
Roee Aharoni
Chen Elkind
Idan Szpektor
HILM
ELM
31
72
0
18 May 2023
What You See is What You Read? Improving Text-Image Alignment Evaluation
What You See is What You Read? Improving Text-Image Alignment Evaluation
Michal Yarom
Yonatan Bitton
Soravit Changpinyo
Roee Aharoni
Jonathan Herzig
Oran Lang
E. Ofek
Idan Szpektor
EGVM
59
74
0
17 May 2023
FactKB: Generalizable Factuality Evaluation using Language Models
  Enhanced with Factual Knowledge
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng
Vidhisha Balachandran
Yuyang Bai
Yulia Tsvetkov
KELM
HILM
29
52
0
14 May 2023
Expository Text Generation: Imitate, Retrieve, Paraphrase
Expository Text Generation: Imitate, Retrieve, Paraphrase
Nishant Balepur
Jie Huang
Kevin Chen-Chuan Chang
16
8
0
05 May 2023
Extractive Summarization via ChatGPT for Faithful Summary Generation
Extractive Summarization via ChatGPT for Faithful Summary Generation
Haopeng Zhang
Xiao Liu
Jiawei Zhang
38
76
0
09 Apr 2023
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
Zheheng Luo
Qianqian Xie
Sophia Ananiadou
ELM
HILM
ALM
41
74
0
27 Mar 2023
Faithfulness-Aware Decoding Strategies for Abstractive Summarization
Faithfulness-Aware Decoding Strategies for Abstractive Summarization
David Wan
Mengwen Liu
Kathleen McKeown
Markus Dreyer
Joey Tianyi Zhou
HILM
111
32
0
06 Mar 2023
WiCE: Real-World Entailment for Claims in Wikipedia
WiCE: Real-World Entailment for Claims in Wikipedia
Ryo Kamoi
Tanya Goyal
Juan Diego Rodriguez
Greg Durrett
41
81
0
02 Mar 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
31
15
0
17 Feb 2023
GPTScore: Evaluate as You Desire
GPTScore: Evaluate as You Desire
Jinlan Fu
See-Kiong Ng
Zhengbao Jiang
Pengfei Liu
LM&MA
ALM
ELM
15
268
0
08 Feb 2023
Leveraging Summary Guidance on Medical Report Summarization
Leveraging Summary Guidance on Medical Report Summarization
Yunqi Zhu
Xuebing Yang
Yuanyuan Wu
Wensheng Zhang
33
9
0
08 Feb 2023
Do Multi-Document Summarization Models Synthesize?
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
24
8
0
31 Jan 2023
Improving Open-Domain Dialogue Evaluation with a Causal Inference Model
Improving Open-Domain Dialogue Evaluation with a Causal Inference Model
Cat P. Le
Luke Dai
Michael Johnston
Yang Liu
M. Walker
R. Ghanadan
ELM
19
10
0
31 Jan 2023
MQAG: Multiple-choice Question Answering and Generation for Assessing
  Information Consistency in Summarization
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
21
35
0
28 Jan 2023
mFACE: Multilingual Summarization with Factual Consistency Evaluation
mFACE: Multilingual Summarization with Factual Consistency Evaluation
Roee Aharoni
Shashi Narayan
Joshua Maynez
Jonathan Herzig
Elizabeth Clark
Mirella Lapata
HILM
27
44
0
20 Dec 2022
WeCheck: Strong Factual Consistency Checker via Weakly Supervised
  Learning
WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning
Wenhao Wu
Wei Li
Xinyan Xiao
Jiachen Liu
Sujian Li
Yajuan Lv
HILM
31
4
0
20 Dec 2022
BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of
  Faithfulness Metrics
BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics
Liang Ma
Shuyang Cao
IV RobertL.Logan
Di Lu
Shihao Ran
Kecheng Zhang
Joel R. Tetreault
A. Jaimes
17
6
0
20 Dec 2022
Revisiting the Gold Standard: Grounding Summarization Evaluation with
  Robust Human Evaluation
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
Yixin Liu
Alexander R. Fabbri
Pengfei Liu
Yilun Zhao
Linyong Nan
...
Simeng Han
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
ALM
31
133
0
15 Dec 2022
Scientific Paper Extractive Summarization Enhanced by Citation Graphs
Scientific Paper Extractive Summarization Enhanced by Citation Graphs
Preslav Nakov
Mingzhe Li
Shen Gao
Rui Yan
Xin Gao
Xiangliang Zhang
25
12
0
08 Dec 2022
RHO ($ρ$): Reducing Hallucination in Open-domain Dialogues with
  Knowledge Grounding
RHO (ρρρ): Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding
Ziwei Ji
Zihan Liu
Nayeon Lee
Tiezheng Yu
Bryan Wilie
Mini Zeng
Pascale Fung
HILM
23
53
0
03 Dec 2022
Prompted Opinion Summarization with GPT-3.5
Prompted Opinion Summarization with GPT-3.5
Adithya Bhaskar
Alexander R. Fabbri
Greg Durrett
ELM
19
51
0
29 Nov 2022
HaRiM$^+$: Evaluating Summary Quality with Hallucination Risk
HaRiM+^++: Evaluating Summary Quality with Hallucination Risk
Seonil Son
Junsoo Park
J. Hwang
Junghwa Lee
Hyungjong Noh
Yeonsoo Lee
HILM
19
8
0
22 Nov 2022
Evaluating the Factual Consistency of Large Language Models Through News
  Summarization
Evaluating the Factual Consistency of Large Language Models Through News Summarization
Derek Tam
Anisha Mascarenhas
Shiyue Zhang
Sarah Kwan
Joey Tianyi Zhou
Colin Raffel
HILM
33
96
0
15 Nov 2022
Evaluating and Improving Factuality in Multimodal Abstractive
  Summarization
Evaluating and Improving Factuality in Multimodal Abstractive Summarization
David Wan
Joey Tianyi Zhou
20
10
0
04 Nov 2022
Time-aware Prompting for Text Generation
Time-aware Prompting for Text Generation
Shuyang Cao
Lu Wang
29
11
0
03 Nov 2022
RQUGE: Reference-Free Metric for Evaluating Question Generation by
  Answering the Question
RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question
Alireza Mohammadshahi
Thomas Scialom
Majid Yazdani
Pouya Yanki
Angela Fan
James Henderson
Marzieh Saeidi
31
20
0
02 Nov 2022
Improving abstractive summarization with energy-based re-ranking
Improving abstractive summarization with energy-based re-ranking
Diogo Pernes
Afonso Mendes
André F. T. Martins
23
6
0
27 Oct 2022
How "Multi" is Multi-Document Summarization?
How "Multi" is Multi-Document Summarization?
Ruben Wolhandler
Arie Cattan
Ori Ernst
Ido Dagan
84
12
0
23 Oct 2022
On the Limitations of Reference-Free Evaluations of Generated Text
On the Limitations of Reference-Free Evaluations of Generated Text
Daniel Deutsch
Rotem Dror
Dan Roth
40
45
0
22 Oct 2022
Precisely the Point: Adversarial Augmentations for Faithful and
  Informative Text Generation
Precisely the Point: Adversarial Augmentations for Faithful and Informative Text Generation
Wenhao Wu
Wei Li
Jiachen Liu
Xinyan Xiao
Sujian Li
Yajuan Lyu
42
3
0
22 Oct 2022
Analyzing and Evaluating Faithfulness in Dialogue Summarization
Analyzing and Evaluating Faithfulness in Dialogue Summarization
Bin Wang
Chen Zhang
Yan Zhang
Yiming Chen
Haizhou Li
HILM
41
15
0
21 Oct 2022
Language Generation Models Can Cause Harm: So What Can We Do About It?
  An Actionable Survey
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
Sachin Kumar
Vidhisha Balachandran
Lucille Njoo
Antonios Anastasopoulos
Yulia Tsvetkov
ELM
81
85
0
14 Oct 2022
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Towards a Unified Multi-Dimensional Evaluator for Text Generation
Ming Zhong
Yang Liu
Da Yin
Yuning Mao
Yizhu Jiao
Peng Liu
Chenguang Zhu
Heng Ji
Jiawei Han
ELM
50
256
0
13 Oct 2022
Shortcomings of Question Answering Based Factuality Frameworks for Error
  Localization
Shortcomings of Question Answering Based Factuality Frameworks for Error Localization
Ryo Kamoi
Tanya Goyal
Greg Durrett
HILM
44
14
0
13 Oct 2022
Just ClozE! A Novel Framework for Evaluating the Factual Consistency
  Faster in Abstractive Summarization
Just ClozE! A Novel Framework for Evaluating the Factual Consistency Faster in Abstractive Summarization
Yiyang Li
Lei Li
Marina Litvak
N. Vanetik
Dingxing Hu
Yuze Li
Yanquan Zhou
HILM
40
0
0
06 Oct 2022
Towards Improving Faithfulness in Abstractive Summarization
Towards Improving Faithfulness in Abstractive Summarization
Preslav Nakov
Mingzhe Li
Xin Gao
Xiangliang Zhang
HILM
33
26
0
04 Oct 2022
News Summarization and Evaluation in the Era of GPT-3
News Summarization and Evaluation in the Era of GPT-3
Tanya Goyal
Junyi Jessy Li
Greg Durrett
ELM
31
387
0
26 Sep 2022
Summarization Programs: Interpretable Abstractive Summarization with
  Neural Modular Trees
Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
Swarnadeep Saha
Shiyue Zhang
Peter Hase
Joey Tianyi Zhou
29
19
0
21 Sep 2022
Extractive is not Faithful: An Investigation of Broad Unfaithfulness
  Problems in Extractive Summarization
Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
Shiyue Zhang
David Wan
Joey Tianyi Zhou
HILM
52
27
0
08 Sep 2022
Podcast Summary Assessment: A Resource for Evaluating Summary Assessment
  Methods
Podcast Summary Assessment: A Resource for Evaluating Summary Assessment Methods
Potsawee Manakul
Mark Gales
15
5
0
28 Aug 2022
PEER: A Collaborative Language Model
PEER: A Collaborative Language Model
Timo Schick
Jane Dwivedi-Yu
Zhengbao Jiang
Fabio Petroni
Patrick Lewis
Gautier Izacard
Qingfei You
Christoforos Nalmpantis
Edouard Grave
Sebastian Riedel
ALM
50
93
0
24 Aug 2022
Joint Generator-Ranker Learning for Natural Language Generation
Joint Generator-Ranker Learning for Natural Language Generation
Weizhou Shen
Yeyun Gong
Yelong Shen
Song Wang
Xiaojun Quan
Nan Duan
Weizhu Chen
42
5
0
28 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in
  Language-guided RL
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
28
24
0
20 Jun 2022
Counseling Summarization using Mental Health Knowledge Guided Utterance
  Filtering
Counseling Summarization using Mental Health Knowledge Guided Utterance Filtering
Aseem Srivastava
Tharun Suresh
Sarah Peregrine
S. P. Lord
Md. Shad Akhtar
Tanmoy Chakraborty
19
13
0
08 Jun 2022
Previous
1234
Next