ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.12840
  4. Cited By
Evaluating the Factual Consistency of Abstractive Text Summarization

Evaluating the Factual Consistency of Abstractive Text Summarization

28 October 2019
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
    HILM
ArXivPDFHTML

Papers citing "Evaluating the Factual Consistency of Abstractive Text Summarization"

50 / 464 papers shown
Title
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long
  Form Text Generation
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Wen-tau Yih
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
ALM
86
611
0
23 May 2023
ManiTweet: A New Benchmark for Identifying Manipulation of News on
  Social Media
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media
Kung-Hsiang Huang
Hou Pong Chan
Kathleen McKeown
Heng Ji
39
1
0
23 May 2023
Evaluating Factual Consistency of Summaries with Large Language Models
Evaluating Factual Consistency of Summaries with Large Language Models
Shiqi Chen
Siyang Gao
Junxian He
ELM
LRM
HILM
37
6
0
23 May 2023
Automated Metrics for Medical Multi-Document Summarization Disagree with
  Human Evaluations
Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations
Lucy Lu Wang
Yulia Otmakhova
Jay DeYoung
Thinh Hung Truong
Bailey Kuehl
Erin Bransom
Byron C. Wallace
113
20
0
23 May 2023
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Detecting and Mitigating Hallucinations in Multilingual Summarisation
Yifu Qiu
Yftah Ziser
Anna Korhonen
Edoardo Ponti
Shay B. Cohen
HILM
61
43
0
23 May 2023
Evaluating Factual Consistency of Texts with Semantic Role Labeling
Evaluating Factual Consistency of Texts with Semantic Role Labeling
Jing Fan
Dennis Aumiller
Michael Gertz
HILM
39
4
0
22 May 2023
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization
  Evaluation
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Elizabeth Clark
Shruti Rijhwani
Sebastian Gehrmann
Joshua Maynez
Roee Aharoni
Vitaly Nikolaev
Thibault Sellam
Aditya Siddhant
Dipanjan Das
Ankur P. Parikh
32
38
0
22 May 2023
Revisiting the Architectures like Pointer Networks to Efficiently
  Improve the Next Word Distribution, Summarization Factuality, and Beyond
Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond
Haw-Shiuan Chang
Zonghai Yao
Alolika Gon
Hong-ye Yu
Andrew McCallum
46
10
0
20 May 2023
Pointwise Mutual Information Based Metric and Decoding Strategy for
  Faithful Generation in Document Grounded Dialogs
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
Yatin Nandwani
Vineet Kumar
Dinesh Raghu
Sachindra Joshi
Luis A. Lastras
35
6
0
20 May 2023
TrueTeacher: Learning Factual Consistency Evaluation with Large Language
  Models
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models
Zorik Gekhman
Jonathan Herzig
Roee Aharoni
Chen Elkind
Idan Szpektor
HILM
ELM
31
72
0
18 May 2023
Counterfactual Debiasing for Generating Factually Consistent Text
  Summaries
Counterfactual Debiasing for Generating Factually Consistent Text Summaries
Chenhe Dong
Yuexiang Xie
Yaliang Li
Ying Shen
CML
HILM
36
0
0
18 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized
  Language Models
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
52
30
0
17 May 2023
FactKB: Generalizable Factuality Evaluation using Language Models
  Enhanced with Factual Knowledge
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge
Shangbin Feng
Vidhisha Balachandran
Yuyang Bai
Yulia Tsvetkov
KELM
HILM
31
52
0
14 May 2023
Zero-shot Faithful Factual Error Correction
Zero-shot Faithful Factual Error Correction
Kung-Hsiang Huang
Hou Pong Chan
Heng Ji
KELM
HILM
32
30
0
13 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying
  Correlates on Long Form Scientific Summarization
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Yingce Xia
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
36
8
0
12 May 2023
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
Wei-Lin Chen
An-Zi Yen
Cheng-Kuang Wu
Hen-Hsen Huang
Hsin-Hsi Chen
ReLM
LRM
24
11
0
12 May 2023
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive
  Summarization
PROM: A Phrase-level Copying Mechanism with Pre-training for Abstractive Summarization
Xinbei Ma
Yeyun Gong
Pengcheng He
Hai Zhao
Nan Duan
42
2
0
11 May 2023
Automatic Evaluation of Attribution by Large Language Models
Automatic Evaluation of Attribution by Large Language Models
Xiang Yue
Boshi Wang
Ziru Chen
Kai Zhang
Yu-Chuan Su
Huan Sun
ALM
LRM
HILM
41
55
0
10 May 2023
The Current State of Summarization
The Current State of Summarization
Fabian Retkowski
23
6
0
08 May 2023
HistAlign: Improving Context Dependency in Language Generation by
  Aligning with History
HistAlign: Improving Context Dependency in Language Generation by Aligning with History
David Wan
Shiyue Zhang
Joey Tianyi Zhou
AI4TS
40
6
0
08 May 2023
Expository Text Generation: Imitate, Retrieve, Paraphrase
Expository Text Generation: Imitate, Retrieve, Paraphrase
Nishant Balepur
Jie Huang
Kevin Chen-Chuan Chang
18
8
0
05 May 2023
Entity-Based Evaluation of Political Bias in Automatic Summarization
Entity-Based Evaluation of Political Bias in Automatic Summarization
Karen Zhou
Chenhao Tan
39
1
0
03 May 2023
Can LMs Generalize to Future Data? An Empirical Analysis on Text
  Summarization
Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization
C. Cheang
Hou Pong Chan
Derek F. Wong
Xuebo Liu
Zhao Li
Yanming Sun
Shudong Liu
Lidia S. Chao
205
6
0
03 May 2023
DiffuSum: Generation Enhanced Extractive Summarization with Diffusion
DiffuSum: Generation Enhanced Extractive Summarization with Diffusion
Haopeng Zhang
Xiao Liu
Jiawei Zhang
DiffM
75
40
0
02 May 2023
Text-Blueprint: An Interactive Platform for Plan-based Conditional
  Generation
Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Fantine Huot
Joshua Maynez
Shashi Narayan
Reinald Kim Amplayo
Kuzman Ganchev
Annie Louis
Anders Sandholm
Dipanjan Das
Mirella Lapata
39
7
0
28 Apr 2023
Learning Human-Human Interactions in Images from Weak Textual
  Supervision
Learning Human-Human Interactions in Images from Weak Textual Supervision
Morris Alper
Hadar Averbuch-Elor
VLM
52
2
0
27 Apr 2023
A Survey for Biomedical Text Summarization: From Pre-trained to Large
  Language Models
A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models
Qianqian Xie
Zheheng Luo
Benyou Wang
Sophia Ananiadou
LM&MA
VLM
34
8
0
18 Apr 2023
OpenAssistant Conversations -- Democratizing Large Language Model
  Alignment
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Andreas Kopf
Yannic Kilcher
Dimitri von Rutte
Sotiris Anagnostidis
Zhi Rui Tam
...
Arnav Dantuluri
Andrew Maguire
Christoph Schuhmann
Huu Nguyen
A. Mattick
ALM
LM&MA
65
591
0
14 Apr 2023
Extractive Summarization via ChatGPT for Faithful Summary Generation
Extractive Summarization via ChatGPT for Faithful Summary Generation
Haopeng Zhang
Xiao Liu
Jiawei Zhang
38
76
0
09 Apr 2023
Human-like Summarization Evaluation with ChatGPT
Human-like Summarization Evaluation with ChatGPT
Mingqi Gao
Jie Ruan
Renliang Sun
Xunjian Yin
Shiping Yang
Xiaojun Wan
ALM
AI4MH
29
125
0
05 Apr 2023
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
Yang Liu
Dan Iter
Yichong Xu
Shuohang Wang
Ruochen Xu
Chenguang Zhu
ELM
ALM
LM&MA
80
1,090
0
29 Mar 2023
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
ChatGPT as a Factual Inconsistency Evaluator for Text Summarization
Zheheng Luo
Qianqian Xie
Sophia Ananiadou
ELM
HILM
ALM
41
74
0
27 Mar 2023
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for
  Generative Large Language Models
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
LRM
152
397
0
15 Mar 2023
Automatically Summarizing Evidence from Clinical Trials: A Prototype
  Highlighting Current Challenges
Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges
S. Ramprasad
Denis Jered McInerney
Iain J. Marshal
Byron C. Wallace
32
9
0
07 Mar 2023
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course
  Summarization
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization
Griffin Adams
Jason Zucker
Noémie Elhadad
57
23
0
07 Mar 2023
Faithfulness-Aware Decoding Strategies for Abstractive Summarization
Faithfulness-Aware Decoding Strategies for Abstractive Summarization
David Wan
Mengwen Liu
Kathleen McKeown
Markus Dreyer
Joey Tianyi Zhou
HILM
111
32
0
06 Mar 2023
Models See Hallucinations: Evaluating the Factuality in Video Captioning
Models See Hallucinations: Evaluating the Factuality in Video Captioning
Hui Liu
Xiaojun Wan
HILM
37
10
0
06 Mar 2023
Factual Consistency Oriented Speech Recognition
Factual Consistency Oriented Speech Recognition
Naoyuki Kanda
Takuya Yoshioka
Yang Liu
43
0
0
24 Feb 2023
Learning with Rejection for Abstractive Text Summarization
Learning with Rejection for Abstractive Text Summarization
Mengyao Cao
Yue Dong
Jingyi He
Jackie C.K. Cheung
25
8
0
16 Feb 2023
Leveraging Summary Guidance on Medical Report Summarization
Leveraging Summary Guidance on Medical Report Summarization
Yunqi Zhu
Xuebing Yang
Yuanyuan Wu
Wensheng Zhang
33
9
0
08 Feb 2023
Do Multi-Document Summarization Models Synthesize?
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
24
8
0
31 Jan 2023
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form
  Summarization
LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization
Kalpesh Krishna
Erin Bransom
Bailey Kuehl
Mohit Iyyer
Pradeep Dasigi
Arman Cohan
Kyle Lo
22
90
0
30 Jan 2023
MQAG: Multiple-choice Question Answering and Generation for Assessing
  Information Consistency in Summarization
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization
Potsawee Manakul
Adian Liusie
Mark Gales
HILM
21
36
0
28 Jan 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
Kung-Hsiang Huang
Siffi Singh
Xiaofei Ma
Wei Xiao
Wei Xiao
Nicholas Dingwall
William Yang Wang
Kathleen McKeown
HILM
40
13
0
25 Jan 2023
On the State of German (Abstractive) Text Summarization
On the State of German (Abstractive) Text Summarization
Dennis Aumiller
Jing Fan
Michael Gertz
28
1
0
17 Jan 2023
Active Learning for Abstractive Text Summarization
Active Learning for Abstractive Text Summarization
Akim Tsvigun
Ivan Lysenko
Danila Sedashov
Ivan Lazichny
Eldar Damirov
...
Leonid Sanochkin
Maxim Panov
Alexander Panchenko
Andrey Kravchenko
Artem Shelmanov
27
11
0
09 Jan 2023
A Survey on Knowledge-Enhanced Pre-trained Language Models
A Survey on Knowledge-Enhanced Pre-trained Language Models
Chaoqi Zhen
Yanlei Shang
Xiangyu Liu
Yifei Li
Yong Chen
Dell Zhang
VLM
KELM
32
3
0
27 Dec 2022
PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and
  Entailment Recognition
PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment Recognition
Sihao Chen
S. Buthpitiya
Alex Fabrikant
Dan Roth
Tal Schuster
24
23
0
21 Dec 2022
mFACE: Multilingual Summarization with Factual Consistency Evaluation
mFACE: Multilingual Summarization with Factual Consistency Evaluation
Roee Aharoni
Shashi Narayan
Joshua Maynez
Jonathan Herzig
Elizabeth Clark
Mirella Lapata
HILM
27
44
0
20 Dec 2022
Toward Human-Like Evaluation for Natural Language Generation with Error
  Analysis
Toward Human-Like Evaluation for Natural Language Generation with Error Analysis
Qingyu Lu
Liang Ding
Liping Xie
Kanjian Zhang
Derek F. Wong
Dacheng Tao
ELM
ALM
36
14
0
20 Dec 2022
Previous
123456...8910
Next