ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09525
  4. Cited By
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in
  Summarization

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

18 November 2021
Philippe Laban
Tobias Schnabel
Paul N. Bennett
Marti A. Hearst
    HILM
ArXivPDFHTML

Papers citing "SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization"

50 / 87 papers shown
Title
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
Chengwei Qin
Wenxuan Zhou
Karthik Abinav Sankararaman
Nanshu Wang
Tengyu Xu
...
Aditya Tayade
Sinong Wang
Chenyu You
Han Fang
Hao Ma
HILM
LRM
12
0
0
18 May 2025
Towards Automated Situation Awareness: A RAG-Based Framework for Peacebuilding Reports
Towards Automated Situation Awareness: A RAG-Based Framework for Peacebuilding Reports
Poli A. Nemkova
Suleyman O. Polat
Rafid I. Jahan
Sagnik Ray Choudhury
Sun-joo Lee
Shouryadipta Sarkar
Mark V. Albert
17
0
0
14 May 2025
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation
Galann Pennec
Zhengyuan Liu
Nicholas Asher
Philippe Muller
Nancy F. Chen
VGen
31
0
0
10 May 2025
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation
SEval-Ex: A Statement-Level Framework for Explainable Summarization Evaluation
Tanguy Herserant
Vincent Guigue
ELM
45
0
0
04 May 2025
Explanatory Summarization with Discourse-Driven Planning
Explanatory Summarization with Discourse-Driven Planning
Dongqi Liu
Xi Yu
Vera Demberg
Mirella Lapata
55
0
0
27 Apr 2025
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans?
Summarization Metrics for Spanish and Basque: Do Automatic Scores and LLM-Judges Correlate with Humans?
Jeremy Barnes
Naiara Perez
Alba Bonet-Jover
Begoña Altuna
64
1
0
21 Mar 2025
Unequal Opportunities: Examining the Bias in Geographical Recommendations by Large Language Models
Unequal Opportunities: Examining the Bias in Geographical Recommendations by Large Language Models
Shiran Dudy
Thulasi Tholeti
R. Ramachandranpillai
Muhammad Ali
Toby Jia-Jun Li
Ricardo Baeza-Yates
36
0
0
16 Mar 2025
Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks
Introducing Verification Task of Set Consistency with Set-Consistency Energy Networks
Mooho Song
Hyeryung Son
Jay-Yoon Lee
52
0
0
12 Mar 2025
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization
Ryan Barron
Maksim E. Eren
Olga M. Serafimova
Cynthia Matuszek
Boian S. Alexandrov
AILaw
78
0
0
27 Feb 2025
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
Qianqi Yan
Yue Fan
Hongquan Li
Shan Jiang
Yang Zhao
Xinze Guan
Ching-Chen Kuo
Junfeng Fang
VLM
LRM
92
2
0
22 Feb 2025
Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation
Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation
Joy Mahapatra
Soumyajit Roy
Utpal Garain
HILM
ALM
88
0
0
17 Feb 2025
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies
Sunnie S. Y. Kim
J. Vaughan
Q. V. Liao
Tania Lombrozo
Olga Russakovsky
112
5
0
12 Feb 2025
Context-Aware Hierarchical Merging for Long Document Summarization
Context-Aware Hierarchical Merging for Long Document Summarization
Litu Ou
Mirella Lapata
MoMe
274
1
0
03 Feb 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa
HILM
LRM
78
1
0
28 Jan 2025
Learning to Summarize from LLM-generated Feedback
Learning to Summarize from LLM-generated Feedback
Hwanjun Song
Taewon Yun
Yuho Lee
Jihwan Oh
Gihun Lee
Jason (Jinglun) Cai
Hang Su
73
4
0
28 Jan 2025
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Mengna Zhu
Kaisheng Zeng
Mao Wang
Kaiming Xiao
Lei Hou
Hongbin Huang
Juanzi Li
271
1
0
16 Dec 2024
Coverage-based Fairness in Multi-document Summarization
Coverage-based Fairness in Multi-document Summarization
Haoyuan Li
Yusen Zhang
Rui Zhang
Snigdha Chaturvedi
80
0
0
11 Dec 2024
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization
Catarina G. Belem
Pouya Pezeskhpour
Hayate Iso
Seiji Maekawa
Nikita Bhutani
Estevam R. Hruschka
HILM
75
2
0
17 Oct 2024
Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?
Decomposition Dilemmas: Does Claim Decomposition Boost or Burden Fact-Checking Performance?
Qisheng Hu
Quanyu Long
Wenya Wang
200
5
0
17 Oct 2024
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation
Xiaonan Jing
Srinivas Billa
Danny Godbout
HILM
45
0
0
16 Oct 2024
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
Hadas Orgad
Michael Toker
Zorik Gekhman
Roi Reichart
Idan Szpektor
Hadas Kotek
Yonatan Belinkov
HILM
AIFin
61
32
0
03 Oct 2024
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Huajian Zhang
Laura Perez-Beltrachini
HILM
44
0
0
01 Aug 2024
STORYSUMM: Evaluating Faithfulness in Story Summarization
STORYSUMM: Evaluating Faithfulness in Story Summarization
Melanie Subbiah
Faisal Ladhak
Akankshya Mishra
Griffin Adams
Lydia B. Chilton
Kathleen McKeown
50
4
0
09 Jul 2024
Enhancing Hallucination Detection through Perturbation-Based Synthetic
  Data Generation in System Responses
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
Dongxu Zhang
Varun Gangal
B. Lattimer
Yi Yang
40
6
0
07 Jul 2024
Applicability of Large Language Models and Generative Models for Legal
  Case Judgement Summarization
Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization
Aniket Deroy
Kripabandhu Ghosh
Saptarshi Ghosh
ELM
AILaw
53
16
0
06 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language
  Models
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
45
5
0
05 Jul 2024
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Philippe Laban
Alexander R. Fabbri
Caiming Xiong
Chien-Sheng Wu
RALM
53
41
0
01 Jul 2024
FineSurE: Fine-grained Summarization Evaluation using LLMs
FineSurE: Fine-grained Summarization Evaluation using LLMs
Hwanjun Song
Hang Su
Igor Shalyminov
Jason (Jinglun) Cai
Saab Mansour
HILM
41
32
0
01 Jul 2024
Factual Dialogue Summarization via Learning from Large Language Models
Factual Dialogue Summarization via Learning from Large Language Models
Rongxin Zhu
Jey Han Lau
Jianzhong Qi
HILM
57
1
0
20 Jun 2024
ReadCtrl: Personalizing text generation with readability-controlled
  instruction learning
ReadCtrl: Personalizing text generation with readability-controlled instruction learning
Hieu Tran
Zonghai Yao
Lingxi Li
Hong-ye Yu
54
2
0
13 Jun 2024
Can LLMs Solve longer Math Word Problems Better?
Can LLMs Solve longer Math Word Problems Better?
Xin Xu
Tong Xiao
Zitong Chao
Zhenya Huang
Can Yang
Yang Wang
70
12
0
23 May 2024
WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare
  with Large Language Models, Dynamic Expert Selection, and Priming Techniques
  on MIMIC-IV
WisPerMed at "Discharge Me!": Advancing Text Generation in Healthcare with Large Language Models, Dynamic Expert Selection, and Priming Techniques on MIMIC-IV
Hendrik Damm
T. M. G. Pakull
Bahadir Eryilmaz
Helmut Becker
Ahmad Idrissi-Yaghir
Henning Schafer
Sergej Schultenkämper
Christoph M. Friedrich
28
3
0
18 May 2024
Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize
  Hallucinations
Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations
Hassan Shakil
Zeydy Ortiz
Grant C. Forbes
26
3
0
07 May 2024
CASPR: Automated Evaluation Metric for Contrastive Summarization
CASPR: Automated Evaluation Metric for Contrastive Summarization
Nirupan Ananthamurugan
Dat Duong
Philip George
Ankita Gupta
Sandeep Tata
Beliz Gunel
27
0
0
23 Apr 2024
Semi-Supervised Dialogue Abstractive Summarization via High-Quality
  Pseudolabel Selection
Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Jianfeng He
Hang Su
Jason (Jinglun) Cai
Igor Shalyminov
Hwanjun Song
Saab Mansour
39
4
0
06 Mar 2024
German also Hallucinates! Inconsistency Detection in News Summaries with
  the Absinth Dataset
German also Hallucinates! Inconsistency Detection in News Summaries with the Absinth Dataset
Laura Mascarell
Ribin Chalumattu
Annette Rios
HILM
46
0
0
06 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
68
80
0
05 Mar 2024
How Much Annotation is Needed to Compare Summarization Models?
How Much Annotation is Needed to Compare Summarization Models?
Chantal Shaib
Joe Barrow
Alexa F. Siu
Byron C. Wallace
A. Nenkova
59
2
0
28 Feb 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation
  for Diverse Summarisation Tasks
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
36
10
0
27 Feb 2024
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Kundan Krishna
S. Ramprasad
Prakhar Gupta
Byron C. Wallace
Zachary Chase Lipton
Jeffrey P. Bigham
HILM
KELM
SyDa
52
9
0
19 Feb 2024
Can We Verify Step by Step for Incorrect Answer Detection?
Can We Verify Step by Step for Incorrect Answer Detection?
Xin Xu
Shizhe Diao
Can Yang
Yang Wang
LRM
130
14
0
16 Feb 2024
PROXYQA: An Alternative Framework for Evaluating Long-Form Text
  Generation with Large Language Models
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
Haochen Tan
Zhijiang Guo
Zhan Shi
Lu Xu
Zhili Liu
...
Xiaoguang Li
Yasheng Wang
Lifeng Shang
Qun Liu
Linqi Song
48
12
0
26 Jan 2024
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in
  Chart Captioning
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
Kung-Hsiang Huang
Mingyang Zhou
Hou Pong Chan
Yi R. Fung
Zhenhailong Wang
Lingyu Zhang
Shih-Fu Chang
Chenhui Xu
21
33
0
15 Dec 2023
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
Joseph Peper
Wenzhao Qiu
Lu Wang
30
0
0
16 Nov 2023
SEMQA: Semi-Extractive Multi-Source Question Answering
SEMQA: Semi-Extractive Multi-Source Question Answering
Tal Schuster
Á. Lelkes
Haitian Sun
Jai Gupta
Jonathan Berant
W. Cohen
Donald Metzler
38
13
0
08 Nov 2023
Language Models Hallucinate, but May Excel at Fact Verification
Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan
Jesse Dodge
David Wadden
Minlie Huang
Hao Peng
LRM
HILM
40
28
0
23 Oct 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in
  LLM-Generated Reference Letters
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters
Yixin Wan
George Pu
Jiao Sun
Aparna Garimella
Kai-Wei Chang
Nanyun Peng
41
163
0
13 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models
Calibrating Likelihoods towards Consistency in Summarization Models
Polina Zablotskaia
Misha Khalman
Rishabh Joshi
Livio Baldini Soares
Shoshana Jakobovits
Joshua Maynez
Shashi Narayan
31
3
0
12 Oct 2023
STRONG -- Structure Controllable Legal Opinion Summary Generation
STRONG -- Structure Controllable Legal Opinion Summary Generation
Yang Zhong
Diane Litman
ELM
AILaw
32
1
0
29 Sep 2023
DecompEval: Evaluating Generated Texts as Unsupervised Decomposed
  Question Answering
DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering
Pei Ke
Fei Huang
Fei Mi
Yasheng Wang
Qun Liu
Xiaoyan Zhu
Minlie Huang
ReLM
ELM
38
10
0
13 Jul 2023
12
Next