ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.00661
  4. Cited By
On Faithfulness and Factuality in Abstractive Summarization

On Faithfulness and Factuality in Abstractive Summarization

2 May 2020
Joshua Maynez
Shashi Narayan
Bernd Bohnet
Ryan T. McDonald
    HILM
ArXivPDFHTML

Papers citing "On Faithfulness and Factuality in Abstractive Summarization"

50 / 257 papers shown
Title
Calibrated Language Models Must Hallucinate
Calibrated Language Models Must Hallucinate
Adam Tauman Kalai
Santosh Vempala
HILM
30
76
0
24 Nov 2023
Evaluating Generative Ad Hoc Information Retrieval
Evaluating Generative Ad Hoc Information Retrieval
Lukas Gienapp
Harrisen Scells
Niklas Deckers
Janek Bevendorff
Shuai Wang
...
Maik Fröbe
Guide Zucoon
Benno Stein
Matthias Hagen
Martin Potthast
RALM
49
11
0
08 Nov 2023
Constituency Parsing using LLMs
Constituency Parsing using LLMs
Xuefeng Bai
Jialong Wu
Yulong Chen
Zhongqing Wang
Yue Zhang
41
1
0
30 Oct 2023
EHRTutor: Enhancing Patient Understanding of Discharge Instructions
EHRTutor: Enhancing Patient Understanding of Discharge Instructions
Zihao Zhang
Zonghai Yao
Huixue Zhou
Feiyun Ouyang
Hong-ye Yu
LM&MA
AI4Ed
40
4
0
30 Oct 2023
Theoretically Grounded Loss Functions and Algorithms for Score-Based
  Multi-Class Abstention
Theoretically Grounded Loss Functions and Algorithms for Score-Based Multi-Class Abstention
Anqi Mao
M. Mohri
Yutao Zhong
32
22
0
23 Oct 2023
Right, No Matter Why: AI Fact-checking and AI Authority in
  Health-related Inquiry Settings
Right, No Matter Why: AI Fact-checking and AI Authority in Health-related Inquiry Settings
Elena Sergeeva
Anastasia Sergeeva
Huiyun Tang
Kerstin Bongard-Blanchy
Peter Szolovits
27
1
0
22 Oct 2023
Factored Verification: Detecting and Reducing Hallucination in Summaries
  of Academic Papers
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers
Charlie George
Andreas Stuhlmuller
HILM
28
5
0
16 Oct 2023
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
Sagi Shaier
Lawrence E Hunter
K. Wense
46
3
0
16 Oct 2023
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using
  LLMs
Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs
Chenyang Yang
Rishabh Rustogi
Rachel A. Brower-Sinning
Grace A. Lewis
Christian Kastner
Tongshuang Wu
KELM
40
12
0
14 Oct 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in
  LLM-Generated Reference Letters
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters
Yixin Wan
George Pu
Jiao Sun
Aparna Garimella
Kai-Wei Chang
Nanyun Peng
44
163
0
13 Oct 2023
Calibrating Likelihoods towards Consistency in Summarization Models
Calibrating Likelihoods towards Consistency in Summarization Models
Polina Zablotskaia
Misha Khalman
Rishabh Joshi
Livio Baldini Soares
Shoshana Jakobovits
Joshua Maynez
Shashi Narayan
31
3
0
12 Oct 2023
A New Benchmark and Reverse Validation Method for Passage-level
  Hallucination Detection
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
Shiping Yang
Renliang Sun
Xiao-Yi Wan
HILM
40
41
0
10 Oct 2023
Improving Automatic VQA Evaluation Using Large Language Models
Improving Automatic VQA Evaluation Using Large Language Models
Oscar Manas
Benno Krojer
Aishwarya Agrawal
32
21
0
04 Oct 2023
FELM: Benchmarking Factuality Evaluation of Large Language Models
FELM: Benchmarking Factuality Evaluation of Large Language Models
Shiqi Chen
Yiran Zhao
Jinghan Zhang
Ethan Chern
Siyang Gao
Pengfei Liu
Junxian He
HILM
41
33
0
01 Oct 2023
Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive
  Synthesis using Large Language Models and Satisfiability Solving
Neuro Symbolic Reasoning for Planning: Counterexample Guided Inductive Synthesis using Large Language Models and Satisfiability Solving
Matthias Zeller
Susmit Jha
Patrick Lincoln
Jens Behley
Alvaro Velasquez
Rickard Ewetz
C. Stachniss
LRM
20
7
0
28 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
50
77
0
13 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large
  Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
A. Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
53
523
0
03 Sep 2023
An Examination of the Compositionality of Large Generative
  Vision-Language Models
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
39
2
0
21 Aug 2023
Enhancing Network Management Using Code Generated by Large Language
  Models
Enhancing Network Management Using Code Generated by Large Language Models
Sathiya Kumaran Mani
Yajie Zhou
Kevin Hsieh
Santiago Segarra
Ranveer Chandra
Srikanth Kandula
44
22
0
11 Aug 2023
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative
  AI Tool
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool
Youyang Ng
Daisuke Miyashita
Yasuto Hoshi
Yasuhiro Morioka
Osamu Torii
Tomoya Kodama
J. Deguchi
RALM
15
9
0
08 Aug 2023
Tackling Hallucinations in Neural Chart Summarization
Tackling Hallucinations in Neural Chart Summarization
Saad Obaid ul Islam
Iza vSkrjanec
Ondrej Dusek
Vera Demberg
HILM
42
7
0
01 Aug 2023
Comparing Traditional and LLM-based Search for Consumer Choice: A
  Randomized Experiment
Comparing Traditional and LLM-based Search for Consumer Choice: A Randomized Experiment
S. Spatharioti
David M. Rothschild
D. Goldstein
Jake M. Hofman
36
47
0
07 Jul 2023
Named Entity Inclusion in Abstractive Text Summarization
Named Entity Inclusion in Abstractive Text Summarization
S. Berezin
Tatiana Batura
39
7
0
05 Jul 2023
PatternGPT :A Pattern-Driven Framework for Large Language Model Text
  Generation
PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation
Le Xiao
Xin Shan
23
5
0
02 Jul 2023
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
AI Transparency in the Age of LLMs: A Human-Centered Research Roadmap
Q. V. Liao
J. Vaughan
58
159
0
02 Jun 2023
Knowledge Graph-Augmented Language Models for Knowledge-Grounded
  Dialogue Generation
Knowledge Graph-Augmented Language Models for Knowledge-Grounded Dialogue Generation
Minki Kang
Jin Myung Kwak
Jinheon Baek
Sung Ju Hwang
RALM
16
57
0
30 May 2023
A Critical Evaluation of Evaluations for Long-form Question Answering
A Critical Evaluation of Evaluations for Long-form Question Answering
Fangyuan Xu
Yixiao Song
Mohit Iyyer
Eunsol Choi
ELM
39
97
0
29 May 2023
UMSE: Unified Multi-scenario Summarization Evaluation
UMSE: Unified Multi-scenario Summarization Evaluation
Shen Gao
Zhitao Yao
Chongyang Tao
Preslav Nakov
Pengjie Ren
Zhaochun Ren
Zhumin Chen
37
5
0
26 May 2023
AlignScore: Evaluating Factual Consistency with a Unified Alignment
  Function
AlignScore: Evaluating Factual Consistency with a Unified Alignment Function
Yuheng Zha
Yichi Yang
Ruichen Li
Zhiting Hu
HILM
26
182
0
26 May 2023
Annotating and Detecting Fine-grained Factual Errors for Dialogue
  Summarization
Annotating and Detecting Fine-grained Factual Errors for Dialogue Summarization
Rongxin Zhu
Jianzhong Qi
Jey Han Lau
54
10
0
26 May 2023
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Philippe Laban
Wojciech Kry'sciñski
Divyansh Agarwal
Alexander R. Fabbri
Caiming Xiong
Chenyu You
Chien-Sheng Wu
ALM
HILM
38
33
0
23 May 2023
Evaluating Factual Consistency of Summaries with Large Language Models
Evaluating Factual Consistency of Summaries with Large Language Models
Shiqi Chen
Siyang Gao
Junxian He
ELM
LRM
HILM
37
6
0
23 May 2023
Look-back Decoding for Open-Ended Text Generation
Look-back Decoding for Open-Ended Text Generation
Nan Xu
Chunting Zhou
Asli Celikyilmaz
Xuezhe Ma
31
9
0
22 May 2023
Element-aware Summarization with Large Language Models: Expert-aligned
  Evaluation and Chain-of-Thought Method
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method
Yiming Wang
ZhuoSheng Zhang
Rui Wang
46
79
0
22 May 2023
Evaluating Factual Consistency of Texts with Semantic Role Labeling
Evaluating Factual Consistency of Texts with Semantic Role Labeling
Jing Fan
Dennis Aumiller
Michael Gertz
HILM
39
4
0
22 May 2023
LM vs LM: Detecting Factual Errors via Cross Examination
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen
May Hamri
Mor Geva
Amir Globerson
HILM
41
120
0
22 May 2023
Pointwise Mutual Information Based Metric and Decoding Strategy for
  Faithful Generation in Document Grounded Dialogs
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
Yatin Nandwani
Vineet Kumar
Dinesh Raghu
Sachindra Joshi
Luis A. Lastras
35
6
0
20 May 2023
Counterfactual Debiasing for Generating Factually Consistent Text
  Summaries
Counterfactual Debiasing for Generating Factually Consistent Text Summaries
Chenhe Dong
Yuexiang Xie
Yaliang Li
Ying Shen
CML
HILM
36
0
0
18 May 2023
Zero-shot Faithful Factual Error Correction
Zero-shot Faithful Factual Error Correction
Kung-Hsiang Huang
Hou Pong Chan
Heng Ji
KELM
HILM
32
30
0
13 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying
  Correlates on Long Form Scientific Summarization
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Yingce Xia
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
39
8
0
12 May 2023
Active Retrieval Augmented Generation
Active Retrieval Augmented Generation
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
30
256
0
11 May 2023
Text-Blueprint: An Interactive Platform for Plan-based Conditional
  Generation
Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Fantine Huot
Joshua Maynez
Shashi Narayan
Reinald Kim Amplayo
Kuzman Ganchev
Annie Louis
Anders Sandholm
Dipanjan Das
Mirella Lapata
44
7
0
28 Apr 2023
Learning Human-Human Interactions in Images from Weak Textual
  Supervision
Learning Human-Human Interactions in Images from Weak Textual Supervision
Morris Alper
Hadar Averbuch-Elor
VLM
52
2
0
27 Apr 2023
Why Does ChatGPT Fall Short in Providing Truthful Answers?
Why Does ChatGPT Fall Short in Providing Truthful Answers?
Shen Zheng
Jie Huang
Kevin Chen-Chuan Chang
HILM
AI4MH
32
51
0
20 Apr 2023
Lay Text Summarisation Using Natural Language Processing: A Narrative
  Literature Review
Lay Text Summarisation Using Natural Language Processing: A Narrative Literature Review
Oliver Vinzelberg
M. Jenkins
Gordon Morison
David McMinn
Z. Tieges
37
6
0
24 Mar 2023
cTBLS: Augmenting Large Language Models with Conversational Tables
cTBLS: Augmenting Large Language Models with Conversational Tables
Anirudh S. Sundar
Larry Heck
LMTD
29
8
0
21 Mar 2023
Automatically Summarizing Evidence from Clinical Trials: A Prototype
  Highlighting Current Challenges
Automatically Summarizing Evidence from Clinical Trials: A Prototype Highlighting Current Challenges
S. Ramprasad
Denis Jered McInerney
Iain J. Marshal
Byron C. Wallace
32
9
0
07 Mar 2023
PDSum: Prototype-driven Continuous Summarization of Evolving
  Multi-document Sets Stream
PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream
Susik Yoon
Hou Pong Chan
Jiawei Han
42
7
0
10 Feb 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
Toolformer: Language Models Can Teach Themselves to Use Tools
Timo Schick
Jane Dwivedi-Yu
Roberto Dessì
Roberta Raileanu
Maria Lomeli
Luke Zettlemoyer
Nicola Cancedda
Thomas Scialom
SyDa
RALM
43
1,608
0
09 Feb 2023
Benchmarking Large Language Models for News Summarization
Benchmarking Large Language Models for News Summarization
Tianyi Zhang
Faisal Ladhak
Esin Durmus
Percy Liang
Kathleen McKeown
Tatsunori B. Hashimoto
ELM
43
487
0
31 Jan 2023
Previous
123456
Next