DialFact: A Benchmark for Fact-Checking in Dialogue

DialFact: A Benchmark for Fact-Checking in Dialogue

15 October 2021

Papers citing "DialFact: A Benchmark for Fact-Checking in Dialogue"

18 / 18 papers shown

Title
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses Dongxu Zhang Varun Gangal B. Lattimer Yi Yang 35 6 0 07 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models Yuzhe Gu Ziwei Ji Wenwei Zhang Chengqi Lyu Dahua Lin Kai Chen HILM 36 5 0 05 Jul 2024
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection Shiping Yang Renliang Sun Xiao-Yi Wan HILM 34 41 0 10 Oct 2023
ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about Aman Rangapur Haoran Wang AI4MH 36 3 0 06 Apr 2023
Diving Deep into Modes of Fact Hallucinations in Dialogue Systems Souvik Das Sougata Saha R. Srihari HILM 15 30 0 11 Jan 2023
WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning Wenhao Wu Wei Li Xinyan Xiao Jiachen Liu Sujian Li Yajuan Lv HILM 26 4 0 20 Dec 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation Yanyang Li Jianqiao Zhao M. Lyu Liwei Wang 16 15 0 03 Nov 2022
Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation Max Glockner Yufang Hou Iryna Gurevych OffRL 38 38 0 25 Oct 2022
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning Prakhar Gupta Cathy Jiao Yi-Ting Yeh Shikib Mehri M. Eskénazi Jeffrey P. Bigham ALM 36 47 0 25 May 2022
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation Prakhar Gupta Harsh Jhamtani Jeffrey P. Bigham 49 12 0 19 May 2022
Stretching Sentence-pair NLI Models to Reason over Long Documents and Clusters Tal Schuster Sihao Chen S. Buthpitiya Alex Fabrikant Donald Metzler 20 41 0 15 Apr 2022
Survey of Hallucination in Natural Language Generation Ziwei Ji Nayeon Lee Rita Frieske Tiezheng Yu D. Su ... Delong Chen Wenliang Dai Ho Shu Chan Andrea Madotto Pascale Fung HILM LRM 49 2,234 0 08 Feb 2022
Measuring Attribution in Natural Language Generation Models Hannah Rashkin Vitaly Nikolaev Matthew Lamm Lora Aroyo Michael Collins Dipanjan Das Slav Petrov Gaurav Singh Tomar Iulia Turc David Reitter 30 173 0 23 Dec 2021
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System Libo Qin Tianbao Xie Shijue Huang Qiguang Chen Xiao Xu Wanxiang Che 55 20 0 23 Sep 2021
Overview of the CLEF--2021 CheckThat! Lab on Detecting Check-Worthy Claims, Previously Fact-Checked Claims, and Fake News Preslav Nakov Giovanni Da San Martino Tamer Elsayed Alberto Barrón-Cedeño Rubén Míguez ... Gautam Kishore Shahi Julia Maria Struß Thomas Mandl Mucahid Kutlu Yavuz Selim Kartal 76 92 0 23 Sep 2021
Internet-Augmented Dialogue Generation M. Komeili Kurt Shuster Jason Weston RALM 238 280 0 15 Jul 2021
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features Hannah Rashkin David Reitter Gaurav Singh Tomar Dipanjan Das 167 101 0 14 Jul 2021
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark Nouha Dziri Hannah Rashkin Tal Linzen David Reitter ALM 192 79 0 30 Apr 2021