2110.08222
DialFact: A Benchmark for Fact-Checking in Dialogue
15 October 2021
Prakhar Gupta
C. Wu
Wenhao Liu
Caiming Xiong
HILM
Papers citing
"DialFact: A Benchmark for Fact-Checking in Dialogue"
18 / 18 papers shown
Title
Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
Dongxu Zhang
Varun Gangal
B. Lattimer
Yi Yang
35
6
0
07 Jul 2024
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
Yuzhe Gu
Ziwei Ji
Wenwei Zhang
Chengqi Lyu
Dahua Lin
Kai Chen
HILM
36
5
0
05 Jul 2024
A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection
Shiping Yang
Renliang Sun
Xiao-Yi Wan
HILM
34
41
0
10 Oct 2023
ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about
Aman Rangapur
Haoran Wang
AI4MH
36
3
0
06 Apr 2023
Diving Deep into Modes of Fact Hallucinations in Dialogue Systems
Souvik Das
Sougata Saha
R. Srihari
HILM
15
30
0
11 Jan 2023
WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning
Wenhao Wu
Wei Li
Xinyan Xiao
Jiachen Liu
Sujian Li
Yajuan Lv
HILM
26
4
0
20 Dec 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation
Yanyang Li
Jianqiao Zhao
M. Lyu
Liwei Wang
16
15
0
03 Nov 2022
Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation
Max Glockner
Yufang Hou
Iryna Gurevych
OffRL
38
38
0
25 Oct 2022
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
Prakhar Gupta
Cathy Jiao
Yi-Ting Yeh
Shikib Mehri
M. Eskénazi
Jeffrey P. Bigham
ALM
36
47
0
25 May 2022
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation
Prakhar Gupta
Harsh Jhamtani
Jeffrey P. Bigham
49
12
0
19 May 2022
Stretching Sentence-pair NLI Models to Reason over Long Documents and Clusters
Tal Schuster
Sihao Chen
S. Buthpitiya
Alex Fabrikant
Donald Metzler
20
41
0
15 Apr 2022
Survey of Hallucination in Natural Language Generation
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM
LRM
49
2,234
0
08 Feb 2022
Measuring Attribution in Natural Language Generation Models
Hannah Rashkin
Vitaly Nikolaev
Matthew Lamm
Lora Aroyo
Michael Collins
Dipanjan Das
Slav Petrov
Gaurav Singh Tomar
Iulia Turc
David Reitter
30
173
0
23 Dec 2021
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System
Libo Qin
Tianbao Xie
Shijue Huang
Qiguang Chen
Xiao Xu
Wanxiang Che
55
20
0
23 Sep 2021
Overview of the CLEF-2021 CheckThat! Lab on Detecting Check-Worthy Claims, Previously Fact-Checked Claims, and Fake News
Preslav Nakov
Giovanni Da San Martino
Tamer Elsayed
Alberto Barrón-Cedeño
Rubén Míguez
...
Gautam Kishore Shahi
Julia Maria Struß
Thomas Mandl
Mucahid Kutlu
Yavuz Selim Kartal
76
92
0
23 Sep 2021
Internet-Augmented Dialogue Generation
M. Komeili
Kurt Shuster
Jason Weston
RALM
238
280
0
15 Jul 2021
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features
Hannah Rashkin
David Reitter
Gaurav Singh Tomar
Dipanjan Das
167
101
0
14 Jul 2021
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark
Nouha Dziri
Hannah Rashkin
Tal Linzen
David Reitter
ALM
192
79
0
30 Apr 2021