Language Models Hallucinate, but May Excel at Fact Verification

23 October 2023
Jian-Yu Guan
Jesse Dodge
David Wadden
Minlie Huang
Hao Peng
    LRM
    HILM
arXiv: 2310.14564

Papers citing "Language Models Hallucinate, but May Excel at Fact Verification"

28 / 28 papers shown
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs
Yingming Zheng
Xiaoliang Liu
Peng Wu
Li Pan
LRM
38
0
0
21 Apr 2025
A Graph-based Verification Framework for Fact-Checking
Yani Huang
Richong Zhang
Zhijie Nie
J. Chen
Xuefeng Zhang
39
0
0
10 Mar 2025
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets
Preetam Prabhu Srikar Dammu
Himanshu Naidu
Chirag Shah
42
0
0
06 Mar 2025
VilBias: A Study of Bias Detection through Linguistic and Visual Cues, presenting Annotation Strategies, Evaluation, and Key Challenges
Shaina Raza
Caesar Saleh
Emrul Hasan
Franklin Ogidi
Maximus Powers
Veronica Chatrath
Marcelo Lotif
Roya Javadi
Anam Zahid
Vahid Reza Khazaie
76
0
0
20 Feb 2025
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks
Jing Yang
Max Glockner
Anderson de Rezende Rocha
Iryna Gurevych
LRM
73
1
0
07 Feb 2025
FarExStance: Explainable Stance Detection for Farsi
Majid Zarharan
Maryam Hashemi
Malika Behroozrazegh
Sauleh Eetemadi
Mohammad Taher Pilehvar
Jennifer Foster
85
0
0
18 Dec 2024
LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models
Hieu Tran
Junda Wang
Yujan Ting
Weijing Huang
Terrence Chen
HILM
KELM
45
0
0
31 Oct 2024
CrediRAG: Network-Augmented Credibility-Based Retrieval for Misinformation Detection in Reddit
Ashwin Ram
Yigit E. Bayiz
Arash Amini
Mustafa Munir
R. Marculescu
26
0
0
15 Oct 2024
Claim Verification in the Age of Large Language Models: A Survey
A. Dmonte
Roland Oruche
Marcos Zampieri
Prasad Calyam
Isabelle Augenstein
44
8
0
26 Aug 2024
Generative Large Language Models in Automated Fact-Checking: A Survey
Ivan Vykopal
Matúš Pikuliak
Simon Ostermann
Marian Simko
HILM
38
5
0
02 Jul 2024
How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models
Jaeyoung Lee
Ximing Lu
Jack Hessel
Faeze Brahman
Youngjae Yu
Yonatan Bisk
Yejin Choi
Saadia Gabriel
36
3
0
29 Jun 2024
VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation
Yixiao Song
Yekyung Kim
Mohit Iyyer
HILM
34
25
0
27 Jun 2024
REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy
Haw-Shiuan Chang
Nanyun Peng
Mohit Bansal
Anil Ramakrishna
Tagyoung Chung
HILM
42
2
0
11 Jun 2024
Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents
Sneha Singhania
Simon Razniewski
G. Weikum
RALM
34
1
0
04 May 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang
Philippe Laban
Greg Durrett
HILM
SyDa
43
74
0
16 Apr 2024
ClaimVer: Explainable Claim-Level Verification and Evidence Attribution of Text Through Knowledge Graphs
Preetam Prabhu Srikar Dammu
Himanshu Naidu
Mouly Dewan
YoungMin Kim
Tanya Roosta
Aman Chadha
Chirag Shah
46
6
0
12 Mar 2024
Language Models with Conformal Factuality Guarantees
Christopher Mohri
Tatsunori Hashimoto
HILM
39
33
0
15 Feb 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Soumya Sanyal
Tianyi Xiao
Jiacheng Liu
Wenya Wang
Xiang Ren
LRM
ReLM
49
12
0
06 Feb 2024
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
Jian-Yu Guan
Wei Yu Wu
Zujie Wen
Peng Xu
Hongning Wang
Minlie Huang
LRM
23
16
0
02 Feb 2024
LLM-based NLG Evaluation: Current Status and Challenges
Mingqi Gao
Xinyu Hu
Jie Ruan
Xiao Pu
Xiaojun Wan
ELM
LM&MA
60
29
0
02 Feb 2024
Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks
Jiaying Wu
Bryan Hooi
31
54
0
16 Oct 2023
How Language Model Hallucinations Can Snowball
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
82
253
0
22 May 2023
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
367
8,495
0
28 Jan 2022
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation
Tianyu Liu
Yizhe Zhang
Chris Brockett
Yi Mao
Zhifang Sui
Weizhu Chen
W. Dolan
HILM
222
143
0
18 Apr 2021
Entity-level Factual Consistency of Abstractive Text Summarization
Feng Nan
Ramesh Nallapati
Zhiguo Wang
Cicero Nogueira dos Santos
Henghui Zhu
Dejiao Zhang
Kathleen McKeown
Bing Xiang
HILM
144
157
0
18 Feb 2021
Explainable Automated Fact-Checking for Public Health Claims
Neema Kotonya
Francesca Toni
218
248
0
19 Oct 2020
Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints
Zhenyi Wang
Xiaoyang Wang
Bang An
Dong Yu
Changyou Chen
LMTD
168
84
0
03 May 2020
PubMedQA: A Dataset for Biomedical Research Question Answering
Qiao Jin
Bhuwan Dhingra
Zhengping Liu
William W. Cohen
Xinghua Lu
210
812
0
13 Sep 2019