Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.14552
Cited By
Sources of Hallucination by Large Language Models on Inference Tasks
23 May 2023
Nick McKenna
Tianyi Li
Liang Cheng
Mohammad Javad Hosseini
Mark Johnson
Mark Steedman
LRM
HILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sources of Hallucination by Large Language Models on Inference Tasks"
50 / 119 papers shown
Title
Towards Contamination Resistant Benchmarks
Rahmatullah Musawi
Sheng Lu
42
0
0
13 May 2025
Bridging AI and Carbon Capture: A Dataset for LLMs in Ionic Liquids and CBE Research
Gaurab Sarkar
Sougata Saha
30
0
0
11 May 2025
Osiris: A Lightweight Open-Source Hallucination Detection System
Alex Shan
John Bauer
Christopher D. Manning
HILM
VLM
50
0
0
07 May 2025
Multimodal Large Language Models for Medicine: A Comprehensive Survey
Jiarui Ye
Hao Tang
LM&MA
89
0
0
29 Apr 2025
Sparks of Science: Hypothesis Generation Using Structured Paper Data
Charles OÑeill
Tirthankar Ghosal
Roberta Răileanu
Mike Walmsley
Thang Bui
Kevin Schawinski
I. Ciucă
LRM
56
0
0
17 Apr 2025
Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs
Yingjian Chen
Feiyang Li
Xingyu Song
Tianxiao Li
Zixin Xu
Xiujie Chen
Issey Sukeda
Irene Z Li
28
0
0
15 Apr 2025
Model-Agnostic Policy Explanations with Large Language Models
Zhang Xi-Jia
Yue (Sophie) Guo
Shufei Chen
Simon Stepputtis
Matthew C. Gombolay
Katia P. Sycara
Joseph Campbell
LM&Ro
LRM
57
0
0
08 Apr 2025
Can ChatGPT Learn My Life From a Week of First-Person Video?
Keegan Harris
23
0
0
04 Apr 2025
GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments
Enjun Du
Xunkai Li
Tian Jin
Zhihan Zhang
R. Li
Guoren Wang
40
0
0
01 Apr 2025
The Mind in the Machine: A Survey of Incorporating Psychological Theories in LLMs
Zizhou Liu
Ziwei Gong
Lin Ai
Zheng Hui
Run Chen
Colin Wayne Leach
Michelle R. Greene
Julia Hirschberg
LLMAG
156
0
0
28 Mar 2025
Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models
Nariman Naderi
Seyed Amir Ahmad Safavi-Naini
Thomas Savage
Zahra Atf
Peter Lewis
Girish Nadkarni
Ali Soroush
ELM
94
1
0
24 Mar 2025
Neutralizing Bias in LLM Reasoning using Entailment Graphs
Liang Cheng
Tianyi Li
Zhaowei Wang
Tianyang Liu
Mark Steedman
41
0
0
14 Mar 2025
HalluVerse25: Fine-grained Multilingual Benchmark Dataset for LLM Hallucinations
Samir Abdaljalil
Hasan Kurban
Erchin Serpedin
HILM
64
0
0
10 Mar 2025
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
Jingyi Zheng
Junfeng Wang
Zhen Sun
Wenhan Dong
Yule Liu
Xinlei He
AAML
50
0
0
10 Mar 2025
GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking
Yingjian Chen
Haoran Liu
Yinhong Liu
Rui Yang
Han Yuan
Yanran Fu
Pengyuan Zhou
Qingyu Chen
James Caverlee
Irene Z Li
HILM
50
0
0
23 Feb 2025
What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis
Peiran Wang
Yang Liu
Yunfei Lu
Jue Hong
Ye Wu
HILM
LRM
77
0
0
20 Feb 2025
Delta - Contrastive Decoding Mitigates Text Hallucinations in Large Language Models
Cheng Peng Huang
Hao-Yuan Chen
HILM
66
0
0
09 Feb 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
93
154
0
28 Jan 2025
Episodic Memories Generation and Evaluation Benchmark for Large Language Models
Alexis Huet
Zied Ben-Houidi
Dario Rossi
LLMAG
56
0
0
21 Jan 2025
Reasoning-Oriented and Analogy-Based Methods for Locating and Editing in Zero-Shot Event-Relational Reasoning
Jingyao Tang
Lishuang Li
Liteng Mi
Haiming Wu
Hongbin Lu
KELM
34
0
0
03 Jan 2025
How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons
Michael Pichat
William Pogrund
Armanush Gasparian
Paloma Pichat
Samuel Demarchi
Michael Veillet-Guillem
42
3
0
26 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
77
0
0
10 Dec 2024
A Novel Approach to Eliminating Hallucinations in Large Language Model-Assisted Causal Discovery
Grace Sng
Yanming Zhang
Klaus Mueller
62
0
0
16 Nov 2024
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Moran Yanuka
Assaf Ben-Kish
Yonatan Bitton
Idan Szpektor
Raja Giryes
VLM
47
2
0
13 Nov 2024
VERITAS: A Unified Approach to Reliability Evaluation
Rajkumar Ramamurthy
Meghana Arakkal Rajeev
Oliver Molenschot
James Zou
Nazneen Rajani
HILM
52
1
0
05 Nov 2024
GraphAide: Advanced Graph-Assisted Query and Reasoning System
Sumit Purohit
George Chin
Patrick S Mackey
Joseph A Cottam
39
0
0
29 Oct 2024
Task Calibration: Calibrating Large Language Models on Inference Tasks
Yingjie Li
Yun Luo
Xiaotian Xie
Yue Zhang
LRM
21
0
0
24 Oct 2024
Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Yiming Chen
Xianghu Yue
Xiaoxue Gao
Chen Zhang
L. F. D’Haro
R. Tan
Haizhou Li
AuLLM
32
0
0
27 Sep 2024
Enhancing Guardrails for Safe and Secure Healthcare AI
Ananya Gangavarapu
26
0
0
25 Sep 2024
Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations
Abe Bohan Hou
William Jurayj
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELM
28
0
0
16 Sep 2024
ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs
Hua Shen
Tiffany Knearem
Reshmi Ghosh
Yu-Ju Yang
Tanushree Mitra
Yun Huang
Yun Huang
64
0
0
15 Sep 2024
AI-LieDar: Examine the Trade-off Between Utility and Truthfulness in LLM Agents
Zhe Su
Xuhui Zhou
Sanketh Rangreji
Anubha Kabra
Julia Mendelsohn
Faeze Brahman
Maarten Sap
LLMAG
106
3
0
13 Sep 2024
Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding
Xiaoyu Liang
Jiayuan Yu
Lianrui Mu
Jiedong Zhuang
Jiaqi Hu
Yuchen Yang
Jiangnan Ye
Lu Lu
Jian Chen
Haoji Hu
VLM
45
2
0
10 Sep 2024
Explicit Inductive Inference using Large Language Models
Tianyang Liu
Tianyi Li
Liang Cheng
Mark Steedman
32
0
0
26 Aug 2024
OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context
Steffen Kleinle
Jakob Prange
Annemarie Friedrich
RALM
35
0
0
22 Jul 2024
Look Within, Why LLMs Hallucinate: A Causal Perspective
He Li
Haoang Chi
Mingyu Liu
Wenjing Yang
LRM
37
5
0
14 Jul 2024
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILM
ELM
AILaw
39
12
0
10 Jul 2024
The Need for Guardrails with Large Language Models in Medical Safety-Critical Settings: An Artificial Intelligence Application in the Pharmacovigilance Ecosystem
Joe B Hakim
Jeffery L Painter
D. Ramcharran
V. Kara
Greg Powell
Paulina Sobczak
Chiho Sato
Andrew Bate
Andrew Beam
39
2
0
01 Jul 2024
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
Siyuan Wu
Yue Huang
Chujie Gao
Dongping Chen
Qihui Zhang
...
Tianyi Zhou
Xiangliang Zhang
Jianfeng Gao
Chaowei Xiao
Lichao Sun
SyDa
38
22
0
27 Jun 2024
INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness
Hung Le
Yingbo Zhou
Caiming Xiong
Silvio Savarese
Doyen Sahoo
52
2
0
23 Jun 2024
Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination
Jongyoon Song
Sangwon Yu
Sungroh Yoon
HILM
38
3
0
20 Jun 2024
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors
Ying Zhou
Ben He
Le Sun
DeLMO
36
1
0
13 Jun 2024
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost
Masha Belyi
Robert Friel
Shuai Shao
Atindriyo Sanyal
HILM
RALM
64
5
0
03 Jun 2024
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Tianyang Xu
Shujin Wu
Shizhe Diao
Xiaoze Liu
Xingyao Wang
Yangyi Chen
Jing Gao
LRM
29
27
0
31 May 2024
Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations
Zilin Ma
Susannah Su
Su
Nathan Zhao
Linn Bieske
...
Boxiang Wang
Jinglun Gao
Zihan Wen
Claude Bruderlein
Weiwei Pan
30
0
0
30 May 2024
Large Language Model Sentinel: LLM Agent for Adversarial Purification
Guang Lin
Qibin Zhao
Qibin Zhao
AAML
56
2
0
24 May 2024
Spectral Editing of Activations for Large Language Model Alignment
Yifu Qiu
Zheng Zhao
Yftah Ziser
Anna Korhonen
E. Ponti
Shay B. Cohen
KELM
LLMSV
28
15
0
15 May 2024
Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval
Mengjia Niu
Hao Li
Jie Shi
Hamed Haddadi
Fan Mo
HILM
51
10
0
10 May 2024
D-NLP at SemEval-2024 Task 2: Evaluating Clinical Inference Capabilities of Large Language Models
Duygu Altinok
AI4MH
LRM
LM&MA
ELM
26
2
0
07 May 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
36
6
0
18 Apr 2024
1
2
3
Next