Survey of Hallucination in Natural Language Generation

8 February 2022
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
Yan Xu
Etsuko Ishii
Yejin Bang
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
    HILM LRM

Papers citing "Survey of Hallucination in Natural Language Generation"

50 / 1,118 papers shown
Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data
Nahema Marchal
Rachel Xu
Rasmi Elasmar
Iason Gabriel
Beth Goldberg
William S. Isaac
LLMAG
70
19
0
19 Jun 2024
Towards Minimal Targeted Updates of Language Models with Targeted Negative Training
Lily H. Zhang
Rajesh Ranganath
Arya Tafvizi
117
1
0
19 Jun 2024
BeHonest: Benchmarking Honesty in Large Language Models
Steffi Chern
Zhulin Hu
Yuqing Yang
Ethan Chern
Yuan Guo
Jiahe Jin
Binjie Wang
Pengfei Liu
HILM ALM
143
6
0
19 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
85
1
0
19 Jun 2024
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
Zhepei Wei
Wei-Lin Chen
Yu Meng
RALM
169
29
0
19 Jun 2024
MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts
Guanjie Chen
Xinyu Zhao
Tianlong Chen
Yu Cheng
MoE
116
5
0
17 Jun 2024
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models
Haopeng Zhang
Philip S. Yu
Jiawei Zhang
143
27
0
17 Jun 2024
Semantic Membership Inference Attack against Large Language Models
Hamid Mozaffari
Virendra J. Marathe
MIALM
112
4
0
14 Jun 2024
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models
Jiawei Chen
Dingkang Yang
Tong Wu
Yue Jiang
Xiaolu Hou
Mingcheng Li
Shunli Wang
Dongling Xiao
Ke Li
Lihua Zhang
LM&MA VLM
85
22
0
14 Jun 2024
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
Yongkang Liu
Ercong Nie
Shi Feng
Zheng Hua
Zifeng Ding
Daling Wang
Yifei Zhang
Hinrich Schütze
86
2
0
14 Jun 2024
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
A. B. M. A. Rahman
Saeed Anwar
Muhammad Usman
Ajmal Mian
HILM
78
3
0
13 Jun 2024
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions
Xu Zhang
Xunjian Yin
Xiaojun Wan
81
3
0
13 Jun 2024
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework
Ruibo Tu
Zineb Senane
Lele Cao
Cheng Zhang
Hedvig Kjellström
G. Henter
CML
98
4
0
12 Jun 2024
We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs
Joseph Spracklen
Raveen Wijewickrama
A. H. M. N. Sakib
Anindya Maiti
Murtuza Jadliwala
170
13
0
12 Jun 2024
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation
Bairu Hou
Yang Zhang
Jacob Andreas
Shiyu Chang
170
7
0
11 Jun 2024
AutoSurvey: Large Language Models Can Automatically Write Surveys
Yidong Wang
Qi Guo
Wenjin Yao
Hongbo Zhang
Xin Zhang
...
Hao Fei
Qingsong Wen
Wei Ye
Shikun Zhang
Yue Zhang
LM&MA
93
33
0
10 Jun 2024
CRAG -- Comprehensive RAG Benchmark
Xiao Yang
Kai Sun
Hao Xin
Yushi Sun
Nikita Bhalla
...
Nirav Shah
Rakesh Wanga
Anuj Kumar
Wen-tau Yih
Xin Luna Dong
92
32
0
07 Jun 2024
The Reasonable Person Standard for AI
Sunayana Rane
26
0
0
07 Jun 2024
LinkGPT: Teaching Large Language Models To Predict Missing Links
Zhongmou He
Jing Zhu
Shengyi Qian
Joyce Chai
Danai Koutra
LRM
81
2
0
07 Jun 2024
DICE: Detecting In-distribution Contamination in LLM's Fine-tuning Phase for Math Reasoning
Shangqing Tu
Kejian Zhu
Yushi Bai
Zijun Yao
Lei Hou
Juanzi Li
107
7
0
06 Jun 2024
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches
Bhashithe Abeysinghe
Ruhan Circi
ELM
119
23
0
05 Jun 2024
The Task-oriented Queries Benchmark (ToQB)
Keun Soo Yim
82
1
0
05 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
135
0
0
04 Jun 2024
Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities
G. Farnadi
Mohammad Havaei
Negar Rostamzadeh
82
3
0
03 Jun 2024
Graph Neural Network Enhanced Retrieval for Question Answering of LLMs
Zijian Li
Qingyan Guo
Jiawei Shao
Lei Song
Jiang Bian
Jun Zhang
Rui Wang
RALM
70
13
0
03 Jun 2024
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs
Fatemeh Shiri
Van Nguyen
Farhad Moghimifar
John Yoo
Gholamreza Haffari
Yuan-Fang Li
ReLM
130
6
0
03 Jun 2024
BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Jiaqi Xue
Meng Zheng
Yebowen Hu
Fei Liu
Xun Chen
Qian Lou
AAML SILM
100
38
0
03 Jun 2024
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
Tianci Liu
Haoyu Wang
Shiyang Wang
Yu Cheng
Jing Gao
ALM
86
1
0
01 Jun 2024
Query2CAD: Generating CAD models using natural language queries
Akshay Badagabettu
Sai Sravan Yarlagadda
A. Farimani
81
15
0
31 May 2024
Designing an Evaluation Framework for Large Language Models in Astronomy Research
John F. Wu
Alina Hyk
Kiera McCormick
Christine Ye
Simone Astarita
...
M. Ntampaka
Charlie O'Neill
J. Peek
Sanjib Sharma
Mikaeel Yunus
71
1
0
30 May 2024
MotionLLM: Understanding Human Behaviors from Human Motions and Videos
Ling-Hao Chen
Shunlin Lu
Ailing Zeng
Hao Zhang
Benyou Wang
Ruimao Zhang
Lei Zhang
120
38
0
30 May 2024
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools
Varun Magesh
Faiz Surani
Matthew Dahl
Mirac Suzgun
Christopher D. Manning
Daniel E. Ho
HILM ELM AILaw
77
80
0
30 May 2024
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Alexander Nikitin
Jannik Kossen
Yarin Gal
Pekka Marttinen
UQCV
138
45
0
30 May 2024
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach
Ernesto Quevedo
Jorge Yero
Rachel Koerner
Pablo Rivas
Tomas Cerny
HILM
81
18
0
30 May 2024
Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding
Shenghuan Sun
Gregory M. Goldgof
Alexander Schubert
Zhiqing Sun
Thomas Hartvigsen
A. Butte
Ahmed Alaa
LM&MA
80
4
0
29 May 2024
Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models
V. Venkatasubramanian
Arijit Chakraborty
64
13
0
29 May 2024
CtrlA: Adaptive Retrieval-Augmented Generation via Probe-Guided Control
Huanshuo Liu
Hao Zhang
Zhijiang Guo
Kuicai Dong
Xiangyang Li
Yi Quan Lee
Cong Zhang
Yong Liu
3DV
94
1
0
29 May 2024
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Ziyang Wang
Shoubin Yu
Elias Stengel-Eskin
Jaehong Yoon
Feng Cheng
Gedas Bertasius
Mohit Bansal
153
70
0
29 May 2024
Tool Learning with Large Language Models: A Survey
Changle Qu
Sunhao Dai
Xiaochi Wei
Hengyi Cai
Shuaiqiang Wang
D. Yin
Jun Xu
Jirong Wen
LLMAG
105
107
0
28 May 2024
Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools
Daniel Buschek
147
15
0
27 May 2024
Perturbation-Restrained Sequential Model Editing
Junjie Ma
Hong Wang
Haoyang Xu
Zhen-Hua Ling
Jia-Chen Gu
KELM
173
11
0
27 May 2024
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection
Yun Zhu
Jia-Chen Gu
Caitlin Sikora
Ho Ko
Yinxiao Liu
...
Lei Shu
Liangchen Luo
Lei Meng
Bang Liu
Jindong Chen
RALM
99
19
0
25 May 2024
LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots
Ruoyu Wang
Zhipeng Yang
Zinan Zhao
Xinyan Tong
Zhi Hong
Kun Qian
LM&Ro LLMAG
54
11
0
24 May 2024
Are Long-LLMs A Necessity For Long-Context Tasks?
Hongjin Qian
Zheng Liu
Peitian Zhang
Kelong Mao
Yujia Zhou
Xu Chen
Zhicheng Dou
71
13
0
24 May 2024
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
Beitao Chen
Xinyu Lyu
Lianli Gao
Jingkuan Song
Hengtao Shen
MLLM
189
12
0
24 May 2024
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models
Peng Wang
Zexi Li
Ningyu Zhang
Ziwen Xu
Yunzhi Yao
Yong Jiang
Pengjun Xie
Fei Huang
Huajun Chen
KELM CLL
127
34
0
23 May 2024
How do Observable Users Decompose D3 Code? An Exploratory Study
Melissa Lin
Heer Patel
Medina Lamkin
Tukey Tu
Hannah K. Bako
Soham Raut
Leilani Battle
84
0
0
23 May 2024
Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority
Alice Li
Luanne Sinnamon
48
4
0
22 May 2024
Just rephrase it! Uncertainty estimation in closed-source language models via multiple rephrased queries
Adam Yang
Chen Chen
Konstantinos Pitas
63
12
0
22 May 2024
Stochastic Online Conformal Prediction with Semi-Bandit Feedback
Haosen Ge
Hamsa Bastani
Osbert Bastani
224
2
0
22 May 2024