Survey of Hallucination in Natural Language Generation

8 February 2022
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
Yan Xu
Etsuko Ishii
Yejin Bang
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM LRM
arXiv:2202.03629

Papers citing "Survey of Hallucination in Natural Language Generation"

50 / 1,118 papers shown
ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing
Ryan Liu
Nihar B. Shah
ELM
116
76
0
01 Jun 2023
Examining risks of racial biases in NLP tools for child protective services
Anjalie Field
Amanda Coston
Nupoor Gandhi
Alexandra Chouldechova
Emily Putnam-Hornstein
David Steier
Yulia Tsvetkov
92
14
0
30 May 2023
GPT4GEO: How a Language Model Sees the World's Geography
Jonathan Roberts
Timo Lüddecke
Sowmen Das
Kai Han
Samuel Albanie
73
64
0
30 May 2023
GPT Models in Construction Industry: Opportunities, Limitations, and a Use Case Validation
Abdullahi Saka
Ridwan Taiwo
Nurudeen Saka
B. Salami
Saheed Ajayi
Kabiru O. Akande
Hadi Kazemi
92
71
0
30 May 2023
Graph Reasoning for Question Answering with Triplet Retrieval
Shiyang Li
Yifan Gao
Hao Jiang
Qingyu Yin
Zheng Li
Xifeng Yan
Chao Zhang
Bing Yin
RALM ReLM
93
35
0
30 May 2023
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
169
140
0
30 May 2023
Do Language Models Know When They're Hallucinating References?
A. Agrawal
Mirac Suzgun
Lester W. Mackey
Adam Tauman Kalai
HILM LRM
134
100
0
29 May 2023
Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Zehao Wang
Alexander Ku
Jason Baldridge
Thomas Griffiths
Been Kim
UQCV
92
13
0
29 May 2023
GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking
Jiaqi Bai
Hongcheng Guo
Jiaheng Liu
Jian Yang
Xinnian Liang
Zhao Yan
Zhoujun Li
RALM
72
15
0
29 May 2023
Large Language Models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery
Magdalena Wysocka
Oskar Wysocki
Maxime Delmas
V. Mutel
André Freitas
LM&MA
72
6
0
28 May 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model
I-Hung Hsu
Zhiyu Xie
Kuan-Hao Huang
Premkumar Natarajan
Nanyun Peng
64
43
0
26 May 2023
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
Niels Mündler
Jingxuan He
Slobodan Jenko
Martin Vechev
HILM
70
119
0
25 May 2023
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting
Lei Shu
Liangchen Luo
Jayakumar Hoskere
Yun Zhu
Canoee Liu
Simon Tong
Jindong Chen
Lei Meng
KELM LRM
95
51
0
25 May 2023
The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python
Antonio Valerio Miceli Barone
Fazl Barez
Ioannis Konstas
Shay B. Cohen
50
32
0
24 May 2023
Lawyer LLaMA Technical Report
Quzhe Huang
Mingxu Tao
Chen Zhang
Zhenwei An
Cong Jiang
Zhibin Chen
Zirui Wu
Yansong Feng
ELM ALM AILaw
131
55
0
24 May 2023
Mastering the ABCDs of Complex Questions: Answer-Based Claim Decomposition for Fine-grained Self-Evaluation
Nishant Balepur
Jie Huang
Samraj Moorjani
Hari Sundaram
Kevin Chen-Chuan Chang
ReLM
45
0
0
24 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA HILM
165
357
0
24 May 2023
Unraveling ChatGPT: A Critical Analysis of AI-Generated Goal-Oriented Dialogues and Annotations
Tiziano Labruna
Sofia Brenna
Andrea Zaninello
Bernardo Magnini
50
15
0
23 May 2023
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA
David Heineman
Yao Dou
Mounica Maddela
Wei Xu
102
17
0
23 May 2023
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Benjamin Muller
John Wieting
J. Clark
Tom Kwiatkowski
Sebastian Ruder
Livio Baldini Soares
Roee Aharoni
Jonathan Herzig
Xinyi Wang
HILM
85
17
0
23 May 2023
USB: A Unified Summarization Benchmark Across Tasks and Domains
Kundan Krishna
Prakhar Gupta
S. Ramprasad
Byron C. Wallace
Jeffrey P. Bigham
Zachary Chase Lipton
HILM
91
8
0
23 May 2023
Does ChatGPT have Theory of Mind?
B. Holterman
Kees van Deemter
LRM AI4CE
81
23
0
23 May 2023
Make a Choice! Knowledge Base Question Answering with In-Context Learning
Chuanyuan Tan
Yuehe Chen
Wenbiao Shao
Wenliang Chen
50
13
0
23 May 2023
Revealing User Familiarity Bias in Task-Oriented Dialogue via Interactive Evaluation
Takyoung Kim
Jamin Shin
Young-Ho Kim
Sanghwan Bae
Sungdong Kim
94
1
0
23 May 2023
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
Alfonso Amayuelas
Kyle Wong
Liangming Pan
Wenhu Chen
Wenjie Wang
105
29
0
23 May 2023
Towards Legally Enforceable Hate Speech Detection for Public Forums
Chunyan Luo
R. Bhambhoria
Xiao-Dan Zhu
Samuel Dahan
AILaw
71
5
0
23 May 2023
On the Risk of Misinformation Pollution with Large Language Models
Yikang Pan
Liangming Pan
Wenhu Chen
Preslav Nakov
Min-Yen Kan
Wenjie Wang
DeLMO
260
127
0
23 May 2023
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
326
181
0
22 May 2023
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources
Xingxuan Li
Ruochen Zhao
Yew Ken Chia
Bosheng Ding
Shafiq Joty
Soujanya Poria
Lidong Bing
HILM BDL LRM
129
102
0
22 May 2023
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Elizabeth Clark
Shruti Rijhwani
Sebastian Gehrmann
Joshua Maynez
Roee Aharoni
Vitaly Nikolaev
Thibault Sellam
Aditya Siddhant
Dipanjan Das
Ankur P. Parikh
95
41
0
22 May 2023
Evaluating Open-QA Evaluation
Cunxiang Wang
Sirui Cheng
Qipeng Guo
Yuanhao Yue
Bowen Ding
Zhikun Xu
Yidong Wang
Xiangkun Hu
Zheng Zhang
Yue Zhang
ELM
123
33
0
21 May 2023
Gene Set Summarization using Large Language Models
Marcin P. Joachimiak
J. H. Caufield
N. Harris
Hyeongsik Kim
Christopher J. Mungall
113
21
0
21 May 2023
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs
Yatin Nandwani
Vineet Kumar
Dinesh Raghu
Sachindra Joshi
Luis A. Lastras
71
6
0
20 May 2023
Paragraph-level Citation Recommendation based on Topic Sentences as Queries
Zoran Medic
Jan Snajder
3DV
58
1
0
20 May 2023
AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation
V. Murali
C. Maddila
Imad Ahmad
Michael Bolin
Daniel Cheng
Negar Ghorbani
Renuka Fernandez
Nachiappan Nagappan
Peter C. Rigby
85
14
0
20 May 2023
Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding
Augustin Toma
Patrick R. Lawler
Jimmy Ba
Rahul G. Krishnan
Barry Rubin
Bo Wang
LM&MA AI4MH ELM
86
38
0
19 May 2023
Pengi: An Audio Language Model for Audio Tasks
Soham Deshmukh
Benjamin Elizalde
Rita Singh
Huaming Wang
MLLM AuLLM
106
182
0
19 May 2023
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models
Junyi Li
Xiaoxue Cheng
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
HILM VLM
118
254
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM LRM
156
399
0
19 May 2023
Introspective Tips: Large Language Model for In-Context Decision Making
Liting Chen
Lu Wang
Hang Dong
Yali Du
Jie Yan
...
Pu Zhao
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LLMAG LRM
100
28
0
19 May 2023
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
Shibo Hao
Tianyang Liu
Zhen Wang
Zhiting Hu
RALM LLMAG
155
183
0
19 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
106
51
0
19 May 2023
Post Hoc Explanations of Language Models Can Improve Language Models
Satyapriya Krishna
Jiaqi Ma
Dylan Slack
Asma Ghandeharioun
Sameer Singh
Himabindu Lakkaraju
ReLM LRM
90
59
0
19 May 2023
Generalized Multiple Intent Conditioned Slot Filling
Harshil Shah
Arthur Wilcke
Marius Cobzarenco
Cristian C Cobzarenco
Edward Challis
David Barber
54
0
0
18 May 2023
The Web Can Be Your Oyster for Improving Large Language Models
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Jingyuan Wang
Jian-Yun Nie
Ji-Rong Wen
RALM KELM
97
5
0
18 May 2023
Statistical Knowledge Assessment for Large Language Models
Qingxiu Dong
Jingjing Xu
Lingpeng Kong
Zhifang Sui
Lei Li
HILM
63
8
0
17 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM LRM
269
1,214
0
17 May 2023
Evaluating Object Hallucination in Large Vision-Language Models
Yifan Li
Yifan Du
Kun Zhou
Jinpeng Wang
Wayne Xin Zhao
Ji-Rong Wen
MLLM LRM
361
816
0
17 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
137
37
0
17 May 2023
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
H. Khorashadizadeh
Nandana Mihindukulasooriya
Sanju Tiwari
Jinghua Groppe
Sven Groppe
73
23
0
15 May 2023