Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.03629
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Survey of Hallucination in Natural Language Generation
8 February 2022
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
Yan Xu
Etsuko Ishii
Yejin Bang
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Survey of Hallucination in Natural Language Generation"
50 / 337 papers shown
Title
Recommender Systems in the Era of Large Language Models (LLMs)
Zihuai Zhao
Wenqi Fan
Jiatong Li
Yunqing Liu
Xiaowei Mei
...
Zhen Wen
Fei Wang
Xiangyu Zhao
Jiliang Tang
Qing Li
KELM
171
348
0
05 Jul 2023
Still No Lie Detector for Language Models: Probing Empirical and Conceptual Roadblocks
B. Levinstein
Daniel A. Herrmann
102
61
0
30 Jun 2023
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
140
247
0
27 Jun 2023
System-Level Natural Language Feedback
Weizhe Yuan
Kyunghyun Cho
Jason Weston
115
5
0
23 Jun 2023
I See Dead People: Gray-Box Adversarial Attack on Image-To-Text Models
Raz Lapid
Moshe Sipper
AAML
110
17
0
13 Jun 2023
Exploring the Responses of Large Language Models to Beginner Programmers' Help Requests
Arto Hellas
Juho Leinonen
Sami Sarsa
Charles Koutcheme
Lilja Kujanpää
Juha Sorva
AI4Ed
69
114
0
09 Jun 2023
Long-form analogies generated by chatGPT lack human-like psycholinguistic properties
S. M. Seals
V. Shalin
50
12
0
07 Jun 2023
Benchmarking Foundation Models with Language-Model-as-an-Examiner
Yushi Bai
Jiahao Ying
Yixin Cao
Xin Lv
Yuze He
...
Yijia Xiao
Haozhe Lyu
Jiayin Zhang
Juanzi Li
Lei Hou
ALM
ELM
107
149
0
07 Jun 2023
Deductive Verification of Chain-of-Thought Reasoning
Z. Ling
Yunhao Fang
Xuanlin Li
Zhiao Huang
Mingu Lee
Roland Memisevic
Hao Su
ReLM
LRM
111
136
0
06 Jun 2023
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views
Paul D. Yoo
Jiaxian Guo
Yutaka Matsuo
S. Gu
110
24
0
06 Jun 2023
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction
Rose E. Wang
Dorottya Demszky
73
61
0
05 Jun 2023
Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference
Yan Xu
Deqian Kong
Dehong Xu
Ziwei Ji
Bo Pang
Pascale Fung
Yingting Wu
88
7
0
01 Jun 2023
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
Sreyan Ghosh
Utkarsh Tyagi
Manan Suri
Sonal Kumar
S. Ramaneswaran
Dinesh Manocha
73
16
0
01 Jun 2023
Graph Reasoning for Question Answering with Triplet Retrieval
Shiyang Li
Yifan Gao
Hao Jiang
Qingyu Yin
Zheng Li
Xifeng Yan
Chao Zhang
Bing Yin
RALM
ReLM
93
35
0
30 May 2023
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation
Niels Mündler
Jingxuan He
Slobodan Jenko
Martin Vechev
HILM
70
119
0
25 May 2023
Lawyer LLaMA Technical Report
Quzhe Huang
Mingxu Tao
Chen Zhang
Zhenwei An
Cong Jiang
Zhibin Chen
Zirui Wu
Yansong Feng
ELM
ALM
AILaw
131
53
0
24 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA
HILM
159
357
0
24 May 2023
Does ChatGPT have Theory of Mind?
B. Holterman
Kees van Deemter
LRM
AI4CE
81
24
0
23 May 2023
Revealing User Familiarity Bias in Task-Oriented Dialogue via Interactive Evaluation
Takyoung Kim
Jamin Shin
Young-Ho Kim
Sanghwan Bae
Sungdong Kim
94
1
0
23 May 2023
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
Alfonso Amayuelas
Kyle Wong
Liangming Pan
Wenhu Chen
Wenjie Wang
100
29
0
23 May 2023
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Elizabeth Clark
Shruti Rijhwani
Sebastian Gehrmann
Joshua Maynez
Roee Aharoni
Vitaly Nikolaev
Thibault Sellam
Aditya Siddhant
Dipanjan Das
Ankur P. Parikh
95
41
0
22 May 2023
Paragraph-level Citation Recommendation based on Topic Sentences as Queries
Zoran Medic
Jan Snajder
3DV
55
1
0
20 May 2023
AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation
V. Murali
C. Maddila
Imad Ahmad
Michael Bolin
Daniel Cheng
Negar Ghorbani
Renuka Fernandez
Nachiappan Nagappan
Peter C. Rigby
85
14
0
20 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
148
398
0
19 May 2023
Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks
Moritz Plenz
Juri Opitz
Philipp Heinisch
Philipp Cimiano
Anette Frank
95
9
0
15 May 2023
Synergistic Interplay between Search and Large Language Models for Information Retrieval
Jiazhan Feng
Chongyang Tao
Xiubo Geng
Tao Shen
Can Xu
Guodong Long
Dongyan Zhao
Daxin Jiang
KELM
129
6
0
12 May 2023
The Current State of Summarization
Fabian Retkowski
78
6
0
08 May 2023
Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns
Ning Bian
Hongyu Lin
Peilin Liu
Yaojie Lu
Chunkang Zhang
Xianpei Han
Xianpei Han
Le Sun
78
14
0
08 May 2023
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process
Anna Glazkova
Zongjie Li
Michael Kadantsev
Maksim Glazkov
KELM
86
14
0
04 May 2023
Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables
Matthias Urban
Carsten Binnig
71
5
0
26 Apr 2023
The Internal State of an LLM Knows When It's Lying
A. Azaria
Tom Michael Mitchell
HILM
353
348
0
26 Apr 2023
(Vector) Space is Not the Final Frontier: Product Search as Program Synthesis
Jacopo Tagliabue
C. Greco
88
2
0
22 Apr 2023
Improving Patient Pre-screening for Clinical Trials: Assisting Physicians with Large Language Models
D. Hamer
P. Schoor
T. Polak
Daniel Kapitan
LRM
LM&MA
66
15
0
14 Apr 2023
Galactic ChitChat: Using Large Language Models to Converse with Astronomy Literature
I. Ciucă
Y. Ting 丁
59
6
0
12 Apr 2023
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELM
AI4MH
HILM
109
73
0
11 Apr 2023
Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learning
J. H. Caufield
Harshad B. Hegde
Vincent Emonet
N. Harris
marcin p. joachimiak
...
Sierra A T Moxon
Justin P Reese
M. Haendel
Peter N. Robinson
Christopher J. Mungall
101
89
0
05 Apr 2023
LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models
Patrik Puchert
Poonam Poonam
Christian van Onzenoodt
Timo Ropinski
64
9
0
02 Apr 2023
Elastic Weight Removal for Faithful and Abstractive Dialogue Generation
Nico Daheim
Nouha Dziri
Mrinmaya Sachan
Iryna Gurevych
Edoardo Ponti
MoMe
108
31
0
30 Mar 2023
Hallucinations in Large Multilingual Translation Models
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLM
HILM
LRM
195
154
0
28 Mar 2023
Who's in Charge? Roles and Responsibilities of Decision-Making Components in Conversational Robots
Pierre Lison
C. Kennington
57
3
0
15 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
93
47
0
10 Mar 2023
Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
Hannah Rose Kirk
Bertie Vidgen
Paul Röttger
Scott A. Hale
106
107
0
09 Mar 2023
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation
Jing Zhang
Yanling Wang
Daniel Zhang-Li
Jifan Yu
Zijun Yao
...
Xiaohan Zhang
Nianyi Lin
Sunrui Lu
Juan Li
Jie Tang
81
19
0
28 Feb 2023
CARE: Collaborative AI-Assisted Reading Environment
Dennis Zyska
Nils Dycke
Jan Buchmann
Ilia Kuznetsov
Iryna Gurevych
67
6
0
24 Feb 2023
Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness
Guido Zuccon
Bevan Koopman
KELM
AI4MH
MedIm
65
41
0
23 Feb 2023
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
212
16
0
17 Feb 2023
Towards Few-Shot Identification of Morality Frames using In-Context Learning
Shamik Roy
Nishanth Nakshatri
Dan Goldwasser
92
11
0
03 Feb 2023
ExClaim: Explainable Neural Claim Verification Using Rationalization
Sai Gurrapu
Lifu Huang
Feras A. Batarseh
AAML
94
9
0
21 Jan 2023
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection
Weijia Xu
Sweta Agrawal
Eleftheria Briakou
Marianna J. Martindale
Marine Carpuat
HILM
74
49
0
18 Jan 2023
GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities
Jillian Bommarito
M. Bommarito
Daniel Martin Katz
Jessica Katz
ELM
73
55
0
11 Jan 2023
Previous
1
2
3
4
5
6
7
Next