ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.15296
  4. Cited By
UHGEval: Benchmarking the Hallucination of Chinese Large Language Models
  via Unconstrained Generation
v1v2v3 (latest)

UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation

26 November 2023
Xun Liang
Shichao Song
Pengnian Qi
Zhiyu Li
Feiyu Xiong
Simin Niu
Zhaohui Wy
Dawei He
Peng Cheng
Zhonghao Wang
Haiying Deng
    HILM
ArXiv (abs)PDFHTML

Papers citing "UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation"

5 / 5 papers shown
Title
Evaluation of Retrieval-Augmented Generation: A Survey
Evaluation of Retrieval-Augmented Generation: A Survey
Hao Yu
Aoran Gan
Kai Zhang
Shiwei Tong
Qi Liu
Zhaofeng Liu
3DV
136
100
0
13 May 2024
A Survey of Automatic Hallucination Evaluation on Natural Language Generation
A Survey of Automatic Hallucination Evaluation on Natural Language Generation
Siya Qi
Yulan He
Yulan He
Zheng Yuan
LRMHILM
99
1
0
18 Apr 2024
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination
  Tendency of LLMs
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs
Cem Uluoglakci
T. Taşkaya-Temizel
HILM
64
3
0
25 Feb 2024
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented
  Generation of Large Language Models
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
Yuanjie Lyu
Zhiyu Li
Pengnian Qi
Feiyu Xiong
Simin Niu
Wenjin Wang
Hao Wu
Huan Liu
Tong Xu
Enhong Chen
RALM
84
40
0
30 Jan 2024
Baichuan 2: Open Large-scale Language Models
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELMLRM
322
755
0
19 Sep 2023
1