Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.11282
Cited By
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models
15 July 2024
Qingcheng Zeng
Mingyu Jin
Qinkai Yu
Zhenting Wang
Wenyue Hua
Zihao Zhou
Guangyan Sun
Yanda Meng
Shiqing Ma
Qifan Wang
Felix Juefei Xu
Kaize Ding
Fan Yang
Ruixiang Tang
Yongfeng Zhang
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models"
13 / 13 papers shown
Title
Towards Uncertainty Unification: A Case Study for Preference Learning
Shaoting Peng
Haonan Chen
Katherine Driggs-Campbell
56
0
0
25 Mar 2025
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents
Mingyu Jin
Beichen Wang
Zhaoqian Xue
Suiyuan Zhu
Wenyue Hua
Hua Tang
Kai Mei
Jundong Li
Yongfeng Zhang
LM&Ro
LLMAG
87
10
0
03 Jan 2025
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding
Ankur Mallick
Chi Wang
Robert Sim
Subhabrata Mukherjee
Victor Rühle
L. Lakshmanan
Ahmed Hassan Awadallah
88
77
0
22 Apr 2024
Probabilities of Chat LLMs Are Miscalibrated but Still Predict Correctness on Multiple-Choice Q&A
Benjamin Plaut
Khanh Nguyen
Tu Trinh
37
5
0
20 Feb 2024
LLM Performance Predictors are good initializers for Architecture Search
Ganesh Jawahar
Muhammad Abdul-Mageed
L. Lakshmanan
Dujian Ding
LRM
48
19
0
25 Oct 2023
PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models
Hongwei Yao
Jian Lou
Zhan Qin
SILM
AAML
61
30
0
19 Oct 2023
Investigating Uncertainty Calibration of Aligned Language Models under the Multiple-Choice Setting
Guande He
Peng Cui
Jianfei Chen
Wenbo Hu
Jun Zhu
50
11
0
18 Oct 2023
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness
Jiuhai Chen
Jonas W. Mueller
46
56
0
30 Aug 2023
TrojText: Test-time Invisible Textual Trojan Insertion
Qiang Lou
Ye Liu
Bo Feng
37
23
0
03 Mar 2023
BppAttack: Stealthy and Efficient Trojan Attacks against Deep Neural Networks via Image Quantization and Contrastive Adversarial Learning
Zhenting Wang
Juan Zhai
Shiqing Ma
AAML
131
97
0
26 May 2022
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
100
227
0
15 Apr 2021
Clean-Label Backdoor Attacks on Video Recognition Models
Shihao Zhao
Xingjun Ma
Xiang Zheng
James Bailey
Jingjing Chen
Yu-Gang Jiang
AAML
196
274
0
06 Mar 2020
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
276
5,661
0
05 Dec 2016
1