Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.04328
Cited By
OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions
6 October 2024
Yu-Shin Huang
Peter Just
Krishna Narayanan
Chao Tian
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions"
6 / 6 papers shown
Title
Distillation Robustifies Unlearning
Bruce W. Lee
Addie Foote
Alex Infanger
Leni Shor
Harish Kamath
Jacob Goldman-Wetzler
Bryce Woodworth
Alex Cloud
Alexander Matt Turner
MU
57
0
0
06 Jun 2025
Modular Training of Neural Networks aids Interpretability
Satvik Golechha
Maheep Chaudhary
Joan Velja
Alessandro Abate
Nandi Schoots
153
0
0
04 Feb 2025
Physics of Skill Learning
Ziming Liu
Yizhou Liu
Eric J. Michaud
Jeff Gore
Max Tegmark
119
2
0
21 Jan 2025
An Adversarial Perspective on Machine Unlearning for AI Safety
Jakub Łucki
Boyi Wei
Yangsibo Huang
Peter Henderson
F. Tramèr
Javier Rando
MU
AAML
206
53
0
26 Sep 2024
Tamper-Resistant Safeguards for Open-Weight LLMs
Rishub Tamirisa
Bhrugu Bharathi
Long Phan
Andy Zhou
Alice Gatti
...
Andy Zou
Dawn Song
Bo Li
Dan Hendrycks
Mantas Mazeika
AAML
MU
133
63
0
01 Aug 2024
LEACE: Perfect linear concept erasure in closed form
Nora Belrose
David Schneider-Joseph
Shauli Ravfogel
Ryan Cotterell
Edward Raff
Stella Biderman
KELM
MU
182
120
0
06 Jun 2023
1