Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.01452
Cited By
MPCFormer: fast, performant and private Transformer inference with MPC
2 November 2022
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MPCFormer: fast, performant and private Transformer inference with MPC"
50 / 50 papers shown
Title
Private Transformer Inference in MLaaS: A Survey
Yang Li
Xinyu Zhou
Yishuo Wang
Liangxin Qian
Jun Zhao
25
0
0
15 May 2025
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
33
0
0
12 May 2025
CipherPrune: Efficient and Scalable Private Transformer Inference
Yancheng Zhang
Jinbao Xue
Mengxin Zheng
Mimi Xie
Mingzhe Zhang
Lei Jiang
Qian Lou
64
2
0
24 Feb 2025
Encryption-Friendly LLM Architecture
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
57
2
0
24 Feb 2025
HawkEye: Statically and Accurately Profiling the Communication Cost of Models in Multi-party Learning
Wenqiang Ruan
Xin Lin
Ruisheng Zhou
Guopeng Lin
Shui Yu
Weili Han
45
0
0
16 Feb 2025
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference
Wenxuan Zeng
Ye Dong
Jinjin Zhou
Junming Ma
Jin Tan
Runsheng Wang
Meng Li
49
0
0
12 Jan 2025
TruncFormer: Private LLM Inference Using Only Truncations
Patrick Yubeaton
Jianqiao Mo
Karthik Garimella
N. Jha
Brandon Reagen
Chinmay Hegde
Siddharth Garg
76
0
0
02 Dec 2024
Nimbus: Secure and Efficient Two-Party Inference for Transformers
Zhengyi Li
Kang Yang
Jin Tan
Wen-jie Lu
Haoqi Wu
...
Yu Yu
Derun Zhao
Yancheng Zheng
M. Guo
Jingwen Leng
74
2
0
24 Nov 2024
Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives
Vincent Hanke
Tom Blanchard
Franziska Boenisch
Iyiola Emmanuel Olatunji
Michael Backes
Adam Dziedzic
PILM
56
3
0
02 Nov 2024
AERO: Softmax-Only LLMs for Efficient Private Inference
N. Jha
Brandon Reagen
32
1
0
16 Oct 2024
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
Shuzhang Zhong
Wenxuan Zeng
Runsheng Wang
Meng Li
MQ
31
0
0
12 Oct 2024
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
Guanchu Wang
Yu-Neng Chuang
Ruixiang Tang
Shaochen Zhong
Jiayi Yuan
...
Zirui Liu
V. Chaudhary
Shuai Xu
James Caverlee
Xia Hu
PILM
84
1
0
06 Oct 2024
Secure Multiparty Generative AI
Manil Shrestha
Yashodha Ravichandran
Edward Kim
20
0
0
27 Sep 2024
Confidential Prompting: Protecting User Prompts from Cloud LLM Providers
In Gim
Caihua Li
Lin Zhong
52
2
0
27 Sep 2024
CipherDM: Secure Three-Party Inference for Diffusion Model Sampling
Xin Zhao
Xiaojun Chen
Xinyu Chen
He Li
Tingyu Fan
Zhendong Zhao
36
1
0
09 Sep 2024
MPC-Minimized Secure LLM Inference
Deevashwer Rathee
Dacheng Li
Ion Stoica
Hao Zhang
Raluca A. Popa
47
1
0
07 Aug 2024
Low-Latency Privacy-Preserving Deep Learning Design via Secure MPC
Ke Lin
Yasir Glani
Ping Luo
21
0
0
24 Jul 2024
ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets
Ahmed Frikha
Nassim Walha
Ricardo Mendes
K. K. Nakka
Xue Jiang
Xuebing Zhou
74
2
0
03 Jul 2024
Unique Security and Privacy Threats of Large Language Model: A Comprehensive Survey
Shang Wang
Tianqing Zhu
Bo Liu
Ming Ding
Xu Guo
Dayong Ye
Wanlei Zhou
Philip S. Yu
PILM
69
17
0
12 Jun 2024
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas
Chengyuan Deng
Yiqun Duan
Xin Jin
Heng Chang
Yijun Tian
...
Kuofeng Gao
Sihong He
Jun Zhuang
Lu Cheng
Haohan Wang
AILaw
46
16
0
08 Jun 2024
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
Ziqian Zeng
Jianwei Wang
Zhengdong Lu
Huiping Zhuang
Cen Chen
RALM
KELM
50
7
0
03 Jun 2024
PermLLM: Private Inference of Large Language Models within 3 Seconds under WAN
Fei Zheng
Chaochao Chen
Zhongxuan Han
Xiaolin Zheng
LRM
37
4
0
29 May 2024
Comet:
\textit{Comet:}
Comet:
A
C
o
m
‾
\underline{Com}
C
o
m
munication-
e
‾
\underline{e}
e
fficient and Performant Approxima
t
‾
\underline{t}
t
ion for Private Transformer Inference
Xiangrui Xu
Qiao Zhang
R. Ning
Chunsheng Xin
Hongyi Wu
46
5
0
24 May 2024
Ditto: Quantization-aware Secure Inference of Transformers upon MPC
Haoqi Wu
Wenjing Fang
Yancheng Zheng
Junming Ma
Jin Tan
Yinggui Wang
Lei Wang
MQ
53
2
0
09 May 2024
EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization
Wenxuan Zeng
Tianshi Xu
Meng Li
Runsheng Wang
MQ
38
0
0
15 Apr 2024
CipherFormer: Efficient Transformer Private Inference with Low Round Complexity
Weize Wang
Yi Kuang
28
0
0
25 Mar 2024
A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism
Zhiyuan Chen
Yu Li
Suochao Zhang
Jingbo Zhou
Jiwen Zhou
Chenfu Bao
Dianhai Yu
26
0
0
12 Mar 2024
Spin: An Efficient Secure Computation Framework with GPU Acceleration
Wuxuan Jiang
Xiangjun Song
Shenbai Hong
Haijun Zhang
Wenxin Liu
Bo Zhao
Wei Xu
Yi Li
23
1
0
04 Feb 2024
Regularized PolyKervNets: Optimizing Expressiveness and Efficiency for Private Inference in Deep Neural Networks
Toluwani Aremu
25
0
0
23 Dec 2023
Grounding Foundation Models through Federated Transfer Learning: A General Framework
Yan Kang
Tao Fan
Hanlin Gu
Xiaojin Zhang
Lixin Fan
Qiang Yang
AI4CE
68
19
0
29 Nov 2023
CompactTag: Minimizing Computation Overheads in Actively-Secure MPC for Deep Neural Networks
Yongqin Wang
Pratik Sarkar
Nishat Koti
A. Patra
Murali Annavaram
27
2
0
08 Nov 2023
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models
Haoran Li
Dadi Guo
Donghao Li
Wei Fan
Qi Hu
Xin Liu
Chunkit Chan
Duanyi Yao
Yuan Yao
Yangqiu Song
PILM
39
24
0
07 Nov 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference
Wenxuan Zeng
Meng Li
Haichuan Yang
Wen-jie Lu
Runsheng Wang
Ru Huang
23
6
0
03 Nov 2023
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
50
42
0
16 Oct 2023
PriViT: Vision Transformers for Fast Private Inference
Naren Dhyani
Jianqiao Mo
Minsu Cho
Ameya Joshi
Siddharth Garg
Brandon Reagen
Chinmay Hegde
28
4
0
06 Oct 2023
Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference
Kiwan Maeng
G. E. Suh
30
2
0
09 Sep 2023
Compact: Approximating Complex Activation Functions for Secure Computation
Mazharul Islam
Sunpreet S. Arora
Rahul Chatterjee
Peter Rindal
Maliheh Shirvanian
32
4
0
09 Sep 2023
East: Efficient and Accurate Secure Transformer Framework for Inference
Yuanchao Ding
Hua Guo
Yewei Guan
Weixin Liu
Jiarong Huo
Zhenyu Guan
Xiyong Zhang
34
17
0
19 Aug 2023
PUMA: Secure Inference of LLaMA-7B in Five Minutes
Ye Dong
Wen-jie Lu
Yancheng Zheng
Haoqi Wu
Derun Zhao
Jin Tan
Zhicong Huang
Cheng Hong
Tao Wei
Wen-Chang Cheng
39
52
0
24 Jul 2023
LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly Transformers
Xuanqing Liu
Zhuotao Liu
16
22
0
28 May 2023
MERGE: Fast Private Text Generation
Zi Liang
Pinghui Wang
Ruofei Zhang
Nuo Xu
Lifeng Xing
Shuo Zhang
20
6
0
25 May 2023
Fast Distributed Inference Serving for Large Language Models
Bingyang Wu
Yinmin Zhong
Zili Zhang
Gang Huang
Xuanzhe Liu
Xin Jin
30
93
0
10 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
58
22
0
04 May 2023
DeepReShape: Redesigning Neural Networks for Efficient Private Inference
N. Jha
Brandon Reagen
36
10
0
20 Apr 2023
Primer: Fast Private Transformer Inference on Encrypted Data
Mengxin Zheng
Qian Lou
Lei Jiang
25
30
0
23 Mar 2023
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
43
67
0
09 Feb 2023
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
19
5
0
06 Jan 2023
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
Wenyuan Zeng
Meng Li
Wenjie Xiong
Tong Tong
Wen-jie Lu
Jin Tan
Runsheng Wang
Ru Huang
24
20
0
25 Nov 2022
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
266
7,638
0
03 Jul 2012
1