ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.01452
  4. Cited By
MPCFormer: fast, performant and private Transformer inference with MPC

MPCFormer: fast, performant and private Transformer inference with MPC

2 November 2022
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
ArXivPDFHTML

Papers citing "MPCFormer: fast, performant and private Transformer inference with MPC"

50 / 50 papers shown
Title
Private Transformer Inference in MLaaS: A Survey
Private Transformer Inference in MLaaS: A Survey
Yang Li
Xinyu Zhou
Yishuo Wang
Liangxin Qian
Jun Zhao
25
0
0
15 May 2025
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
33
0
0
12 May 2025
CipherPrune: Efficient and Scalable Private Transformer Inference
CipherPrune: Efficient and Scalable Private Transformer Inference
Yancheng Zhang
Jinbao Xue
Mengxin Zheng
Mimi Xie
Mingzhe Zhang
Lei Jiang
Qian Lou
64
2
0
24 Feb 2025
Encryption-Friendly LLM Architecture
Encryption-Friendly LLM Architecture
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
57
2
0
24 Feb 2025
HawkEye: Statically and Accurately Profiling the Communication Cost of Models in Multi-party Learning
HawkEye: Statically and Accurately Profiling the Communication Cost of Models in Multi-party Learning
Wenqiang Ruan
Xin Lin
Ruisheng Zhou
Guopeng Lin
Shui Yu
Weili Han
45
0
0
16 Feb 2025
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference
Wenxuan Zeng
Ye Dong
Jinjin Zhou
Junming Ma
Jin Tan
Runsheng Wang
Meng Li
49
0
0
12 Jan 2025
TruncFormer: Private LLM Inference Using Only Truncations
TruncFormer: Private LLM Inference Using Only Truncations
Patrick Yubeaton
Jianqiao Mo
Karthik Garimella
N. Jha
Brandon Reagen
Chinmay Hegde
Siddharth Garg
76
0
0
02 Dec 2024
Nimbus: Secure and Efficient Two-Party Inference for Transformers
Nimbus: Secure and Efficient Two-Party Inference for Transformers
Zhengyi Li
Kang Yang
Jin Tan
Wen-jie Lu
Haoqi Wu
...
Yu Yu
Derun Zhao
Yancheng Zheng
M. Guo
Jingwen Leng
74
2
0
24 Nov 2024
Open LLMs are Necessary for Current Private Adaptations and Outperform
  their Closed Alternatives
Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives
Vincent Hanke
Tom Blanchard
Franziska Boenisch
Iyiola Emmanuel Olatunji
Michael Backes
Adam Dziedzic
PILM
56
3
0
02 Nov 2024
AERO: Softmax-Only LLMs for Efficient Private Inference
AERO: Softmax-Only LLMs for Efficient Private Inference
N. Jha
Brandon Reagen
32
1
0
16 Oct 2024
PrivQuant: Communication-Efficient Private Inference with Quantized
  Network/Protocol Co-Optimization
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization
Tianshi Xu
Shuzhang Zhong
Wenxuan Zeng
Runsheng Wang
Meng Li
MQ
31
0
0
12 Oct 2024
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
Guanchu Wang
Yu-Neng Chuang
Ruixiang Tang
Shaochen Zhong
Jiayi Yuan
...
Zirui Liu
V. Chaudhary
Shuai Xu
James Caverlee
Xia Hu
PILM
84
1
0
06 Oct 2024
Secure Multiparty Generative AI
Secure Multiparty Generative AI
Manil Shrestha
Yashodha Ravichandran
Edward Kim
20
0
0
27 Sep 2024
Confidential Prompting: Protecting User Prompts from Cloud LLM Providers
Confidential Prompting: Protecting User Prompts from Cloud LLM Providers
In Gim
Caihua Li
Lin Zhong
52
2
0
27 Sep 2024
CipherDM: Secure Three-Party Inference for Diffusion Model Sampling
CipherDM: Secure Three-Party Inference for Diffusion Model Sampling
Xin Zhao
Xiaojun Chen
Xinyu Chen
He Li
Tingyu Fan
Zhendong Zhao
36
1
0
09 Sep 2024
MPC-Minimized Secure LLM Inference
MPC-Minimized Secure LLM Inference
Deevashwer Rathee
Dacheng Li
Ion Stoica
Hao Zhang
Raluca A. Popa
47
1
0
07 Aug 2024
Low-Latency Privacy-Preserving Deep Learning Design via Secure MPC
Low-Latency Privacy-Preserving Deep Learning Design via Secure MPC
Ke Lin
Yasir Glani
Ping Luo
21
0
0
24 Jul 2024
ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets
ObfuscaTune: Obfuscated Offsite Fine-tuning and Inference of Proprietary LLMs on Private Datasets
Ahmed Frikha
Nassim Walha
Ricardo Mendes
K. K. Nakka
Xue Jiang
Xuebing Zhou
74
2
0
03 Jul 2024
Unique Security and Privacy Threats of Large Language Model: A
  Comprehensive Survey
Unique Security and Privacy Threats of Large Language Model: A Comprehensive Survey
Shang Wang
Tianqing Zhu
Bo Liu
Ming Ding
Xu Guo
Dayong Ye
Wanlei Zhou
Philip S. Yu
PILM
69
17
0
12 Jun 2024
Deconstructing The Ethics of Large Language Models from Long-standing
  Issues to New-emerging Dilemmas
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas
Chengyuan Deng
Yiqun Duan
Xin Jin
Heng Chang
Yijun Tian
...
Kuofeng Gao
Sihong He
Jun Zhuang
Lu Cheng
Haohan Wang
AILaw
46
16
0
08 Jun 2024
PrivacyRestore: Privacy-Preserving Inference in Large Language Models
  via Privacy Removal and Restoration
PrivacyRestore: Privacy-Preserving Inference in Large Language Models via Privacy Removal and Restoration
Ziqian Zeng
Jianwei Wang
Zhengdong Lu
Huiping Zhuang
Cen Chen
RALM
KELM
50
7
0
03 Jun 2024
PermLLM: Private Inference of Large Language Models within 3 Seconds
  under WAN
PermLLM: Private Inference of Large Language Models within 3 Seconds under WAN
Fei Zheng
Chaochao Chen
Zhongxuan Han
Xiaolin Zheng
LRM
37
4
0
29 May 2024
$\textit{Comet:}$ A $\underline{Com}$munication-$\underline{e}$fficient
  and Performant Approxima$\underline{t}$ion for Private Transformer Inference
Comet:\textit{Comet:}Comet: A Com‾\underline{Com}Com​munication-e‾\underline{e}e​fficient and Performant Approximat‾\underline{t}t​ion for Private Transformer Inference
Xiangrui Xu
Qiao Zhang
R. Ning
Chunsheng Xin
Hongyi Wu
46
5
0
24 May 2024
Ditto: Quantization-aware Secure Inference of Transformers upon MPC
Ditto: Quantization-aware Secure Inference of Transformers upon MPC
Haoqi Wu
Wenjing Fang
Yancheng Zheng
Junming Ma
Jin Tan
Yinggui Wang
Lei Wang
MQ
53
2
0
09 May 2024
EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based
  Protocol and Quantization Co-Optimization
EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization
Wenxuan Zeng
Tianshi Xu
Meng Li
Runsheng Wang
MQ
38
0
0
15 Apr 2024
CipherFormer: Efficient Transformer Private Inference with Low Round
  Complexity
CipherFormer: Efficient Transformer Private Inference with Low Round Complexity
Weize Wang
Yi Kuang
28
0
0
25 Mar 2024
A Framework for Cost-Effective and Self-Adaptive LLM Shaking and
  Recovery Mechanism
A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism
Zhiyuan Chen
Yu Li
Suochao Zhang
Jingbo Zhou
Jiwen Zhou
Chenfu Bao
Dianhai Yu
26
0
0
12 Mar 2024
Spin: An Efficient Secure Computation Framework with GPU Acceleration
Spin: An Efficient Secure Computation Framework with GPU Acceleration
Wuxuan Jiang
Xiangjun Song
Shenbai Hong
Haijun Zhang
Wenxin Liu
Bo Zhao
Wei Xu
Yi Li
23
1
0
04 Feb 2024
Regularized PolyKervNets: Optimizing Expressiveness and Efficiency for
  Private Inference in Deep Neural Networks
Regularized PolyKervNets: Optimizing Expressiveness and Efficiency for Private Inference in Deep Neural Networks
Toluwani Aremu
25
0
0
23 Dec 2023
Grounding Foundation Models through Federated Transfer Learning: A
  General Framework
Grounding Foundation Models through Federated Transfer Learning: A General Framework
Yan Kang
Tao Fan
Hanlin Gu
Xiaojin Zhang
Lixin Fan
Qiang Yang
AI4CE
68
19
0
29 Nov 2023
CompactTag: Minimizing Computation Overheads in Actively-Secure MPC for
  Deep Neural Networks
CompactTag: Minimizing Computation Overheads in Actively-Secure MPC for Deep Neural Networks
Yongqin Wang
Pratik Sarkar
Nishat Koti
A. Patra
Murali Annavaram
27
2
0
08 Nov 2023
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language
  Models
PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models
Haoran Li
Dadi Guo
Donghao Li
Wei Fan
Qi Hu
Xin Liu
Chunkit Chan
Duanyi Yao
Yuan Yao
Yangqiu Song
PILM
39
24
0
07 Nov 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient
  Private Inference
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference
Wenxuan Zeng
Meng Li
Haichuan Yang
Wen-jie Lu
Runsheng Wang
Ru Huang
23
6
0
03 Nov 2023
Privacy in Large Language Models: Attacks, Defenses and Future
  Directions
Privacy in Large Language Models: Attacks, Defenses and Future Directions
Haoran Li
Yulin Chen
Jinglong Luo
Yan Kang
Xiaojin Zhang
Qi Hu
Chunkit Chan
Yangqiu Song
PILM
50
42
0
16 Oct 2023
PriViT: Vision Transformers for Fast Private Inference
PriViT: Vision Transformers for Fast Private Inference
Naren Dhyani
Jianqiao Mo
Minsu Cho
Ameya Joshi
Siddharth Garg
Brandon Reagen
Chinmay Hegde
28
4
0
06 Oct 2023
Approximating ReLU on a Reduced Ring for Efficient MPC-based Private
  Inference
Approximating ReLU on a Reduced Ring for Efficient MPC-based Private Inference
Kiwan Maeng
G. E. Suh
30
2
0
09 Sep 2023
Compact: Approximating Complex Activation Functions for Secure
  Computation
Compact: Approximating Complex Activation Functions for Secure Computation
Mazharul Islam
Sunpreet S. Arora
Rahul Chatterjee
Peter Rindal
Maliheh Shirvanian
32
4
0
09 Sep 2023
East: Efficient and Accurate Secure Transformer Framework for Inference
East: Efficient and Accurate Secure Transformer Framework for Inference
Yuanchao Ding
Hua Guo
Yewei Guan
Weixin Liu
Jiarong Huo
Zhenyu Guan
Xiyong Zhang
34
17
0
19 Aug 2023
PUMA: Secure Inference of LLaMA-7B in Five Minutes
PUMA: Secure Inference of LLaMA-7B in Five Minutes
Ye Dong
Wen-jie Lu
Yancheng Zheng
Haoqi Wu
Derun Zhao
Jin Tan
Zhicong Huang
Cheng Hong
Tao Wei
Wen-Chang Cheng
39
52
0
24 Jul 2023
LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly
  Transformers
LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly Transformers
Xuanqing Liu
Zhuotao Liu
16
22
0
28 May 2023
MERGE: Fast Private Text Generation
MERGE: Fast Private Text Generation
Zi Liang
Pinghui Wang
Ruofei Zhang
Nuo Xu
Lifeng Xing
Shuo Zhang
20
6
0
25 May 2023
Fast Distributed Inference Serving for Large Language Models
Fast Distributed Inference Serving for Large Language Models
Bingyang Wu
Yinmin Zhong
Zili Zhang
Gang Huang
Xuanzhe Liu
Xin Jin
30
93
0
10 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Cuttlefish: Low-Rank Model Training without All the Tuning
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
58
22
0
04 May 2023
DeepReShape: Redesigning Neural Networks for Efficient Private Inference
DeepReShape: Redesigning Neural Networks for Efficient Private Inference
N. Jha
Brandon Reagen
36
10
0
20 Apr 2023
Primer: Fast Private Transformer Inference on Encrypted Data
Primer: Fast Private Transformer Inference on Encrypted Data
Mengxin Zheng
Qian Lou
Lei Jiang
25
30
0
23 Mar 2023
Offsite-Tuning: Transfer Learning without Full Model
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
43
67
0
09 Feb 2023
Does compressing activations help model parallel training?
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
19
5
0
06 Jan 2023
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision
  Transformer with Heterogeneous Attention
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
Wenyuan Zeng
Meng Li
Wenjie Xiong
Tong Tong
Wen-jie Lu
Jin Tan
Runsheng Wang
Ru Huang
24
20
0
25 Nov 2022
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Improving neural networks by preventing co-adaptation of feature
  detectors
Improving neural networks by preventing co-adaptation of feature detectors
Geoffrey E. Hinton
Nitish Srivastava
A. Krizhevsky
Ilya Sutskever
Ruslan Salakhutdinov
VLM
266
7,638
0
03 Jul 2012
1