1909.05073
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices
6 September 2019
Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
CVBM
Papers citing "PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices"
47 / 47 papers shown
Title
HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection
Qizhi Zheng
Zhongze Luo
Meiyan Guo
Xinzhu Wang
Renqimuge Wu
Qiu Meng
Guanghui Dong
ObjD
126
0
0
10 Mar 2025
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Lixian Jing
Jianpeng Qi
Junyu Dong
Yanwei Yu
3DPC
AI4CE
76
0
0
24 Dec 2024
All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management
Yifan Gong
Zheng Zhan
Pu Zhao
Yushu Wu
Chaoan Wu
Caiwen Ding
Weiwen Jiang
Minghai Qin
Yanzhi Wang
71
7
0
09 Dec 2022
TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition
Lizhi Xiang
Miao Yin
Chengming Zhang
Aravind Sukumaran-Rajam
P. Sadayappan
Bo Yuan
Dingwen Tao
3DV
93
8
0
07 Nov 2022
Advancing Model Pruning via Bi-level Optimization
Yihua Zhang
Yuguang Yao
Parikshit Ram
Pu Zhao
Tianlong Chen
Min-Fong Hong
Yanzhi Wang
Sijia Liu
152
68
0
08 Oct 2022
Efficient Multi-Prize Lottery Tickets: Enhanced Accuracy, Training, and Inference Speed
Hao-Ran Cheng
Pu Zhao
Yize Li
Xue Lin
James Diffenderfer
R. Goldhahn
B. Kailkhura
MQ
65
0
0
26 Sep 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training
Geng Yuan
Yanyu Li
Sheng Li
Zhenglun Kong
Sergey Tulyakov
Xulong Tang
Yanzhi Wang
Jian Ren
86
16
0
22 Sep 2022
SparCL: Sparse Continual Learning on the Edge
Zifeng Wang
Zheng Zhan
Yifan Gong
Geng Yuan
Wei Niu
T. Jian
Bin Ren
Stratis Ioannidis
Yanzhi Wang
Jennifer Dy
CLL
124
63
0
20 Sep 2022
EVE: Environmental Adaptive Neural Network Models for Low-power Energy Harvesting System
Sahidul Islam
Shangli Zhou
Ran Ran
Yufang Jin
Wu-Shao Wen
Caiwen Ding
Mimi Xie
83
9
0
14 Jul 2022
Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
59
3
0
30 Jun 2022
CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework
Xiaofeng Li
Bin Ren
Xipeng Shen
Yanzhi Wang
GNN
50
0
0
21 Jun 2022
Can pruning improve certified robustness of neural networks?
Zhangheng Li
Tianlong Chen
Linyi Li
Yue Liu
Zhangyang Wang
AAML
108
13
0
15 Jun 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
115
0
25 Apr 2022
Structured Pruning is All You Need for Pruning CNNs at Initialization
Yaohui Cai
Weizhe Hua
Hongzheng Chen
G. E. Suh
Christopher De Sa
Zhiru Zhang
CVBM
96
15
0
04 Mar 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
64
25
0
01 Mar 2022
Mixture-of-Rookies: Saving DNN Computations by Predicting ReLU Outputs
D. Pinto
J. Arnau
Antonio González
49
1
0
10 Feb 2022
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets
Tianlong Chen
Xuxi Chen
Xiaolong Ma
Yanzhi Wang
Zhangyang Wang
82
34
0
09 Feb 2022
A Secure and Efficient Federated Learning Framework for NLP
Jieren Deng
Chenghong Wang
Xianrui Meng
Yijue Wang
Ji Li
Sheng Lin
Shuo Han
Fei Miao
Sanguthevar Rajasekaran
Caiwen Ding
FedML
121
22
0
28 Jan 2022
Recursive Least Squares for Training and Pruning Convolutional Neural Networks
Tianzong Yu
Chunyuan Zhang
Yuan Wang
Meng-tao Ma
Qingwei Song
69
1
0
13 Jan 2022
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
96
0
0
21 Dec 2021
Deep Odometry Systems on Edge with EKF-LoRa Backend for Real-Time Positioning in Adverse Environment
Zhuangzhuang Dai
Muhamad Risqi U. Saputra
Chris Xiaoxuan Lu
Andrew Markham
A. Trigoni
62
1
0
10 Dec 2021
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration
Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
...
Sijia Liu
Bin Ren
Xue Lin
Xulong Tang
Yanzhi Wang
66
10
0
22 Nov 2021
NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
60
20
0
28 Oct 2021
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge
Geng Yuan
Xiaolong Ma
Wei Niu
Zhengang Li
Zhenglun Kong
...
Minghai Qin
Bin Ren
Yanzhi Wang
Sijia Liu
Xue Lin
97
96
0
26 Oct 2021
SMOF: Squeezing More Out of Filters Yields Hardware-Friendly CNN Pruning
Yanli Liu
Bochen Guan
Qinwen Xu
Weiyi Li
Shuxue Quan
77
2
0
21 Oct 2021
Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization
Panjie Qi
E. Sha
Qingfeng Zhuge
Hongwu Peng
Shaoyi Huang
Zhenglun Kong
Yuhong Song
Bingbing Li
91
51
0
19 Oct 2021
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
115
17
0
30 Sep 2021
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity
Wei Niu
Zhengang Li
Xiaolong Ma
Peiyan Dong
Gang Zhou
Xuehai Qian
Xue Lin
Yanzhi Wang
Bin Ren
43
19
0
25 Aug 2021
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search
Zheng Zhan
Yifan Gong
Pu Zhao
Geng Yuan
Wei Niu
...
Malith Jayaweera
David Kaeli
Bin Ren
Xue Lin
Yanzhi Wang
SupR
69
42
0
18 Aug 2021
RFC-HyPGCN: A Runtime Sparse Feature Compress Accelerator for Skeleton-Based GCNs Action Recognition Model with Hybrid Pruning
Dong Wen
Jingfei Jiang
Jinwei Xu
Kang Wang
Tao Xiao
Yang Zhao
Y. Dou
GNN
49
7
0
02 Aug 2021
Achieving Real-Time Object Detection on Mobile Devices with Neural Pruning Search
Pu Zhao
Wei Niu
Geng Yuan
Yuxuan Cai
Bin Ren
Yanzhi Wang
Xue Lin
3DPC
43
2
0
28 Jun 2021
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
Boyuan Feng
Yuke Wang
Tong Geng
Ang Li
Yufei Ding
MQ
72
37
0
23 Jun 2021
Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities
L. Gonzalez-Carabarin
Iris A. M. Huijben
Bastian Veeling
A. Schmid
Ruud J. G. van Sloun
57
11
0
26 May 2021
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design
Cong Hao
Jordan Dotzel
Jinjun Xiong
Luca Benini
Zhiru Zhang
Deming Chen
113
37
0
25 Mar 2021
Teachers Do More Than Teach: Compressing Image-to-Image Models
Qing Jin
Jian Ren
Oliver J. Woodford
Jiazhuo Wang
Geng Yuan
Yanzhi Wang
Sergey Tulyakov
82
56
0
05 Mar 2021
Dancing along Battery: Enabling Transformer with Run-time Reconfigurability on Mobile Devices
Yuhong Song
Weiwen Jiang
Bingbing Li
Panjie Qi
Qingfeng Zhuge
E. Sha
Sakyasingha Dasgupta
Yiyu Shi
Caiwen Ding
64
18
0
12 Feb 2021
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Zhengang Li
Geng Yuan
Wei Niu
Pu Zhao
Yanyu Li
...
Sijia Liu
Kaiyuan Yang
Bin Ren
Yanzhi Wang
Xue Lin
MQ
100
27
0
01 Dec 2020
ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning
Chengming Zhang
Geng Yuan
Wei Niu
Jiannan Tian
Sian Jin
...
Zhe Jiang
Yanzhi Wang
Bin Ren
Shuaiwen Leon Song
Dingwen Tao
3DV
71
1
0
20 Nov 2020
Standing on the Shoulders of Giants: Hardware and Neural Architecture Co-Search with Hot Start
Weiwen Jiang
Lei Yang
Sakyasingha Dasgupta
Jiaxi Hu
Yiyu Shi
68
59
0
17 Jul 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
107
7
0
24 Apr 2020
A Unified DNN Weight Compression Framework Using Reweighted Optimization Methods
Tianyun Zhang
Xiaolong Ma
Zheng Zhan
Shangli Zhou
Minghai Qin
Fei Sun
Yen-kuang Chen
Caiwen Ding
M. Fardad
Yanzhi Wang
38
5
0
12 Apr 2020
Pre-defined Sparsity for Low-Complexity Convolutional Neural Networks
Souvik Kundu
M. Nazemi
Massoud Pedram
K. Chugg
Peter A. Beerel
CVBM
77
36
0
29 Jan 2020
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
Xiaolong Ma
Wei Niu
Tianyun Zhang
Sijia Liu
Sheng Lin
...
Xiang Chen
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
103
27
0
20 Jan 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
Xinyu Lin
Yanzhi Wang
Bin Ren
MQ
104
229
0
01 Jan 2020
Light-weight Calibrator: a Separable Component for Unsupervised Domain Adaptation
Shaokai Ye
Kailu Wu
Mu Zhou
Yunfei Yang
S. Tan
Kaidi Xu
Jiebo Song
Chenglong Bao
Kaisheng Ma
72
21
0
28 Nov 2019
Neural Network Training with Approximate Logarithmic Computations
Arnab Sanyal
Peter A. Beerel
K. Chugg
71
11
0
22 Oct 2019
Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?
Xiaolong Ma
Sheng Lin
Shaokai Ye
Zhezhi He
Linfeng Zhang
...
Deliang Fan
Xuehai Qian
Xinyu Lin
Kaisheng Ma
Yanzhi Wang
MQ
132
93
0
03 Jul 2019