Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06168
Cited By
v1
v2 (latest)
Channel Pruning for Accelerating Very Deep Neural Networks
19 July 2017
Yihui He
Xiangyu Zhang
Jian Sun
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Channel Pruning for Accelerating Very Deep Neural Networks"
50 / 1,097 papers shown
Title
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Gongfan Fang
Hongxu Yin
Saurav Muralidharan
Greg Heinrich
Jeff Pool
Jan Kautz
Pavlo Molchanov
Xinchao Wang
73
10
0
26 Sep 2024
CNN Mixture-of-Depths
Rinor Cakaj
Jens Mehnert
Bin Yang
79
1
0
25 Sep 2024
Green Federated Learning: A new era of Green Aware AI
Dipanwita Thakur
Antonella Guzzo
Giancarlo Fortino
Francesco Piccialli
AI4CE
107
5
0
19 Sep 2024
Less Memory Means smaller GPUs: Backpropagation with Compressed Activations
Daniel Barley
Holger Froning
124
0
0
18 Sep 2024
Distilling Channels for Efficient Deep Tracking
Shiming Ge
Zhao Luo
Chunhui Zhang
Yingying Hua
Dacheng Tao
76
30
0
18 Sep 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Yuezhou Hu
Jun-Jie Zhu
Jianfei Chen
129
0
0
13 Sep 2024
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
81
1
0
12 Sep 2024
Hyper-Compression: Model Compression via Hyperfunction
Fenglei Fan
Juntong Fan
Dayang Wang
Jingbo Zhang
Zelin Dong
Shijun Zhang
Ge Wang
Tieyong Zeng
116
0
0
01 Sep 2024
MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning
Seungbeom Hu
ChanJun Park
Andrew Ferraiuolo
Sang-Ki Ko
Jinwoo Kim
Haein Song
Jieung Kim
119
1
0
24 Aug 2024
A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs
Kiran Purohit
Anurag Parvathgari
Sourangshu Bhattacharya
VLM
65
0
0
22 Aug 2024
An Effective Information Theoretic Framework for Channel Pruning
Yihao Chen
Zefang Wang
86
3
0
14 Aug 2024
Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection
Zhonglin Chen
Anyu Geng
Jianan Jiang
Jiwu Lu
Di Wu
ObjD
47
0
0
14 Aug 2024
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression
Jonas Schmitt
Ruiping Liu
Junwei Zheng
Jiaming Zhang
Rainer Stiefelhagen
VLM
122
0
0
06 Aug 2024
An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design
Mingshuo Liu
Shiyi Luo
Kevin Han
Magno T. M. Silva
R. Demara
Ali H. Sayed
ObjD
74
13
0
02 Aug 2024
Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs
Seungmin Yu
Xiaodie Yi
Hayun Lee
Dongkun Shin
74
1
0
30 Jul 2024
Tiled Bit Networks: Sub-Bit Neural Network Compression Through Reuse of Learnable Binary Vectors
Matt Gorbett
Hossein Shirazi
Indrakshi Ray
MQ
114
0
0
16 Jul 2024
Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates
Shirley Kokane
M. R. Uddin
Min Xu
110
1
0
05 Jul 2024
Isomorphic Pruning for Vision Models
Gongfan Fang
Xinyin Ma
Michael Bi Mi
Xinchao Wang
VLM
ViT
83
8
0
05 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Min-man Wu
Min Wu
Xiaoli Li
Weisi Lin
ViT
VLM
211
10
0
02 Jul 2024
Pruning One More Token is Enough: Leveraging Latency-Workload Non-Linearities for Vision Transformers on the Edge
Nick Eliopoulos
Purvish Jajal
James Davis
Gaowen Liu
George K. Thiravathukal
Yung-Hsiang Lu
68
1
0
01 Jul 2024
DIR-BHRNet: A Lightweight Network for Real-time Vision-based Multi-person Pose Estimation on Smartphones
Gongjin Lan
Yu Wu
Qi Hao
3DH
70
4
0
01 Jul 2024
RepAct: The Re-parameterizable Adaptive Activation Function
Xian Wu
Qingchuan Tao
Shuang Wang
70
0
0
28 Jun 2024
SCOPE: Stochastic Cartographic Occupancy Prediction Engine for Uncertainty-Aware Dynamic Navigation
Zhanteng Xie
P. Dames
116
1
0
28 Jun 2024
Finding Transformer Circuits with Edge Pruning
Adithya Bhaskar
Alexander Wettig
Dan Friedman
Danqi Chen
226
20
0
24 Jun 2024
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization
Sungbin Shin
Wonpyo Park
Jaeho Lee
Namhoon Lee
75
2
0
21 Jun 2024
A Comprehensive Study of Structural Pruning for Vision Models
Haoling Li
Haoling Li
Mengqi Xue
Gongfan Fang
Sheng Zhou
Zunlei Feng
Huiqiong Wang
Mingli Song
Lechao Cheng
VLM
69
0
0
18 Jun 2024
A Generic Layer Pruning Method for Signal Modulation Recognition Deep Learning Models
Yao Lu
Yutao Zhu
Yuqi Li
Dongwei Xu
Yun Lin
Qi Xuan
Xiaoniu Yang
61
8
0
12 Jun 2024
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
Xiang Meng
Kayhan Behdin
Haoyue Wang
Rahul Mazumder
76
6
0
12 Jun 2024
Decay Pruning Method: Smooth Pruning With a Self-Rectifying Procedure
Minghao Yang
Linlin Gao
Pengyuan Li
Wenbo Li
Yihong Dong
Zhiying Cui
67
1
0
06 Jun 2024
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
Jinyin Chen
Xiaoming Zhao
Haibin Zheng
Xiao Li
Sheng Xiang
Haifeng Guo
AAML
50
5
0
01 Jun 2024
Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study
Pallavi Mitra
Gesina Schwalbe
Nadja Klein
AAML
74
1
0
31 May 2024
STAT: Shrinking Transformers After Training
Megan Flynn
Alexander Wang
Dean Edward Alvarez
Christopher De Sa
Anil Damle
79
2
0
29 May 2024
Subspace Node Pruning
Joshua Offergeld
Marcel van Gerven
Nasir Ahmad
81
0
0
26 May 2024
Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost
Yuan Gao
Weizhong Zhang
Wenhan Luo
Lin Ma
Jin-Gang Yu
Gui-Song Xia
Jiayi Ma
82
1
0
09 May 2024
COPAL: Continual Pruning in Large Language Generative Models
Srikanth Malla
Joon Hee Choi
Chiho Choi
VLM
CLL
81
2
0
02 May 2024
AB-Training: A Communication-Efficient Approach for Distributed Low-Rank Learning
D. Coquelin
Katherina Flügel
Marie Weiel
Nicholas Kiefer
Muhammed Öz
Charlotte Debus
Achim Streit
Markus Goetz
90
0
0
02 May 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
71
2
0
22 Apr 2024
Data-independent Module-aware Pruning for Hierarchical Vision Transformers
Yang He
Qiufeng Wang
ViT
83
5
0
21 Apr 2024
SNP: Structured Neuron-level Pruning to Preserve Attention Scores
Kyunghwan Shim
Jaewoong Yun
Shinkook Choi
42
1
0
18 Apr 2024
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks
Sreyes P. Venkatesh
Razvan Marinescu
Jason K. Eshraghian
MQ
112
5
0
15 Apr 2024
ONNXPruner: ONNX-Based General Model Pruning Adapter
Dongdong Ren
Wenbin Li
Tianyu Ding
Lei Wang
Qi Fan
Jing Huo
Hongbing Pan
Yang Gao
92
3
0
10 Apr 2024
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
130
66
0
08 Apr 2024
Towards Generalized Entropic Sparsification for Convolutional Neural Networks
Tin Barisin
I. Horenko
53
1
0
06 Apr 2024
Separate, Dynamic and Differentiable (SMART) Pruner for Block/Output Channel Pruning on Computer Vision Tasks
Guanhua Ding
Zexi Ye
Zhen Zhong
Gang Li
David Shao
61
0
0
29 Mar 2024
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
84
7
0
28 Mar 2024
Tiny Machine Learning: Progress and Futures
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Song Han
90
60
0
28 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
155
106
0
26 Mar 2024
Block Selective Reprogramming for On-device Training of Vision Transformers
Sreetama Sarkar
Souvik Kundu
Kai Zheng
Peter A. Beerel
63
2
0
25 Mar 2024
Tensor network compressibility of convolutional models
Sukhbinder Singh
S. Jahromi
Roman Orus
51
3
0
21 Mar 2024
EffiPerception: an Efficient Framework for Various Perception Tasks
Xinhao Xiang
Simon Dräger
Jiawei Zhang
VLM
75
0
0
18 Mar 2024
Previous
1
2
3
4
5
...
20
21
22
Next