ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06168
  4. Cited By
Channel Pruning for Accelerating Very Deep Neural Networks

Channel Pruning for Accelerating Very Deep Neural Networks

19 July 2017
Yihui He
Xiangyu Zhang
Jian Sun
ArXivPDFHTML

Papers citing "Channel Pruning for Accelerating Very Deep Neural Networks"

50 / 1,090 papers shown
Title
OSSCAR: One-Shot Structured Pruning in Vision and Language Models with
  Combinatorial Optimization
OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization
Xiang Meng
Shibal Ibrahim
Kayhan Behdin
Hussein Hazimeh
Natalia Ponomareva
Rahul Mazumder
VLM
49
5
0
02 Mar 2024
Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space
Towards Explaining Deep Neural Network Compression Through a Probabilistic Latent Space
Mahsa Mozafari-Nia
Salimeh Yasaei Sekeh
23
0
0
29 Feb 2024
REPrune: Channel Pruning via Kernel Representative Selection
REPrune: Channel Pruning via Kernel Representative Selection
Mincheol Park
Dongjin Kim
Cheonjun Park
Yuna Park
Gyeong Eun Gong
Won Woo Ro
Suhyun Kim
VLM
49
1
0
27 Feb 2024
SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field
SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field
Zetian Song
Wenhong Duan
Yuhuai Zhang
Shiqi Wang
Siwei Ma
Wen Gao
25
2
0
26 Feb 2024
GPTVQ: The Blessing of Dimensionality for LLM Quantization
GPTVQ: The Blessing of Dimensionality for LLM Quantization
M. V. Baalen
Andrey Kuzmin
Markus Nagel
Peter Couperus
Cédric Bastoul
E. Mahurin
Tijmen Blankevoort
Paul N. Whatmough
MQ
36
28
0
23 Feb 2024
Not All Experts are Equal: Efficient Expert Pruning and Skipping for
  Mixture-of-Experts Large Language Models
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
Xudong Lu
Qi Liu
Yuhui Xu
Aojun Zhou
Siyuan Huang
Bo-Wen Zhang
Junchi Yan
Hongsheng Li
MoE
32
26
0
22 Feb 2024
FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local
  Parameter Sharing
FedLPS: Heterogeneous Federated Learning for Multiple Tasks with Local Parameter Sharing
Yongzhe Jia
Xuyun Zhang
Amin Beheshti
Wanchun Dou
FedML
27
5
0
13 Feb 2024
Towards Meta-Pruning via Optimal Transport
Towards Meta-Pruning via Optimal Transport
Alexander Theus
Olin Geimer
Friedrich Wicke
Thomas Hofmann
Sotiris Anagnostidis
Sidak Pal Singh
MoMe
24
3
0
12 Feb 2024
Compressing Deep Reinforcement Learning Networks with a Dynamic
  Structured Pruning Method for Autonomous Driving
Compressing Deep Reinforcement Learning Networks with a Dynamic Structured Pruning Method for Autonomous Driving
Wensheng Su
Zhenni Li
Minrui Xu
Jiawen Kang
Dusit Niyato
Shengli Xie
20
8
0
07 Feb 2024
A Survey on Transformer Compression
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
41
28
0
05 Feb 2024
Lightweight Pixel Difference Networks for Efficient Visual
  Representation Learning
Lightweight Pixel Difference Networks for Efficient Visual Representation Learning
Z. Su
Jiehua Zhang
Longguang Wang
Hua Zhang
Zhen Liu
M. Pietikäinen
Li Liu
38
20
0
01 Feb 2024
One-Step Forward and Backtrack: Overcoming Zig-Zagging in Loss-Aware
  Quantization Training
One-Step Forward and Backtrack: Overcoming Zig-Zagging in Loss-Aware Quantization Training
Lianbo Ma
Yuee Zhou
Jianlun Ma
Guo-Ding Yu
Qing Li
MQ
25
1
0
30 Jan 2024
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Saleh Ashkboos
Maximilian L. Croci
Marcelo Gennari do Nascimento
Torsten Hoefler
James Hensman
VLM
132
145
0
26 Jan 2024
DTMM: Deploying TinyML Models on Extremely Weak IoT Devices with Pruning
DTMM: Deploying TinyML Models on Extremely Weak IoT Devices with Pruning
Lixiang Han
Zhen Xiao
Zhenjiang Li
41
5
0
17 Jan 2024
Harnessing Orthogonality to Train Low-Rank Neural Networks
Harnessing Orthogonality to Train Low-Rank Neural Networks
D. Coquelin
Katharina Flügel
Marie Weiel
Nicholas Kiefer
Charlotte Debus
Achim Streit
Markus Goetz
26
1
0
16 Jan 2024
Boosting Defect Detection in Manufacturing using Tensor Convolutional
  Neural Networks
Boosting Defect Detection in Manufacturing using Tensor Convolutional Neural Networks
Pablo Martin-Ramiro
Unai Sainz de la Maza
Sukhbinder Singh
Roman Orus
Samuel Mugel
UQCV
38
2
0
29 Dec 2023
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision
  Quantization
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization
K. Balaskas
Andreas Karatzas
Christos Sad
K. Siozios
Iraklis Anagnostopoulos
Georgios Zervakis
Jörg Henkel
MQ
41
10
0
23 Dec 2023
Sparsity-Guided Holistic Explanation for LLMs with Interpretable
  Inference-Time Intervention
Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention
Zhen Tan
Tianlong Chen
Zhenyu Zhang
Huan Liu
52
14
0
22 Dec 2023
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic
  Tensor Selection
ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection
Kai Huang
Boyuan Yang
Wei Gao
32
18
0
21 Dec 2023
ARBiBench: Benchmarking Adversarial Robustness of Binarized Neural
  Networks
ARBiBench: Benchmarking Adversarial Robustness of Binarized Neural Networks
Peng Zhao
Jiehua Zhang
Bowen Peng
Longguang Wang
Yingmei Wei
Yu Liu
Li Liu
AAML
32
0
0
21 Dec 2023
Model-Based Control with Sparse Neural Dynamics
Model-Based Control with Sparse Neural Dynamics
Ziang Liu
Genggeng Zhou
Jeff He
Tobia Marcucci
Fei-Fei Li
Jiajun Wu
Yunzhu Li
AI4CE
35
17
0
20 Dec 2023
Optimizing Convolutional Neural Network Architecture
Optimizing Convolutional Neural Network Architecture
Luis Balderas
Miguel Lastra
José M. Benítez
CVBM
23
4
0
17 Dec 2023
OTOv3: Automatic Architecture-Agnostic Neural Network Training and
  Compression from Structured Pruning to Erasing Operators
OTOv3: Automatic Architecture-Agnostic Neural Network Training and Compression from Structured Pruning to Erasing Operators
Tianyi Chen
Tianyu Ding
Zhihui Zhu
Zeyu Chen
HsiangTao Wu
Ilya Zharkov
Luming Liang
21
3
0
15 Dec 2023
Weight subcloning: direct initialization of transformers using larger
  pretrained ones
Weight subcloning: direct initialization of transformers using larger pretrained ones
Mohammad Samragh
Mehrdad Farajtabar
Sachin Mehta
Raviteja Vemulapalli
Fartash Faghri
Devang Naik
Oncel Tuzel
Mohammad Rastegari
21
26
0
14 Dec 2023
CBQ: Cross-Block Quantization for Large Language Models
CBQ: Cross-Block Quantization for Large Language Models
Xin Ding
Xiaoyu Liu
Zhijun Tu
Yun-feng Zhang
Wei Li
...
Hanting Chen
Yehui Tang
Zhiwei Xiong
Baoqun Yin
Yunhe Wang
MQ
38
13
0
13 Dec 2023
MaxQ: Multi-Axis Query for N:M Sparsity Network
MaxQ: Multi-Axis Query for N:M Sparsity Network
Jingyang Xiang
Siqi Li
Junhao Chen
Zhuangzhi Chen
Tianxin Huang
Linpeng Peng
Yong-Jin Liu
18
0
0
12 Dec 2023
Ternary Spike: Learning Ternary Spikes for Spiking Neural Networks
Ternary Spike: Learning Ternary Spikes for Spiking Neural Networks
Yu-Zhu Guo
Y. Chen
Xiaode Liu
Weihang Peng
Yuhan Zhang
Xuhui Huang
Zhe Ma
33
28
0
11 Dec 2023
SlimSAM: 0.1% Data Makes Segment Anything Slim
SlimSAM: 0.1% Data Makes Segment Anything Slim
Zigeng Chen
Gongfan Fang
Xinyin Ma
Xinchao Wang
38
13
0
08 Dec 2023
Accelerating Convolutional Neural Network Pruning via Spatial Aura
  Entropy
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy
Bogdan Musat
Razvan Andonie
26
0
0
08 Dec 2023
A Masked Pruning Approach for Dimensionality Reduction in
  Communication-Efficient Federated Learning Systems
A Masked Pruning Approach for Dimensionality Reduction in Communication-Efficient Federated Learning Systems
Tamir L. S. Gez
Kobi Cohen
32
2
0
06 Dec 2023
Towards Sobolev Pruning
Towards Sobolev Pruning
Neil Kichler
Sher Afghan
U. Naumann
23
0
0
06 Dec 2023
Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
Yiming Li
Mingyan Zhu
Junfeng Guo
Tao Wei
Shu-Tao Xia
Zhan Qin
AAML
71
1
0
03 Dec 2023
Towards Higher Ranks via Adversarial Weight Pruning
Towards Higher Ranks via Adversarial Weight Pruning
Yuchuan Tian
Hanting Chen
Tianyu Guo
Chao Xu
Yunhe Wang
35
2
0
29 Nov 2023
BinaryHPE: 3D Human Pose and Shape Estimation via Binarization
BinaryHPE: 3D Human Pose and Shape Estimation via Binarization
Zhiteng Li
Yulun Zhang
Jing Lin
Haotong Qin
Jinjin Gu
Xin Yuan
Linghe Kong
Xiaokang Yang
3DH
42
1
0
24 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive
  Review
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
34
8
0
20 Nov 2023
Pursing the Sparse Limitation of Spiking Deep Learning Structures
Pursing the Sparse Limitation of Spiking Deep Learning Structures
Hao-Ran Cheng
Jiahang Cao
Erjia Xiao
Mengshu Sun
Le Yang
Jize Zhang
Xue Lin
B. Kailkhura
Kaidi Xu
Renjing Xu
16
1
0
18 Nov 2023
Adaptive Compression-Aware Split Learning and Inference for Enhanced
  Network Efficiency
Adaptive Compression-Aware Split Learning and Inference for Enhanced Network Efficiency
Akrit Mudvari
Antero Vainio
Iason Ofeidis
Sasu Tarkoma
Leandros Tassiulas
29
3
0
09 Nov 2023
Mini but Mighty: Finetuning ViTs with Mini Adapters
Mini but Mighty: Finetuning ViTs with Mini Adapters
Imad Eddine Marouf
Enzo Tartaglione
Stéphane Lathuilière
36
5
0
07 Nov 2023
OrthoNets: Orthogonal Channel Attention Networks
OrthoNets: Orthogonal Channel Attention Networks
Hadi Salman
Caleb Parks
Matthew Swan
John Gauch
18
9
0
06 Nov 2023
Efficient Model-Based Deep Learning via Network Pruning and Fine-Tuning
Efficient Model-Based Deep Learning via Network Pruning and Fine-Tuning
Chicago Y. Park
Weijie Gan
Zihao Zou
Yuyang Hu
Zhixin Sun
Ulugbek S. Kamilov
22
0
0
03 Nov 2023
USDC: Unified Static and Dynamic Compression for Visual Transformer
USDC: Unified Static and Dynamic Compression for Visual Transformer
Huan Yuan
Chao Liao
Jianchao Tan
Peng Yao
Jiyuan Jia
Bin Chen
Chengru Song
Di Zhang
ViT
25
0
0
17 Oct 2023
Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse
  Multi-DNN Workloads
Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi-DNN Workloads
Hongxiang Fan
Stylianos I. Venieris
Alexandros Kouris
Nicholas D. Lane
21
7
0
17 Oct 2023
The Road to On-board Change Detection: A Lightweight Patch-Level Change
  Detection Network via Exploring the Potential of Pruning and Pooling
The Road to On-board Change Detection: A Lightweight Patch-Level Change Detection Network via Exploring the Potential of Pruning and Pooling
Lihui Xue
Zhihao Wang
Xueqian Wang
Gang Li
41
1
0
16 Oct 2023
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large
  Language Models
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
Jing Liu
Ruihao Gong
Xiuying Wei
Zhiwei Dong
Jianfei Cai
Bohan Zhuang
MQ
35
51
0
12 Oct 2023
Filter Pruning For CNN With Enhanced Linear Representation Redundancy
Filter Pruning For CNN With Enhanced Linear Representation Redundancy
Bojue Wang
Chun-Xia Ma
Bin Liu
Nianbo Liu
Jinqi Zhu
37
1
0
10 Oct 2023
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading
  Acceleration
SUBP: Soft Uniform Block Pruning for 1xN Sparse CNNs Multithreading Acceleration
Jingyang Xiang
Siqi Li
Jun Chen
Shipeng Bai
Yukai Ma
Guang Dai
Yong-Jin Liu
24
1
0
10 Oct 2023
Extreme sparsification of physics-augmented neural networks for
  interpretable model discovery in mechanics
Extreme sparsification of physics-augmented neural networks for interpretable model discovery in mechanics
J. Fuhg
Reese E. Jones
N. Bouklas
AI4CE
34
23
0
05 Oct 2023
ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens
ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens
Yangyang Guo
Haoyu Zhang
Yongkang Wong
Liqiang Nie
Mohan Kankanhalli
VLM
30
3
0
28 Sep 2023
Enabling Resource-efficient AIoT System with Cross-level Optimization: A
  survey
Enabling Resource-efficient AIoT System with Cross-level Optimization: A survey
Sicong Liu
Bin Guo
Cheng Fang
Ziqi Wang
Shiyan Luo
Zimu Zhou
Zhiwen Yu
AI4CE
37
22
0
27 Sep 2023
A Differentiable Framework for End-to-End Learning of Hybrid Structured
  Compression
A Differentiable Framework for End-to-End Learning of Hybrid Structured Compression
Moonjung Eo
Suhyun Kang
Wonjong Rhee
25
1
0
21 Sep 2023
Previous
123456...202122
Next