ResearchTrend.AI
Learning both Weights and Connections for Efficient Neural Networks

8 June 2015
Song Han
Jeff Pool
J. Tran
W. Dally
    CVBM

Papers citing "Learning both Weights and Connections for Efficient Neural Networks"

50 / 1,191 papers shown
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
45
2
0
14 Jun 2022
Energy Consumption Analysis of pruned Semantic Segmentation Networks on an Embedded GPU
Hugo Tessier
Vincent Gripon
Mathieu Léonardon
M. Arzel
David Bertrand
T. Hannagan
GNN
SSeg
3DPC
45
2
0
13 Jun 2022
Leveraging Structured Pruning of Convolutional Neural Networks
Hugo Tessier
Vincent Gripon
Mathieu Léonardon
M. Arzel
David Bertrand
T. Hannagan
CVBM
28
1
0
13 Jun 2022
PAC-Net: A Model Pruning Approach to Inductive Transfer Learning
Sanghoon Myung
I. Huh
Wonik Jang
Jae Myung Choe
Jisu Ryu
Daesin Kim
Kee-Eung Kim
C. Jeong
32
13
0
12 Jun 2022
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm
Aidan Good
Jia-Huei Lin
Hannah Sieg
Mikey Ferguson
Xin Yu
Shandian Zhe
J. Wieczorek
Thiago Serra
42
11
0
07 Jun 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLM
MQ
78
455
0
04 Jun 2022
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Y. Fu
Haichuan Yang
Jiayi Yuan
Meng Li
Cheng Wan
Raghuraman Krishnamoorthi
Vikas Chandra
Yingyan Lin
41
19
0
02 Jun 2022
ViNNPruner: Visual Interactive Pruning for Deep Learning
U. Schlegel
Samuel Schiegg
Daniel A. Keim
VLM
34
2
0
31 May 2022
Gator: Customizable Channel Pruning of Neural Networks with Gating
E. Passov
E. David
N. Netanyahu
AAML
45
0
0
30 May 2022
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning
Chengfei Lv
Chaoyue Niu
Renjie Gu
Xiaotang Jiang
Zhaode Wang
...
Guohuan Xu
Fei Wu
Shaojie Tang
Fan Wu
Guihai Chen
MoE
LRM
18
38
0
30 May 2022
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
39
120
0
29 May 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
142
2,074
0
27 May 2022
Spartan: Differentiable Sparsity via Regularized Transportation
Kai Sheng Tai
Taipeng Tian
Ser-Nam Lim
39
11
0
27 May 2022
Pruning has a disparate impact on model accuracy
Cuong Tran
Ferdinando Fioretto
Jung-Eun Kim
Rakshit Naidu
50
40
0
26 May 2022
Compression-aware Training of Neural Networks using Frank-Wolfe
Max Zimmer
Christoph Spiegel
Sebastian Pokutta
36
9
0
24 May 2022
MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments
Shreshth Tuli
G. Casale
N. Jennings
30
2
0
21 May 2022
HyBNN and FedHyBNN: (Federated) Hybrid Binary Neural Networks
Kinshuk Dua
FedML
MQ
29
0
0
19 May 2022
Perturbation of Deep Autoencoder Weights for Model Compression and Classification of Tabular Data
Manar D. Samad
Sakib Abrar
38
12
0
17 May 2022
Sharp asymptotics on the compression of two-layer neural networks
Mohammad Hossein Amani
Simone Bombari
Marco Mondelli
Rattana Pukdee
Stefano Rini
MLT
32
0
0
17 May 2022
Convolutional and Residual Networks Provably Contain Lottery Tickets
R. Burkholz
UQCV
MLT
49
13
0
04 May 2022
Most Activation Functions Can Win the Lottery Without Excessive Depth
R. Burkholz
MLT
84
18
0
04 May 2022
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
Yihan Wang
Zhekai Zhang
Han Cai
Wei-Ming Chen
Song Han
3DH
35
72
0
03 May 2022
Zebra: Memory Bandwidth Reduction for CNN Accelerators With Zero Block Regularization of Activation Maps
Hsu-Tung Shih
Tian-Sheuan Chang
30
3
0
02 May 2022
Sparse Compressed Spiking Neural Network Accelerator for Object Detection
Hong-Han Lien
Tian-Sheuan Chang
27
27
0
02 May 2022
A Closer Look at Branch Classifiers of Multi-exit Architectures
Shaohui Lin
Bo Ji
Rongrong Ji
Angela Yao
22
4
0
28 Apr 2022
Federated Progressive Sparsification (Purge, Merge, Tune)+
Dimitris Stripelis
Umang Gupta
Greg Ver Steeg
J. Ambite
FedML
28
9
0
26 Apr 2022
Attentive Fine-Grained Structured Sparsity for Image Restoration
Junghun Oh
Heewon Kim
Seungjun Nah
Chee Hong
Jonghyun Choi
Kyoung Mu Lee
32
18
0
26 Apr 2022
PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution
Zhijian Liu
Haotian Tang
Shengyu Zhao
Kevin Shao
Song Han
3DPC
26
40
0
25 Apr 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator
Miao Yu
Tingting Xiang
Venkata Pavan Kumar Miriyala
Trevor E. Carlson
28
1
0
20 Apr 2022
Receding Neuron Importances for Structured Pruning
Mihai Suteu
Yike Guo
34
1
0
13 Apr 2022
OMAD: On-device Mental Anomaly Detection for Substance and Non-Substance Users
Emon Dey
Nirmalya Roy
23
5
0
13 Apr 2022
Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi
Mohammed Hasanuzzaman
Chandra Thapa
Haithem Afli
T. Scully
50
23
0
07 Apr 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
40
11
0
06 Apr 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Tri Dao
Beidi Chen
N. Sohoni
Arjun D Desai
Michael Poli
Jessica Grogan
Alexander Liu
Aniruddh Rao
Atri Rudra
Christopher Ré
46
88
0
01 Apr 2022
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
L. D. Prasad
Sreyan Ghosh
S. Umesh
45
12
0
31 Mar 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
34
3
0
30 Mar 2022
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models
Ziqing Yang
Yiming Cui
Zhigang Chen
SyDa
VLM
31
12
0
30 Mar 2022
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
Yu Tang
Chenyu Wang
Yufan Zhang
Yuliang Liu
Xingcheng Zhang
Linbo Qiao
Zhiquan Lai
Dongsheng Li
26
4
0
30 Mar 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
Paul Gavrikov
J. Keuper
AAML
29
31
0
29 Mar 2022
Playing Lottery Tickets in Style Transfer Models
Meihao Kong
Jing Huo
Wenbin Li
Jing Wu
Yu-kun Lai
Yang Gao
29
1
0
25 Mar 2022
Sparse Federated Learning with Hierarchical Personalized Models
Xiaofeng Liu
Qing Wang
Yunfeng Shao
Yinchuan Li
FedML
60
11
0
25 Mar 2022
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption
Sam Leroux
Pieter Simoens
Meelis Lootus
Kartik Thakore
Akshay Sharma
37
16
0
21 Mar 2022
Online Continual Learning for Embedded Devices
Tyler L. Hayes
Christopher Kanan
CLL
45
54
0
21 Mar 2022
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention
Zuzana Jelčicová
Marian Verhelst
51
5
0
20 Mar 2022
Learning Compressed Embeddings for On-Device Inference
Niketan Pansare
J. Katukuri
Aditya Arora
F. Cipollone
R. Shaik
Noyan Tokgozoglu
Chandru Venkataraman
44
14
0
18 Mar 2022
Improve Convolutional Neural Network Pruning by Maximizing Filter Variety
Nathan Hubens
M. Mancas
B. Gosselin
Marius Preda
T. Zaharia
21
2
0
11 Mar 2022
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning
Guyue Huang
Haoran Li
Minghai Qin
Fei Sun
Yufei Ding
Yuan Xie
38
18
0
09 Mar 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
Xin Yu
Thiago Serra
Srikumar Ramalingam
Shandian Zhe
49
48
0
09 Mar 2022
Pruning Graph Convolutional Networks to select meaningful graph frequencies for fMRI decoding
Yassine El Ouahidi
Hugo Tessier
G. Lioi
Nicolas Farrugia
Bastien Pasdeloup
Vincent Gripon
GNN
46
2
0
09 Mar 2022
Dual Lottery Ticket Hypothesis
Yue Bai
Haiquan Wang
Zhiqiang Tao
Kunpeng Li
Yun Fu
42
38
0
08 Mar 2022