1506.02626
Learning both Weights and Connections for Efficient Neural Networks
8 June 2015
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
Papers citing
"Learning both Weights and Connections for Efficient Neural Networks"
50 / 1,191 papers shown
Title
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
45
2
0
14 Jun 2022
Energy Consumption Analysis of pruned Semantic Segmentation Networks on an Embedded GPU
Hugo Tessier
Vincent Gripon
Mathieu Léonardon
M. Arzel
David Bertrand
T. Hannagan
GNN
SSeg
3DPC
45
2
0
13 Jun 2022
Leveraging Structured Pruning of Convolutional Neural Networks
Hugo Tessier
Vincent Gripon
Mathieu Léonardon
M. Arzel
David Bertrand
T. Hannagan
CVBM
28
1
0
13 Jun 2022
PAC-Net: A Model Pruning Approach to Inductive Transfer Learning
Sanghoon Myung
I. Huh
Wonik Jang
Jae Myung Choe
Jisu Ryu
Daesin Kim
Kee-Eung Kim
C. Jeong
32
13
0
12 Jun 2022
Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm
Aidan Good
Jia-Huei Lin
Hannah Sieg
Mikey Ferguson
Xin Yu
Shandian Zhe
J. Wieczorek
Thiago Serra
42
11
0
07 Jun 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLM
MQ
78
455
0
04 Jun 2022
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Y. Fu
Haichuan Yang
Jiayi Yuan
Meng Li
Cheng Wan
Raghuraman Krishnamoorthi
Vikas Chandra
Yingyan Lin
41
19
0
02 Jun 2022
ViNNPruner: Visual Interactive Pruning for Deep Learning
U. Schlegel
Samuel Schiegg
Daniel A. Keim
VLM
34
2
0
31 May 2022
Gator: Customizable Channel Pruning of Neural Networks with Gating
E. Passov
E. David
N. Netanyahu
AAML
45
0
0
30 May 2022
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning
Chengfei Lv
Chaoyue Niu
Renjie Gu
Xiaotang Jiang
Zhaode Wang
...
Guohuan Xu
Fei Wu
Shaojie Tang
Fan Wu
Guihai Chen
MoE
LRM
18
38
0
30 May 2022
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
39
120
0
29 May 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
142
2,074
0
27 May 2022
Spartan: Differentiable Sparsity via Regularized Transportation
Kai Sheng Tai
Taipeng Tian
Ser-Nam Lim
39
11
0
27 May 2022
Pruning has a disparate impact on model accuracy
Cuong Tran
Ferdinando Fioretto
Jung-Eun Kim
Rakshit Naidu
50
40
0
26 May 2022
Compression-aware Training of Neural Networks using Frank-Wolfe
Max Zimmer
Christoph Spiegel
Sebastian Pokutta
36
9
0
24 May 2022
MetaNet: Automated Dynamic Selection of Scheduling Policies in Cloud Environments
Shreshth Tuli
G. Casale
N. Jennings
30
2
0
21 May 2022
HyBNN and FedHyBNN: (Federated) Hybrid Binary Neural Networks
Kinshuk Dua
FedML
MQ
29
0
0
19 May 2022
Perturbation of Deep Autoencoder Weights for Model Compression and Classification of Tabular Data
Manar D. Samad
Sakib Abrar
38
12
0
17 May 2022
Sharp asymptotics on the compression of two-layer neural networks
Mohammad Hossein Amani
Simone Bombari
Marco Mondelli
Rattana Pukdee
Stefano Rini
MLT
32
0
0
17 May 2022
Convolutional and Residual Networks Provably Contain Lottery Tickets
R. Burkholz
UQCV
MLT
49
13
0
04 May 2022
Most Activation Functions Can Win the Lottery Without Excessive Depth
R. Burkholz
MLT
84
18
0
04 May 2022
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
Yihan Wang
Zhekai Zhang
Han Cai
Wei-Ming Chen
Song Han
3DH
35
72
0
03 May 2022
Zebra: Memory Bandwidth Reduction for CNN Accelerators With Zero Block Regularization of Activation Maps
Hsu-Tung Shih
Tian-Sheuan Chang
30
3
0
02 May 2022
Sparse Compressed Spiking Neural Network Accelerator for Object Detection
Hong-Han Lien
Tian-Sheuan Chang
27
27
0
02 May 2022
A Closer Look at Branch Classifiers of Multi-exit Architectures
Shaohui Lin
Bo Ji
Rongrong Ji
Angela Yao
22
4
0
28 Apr 2022
Federated Progressive Sparsification (Purge, Merge, Tune)+
Dimitris Stripelis
Umang Gupta
Greg Ver Steeg
J. Ambite
FedML
28
9
0
26 Apr 2022
Attentive Fine-Grained Structured Sparsity for Image Restoration
Junghun Oh
Heewon Kim
Seungjun Nah
Chee Hong
Jonghyun Choi
Kyoung Mu Lee
32
18
0
26 Apr 2022
PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution
Zhijian Liu
Haotian Tang
Shengyu Zhao
Kevin Shao
Song Han
3DPC
26
40
0
25 Apr 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator
Miao Yu
Tingting Xiang
Venkata Pavan Kumar Miriyala
Trevor E. Carlson
28
1
0
20 Apr 2022
Receding Neuron Importances for Structured Pruning
Mihai Suteu
Yike Guo
34
1
0
13 Apr 2022
OMAD: On-device Mental Anomaly Detection for Substance and Non-Substance Users
Emon Dey
Nirmalya Roy
23
5
0
13 Apr 2022
Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi
Mohammed Hasanuzzaman
Chandra Thapa
Haithem Afli
T. Scully
50
23
0
07 Apr 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
40
11
0
06 Apr 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Tri Dao
Beidi Chen
N. Sohoni
Arjun D Desai
Michael Poli
Jessica Grogan
Alexander Liu
Aniruddh Rao
Atri Rudra
Christopher Ré
46
88
0
01 Apr 2022
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
L. D. Prasad
Sreyan Ghosh
S. Umesh
45
12
0
31 Mar 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
34
3
0
30 Mar 2022
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models
Ziqing Yang
Yiming Cui
Zhigang Chen
SyDa
VLM
31
12
0
30 Mar 2022
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
Yu Tang
Chenyu Wang
Yufan Zhang
Yuliang Liu
Xingcheng Zhang
Linbo Qiao
Zhiquan Lai
Dongsheng Li
26
4
0
30 Mar 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
Paul Gavrikov
J. Keuper
AAML
29
31
0
29 Mar 2022
Playing Lottery Tickets in Style Transfer Models
Meihao Kong
Jing Huo
Wenbin Li
Jing Wu
Yu-kun Lai
Yang Gao
29
1
0
25 Mar 2022
Sparse Federated Learning with Hierarchical Personalized Models
Xiaofeng Liu
Qing Wang
Yunfeng Shao
Yinchuan Li
FedML
60
11
0
25 Mar 2022
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption
Sam Leroux
Pieter Simoens
Meelis Lootus
Kartik Thakore
Akshay Sharma
37
16
0
21 Mar 2022
Online Continual Learning for Embedded Devices
Tyler L. Hayes
Christopher Kanan
CLL
45
54
0
21 Mar 2022
Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention
Zuzana Jelčicová
Marian Verhelst
51
5
0
20 Mar 2022
Learning Compressed Embeddings for On-Device Inference
Niketan Pansare
J. Katukuri
Aditya Arora
F. Cipollone
R. Shaik
Noyan Tokgozoglu
Chandru Venkataraman
44
14
0
18 Mar 2022
Improve Convolutional Neural Network Pruning by Maximizing Filter Variety
Nathan Hubens
M. Mancas
B. Gosselin
Marius Preda
T. Zaharia
21
2
0
11 Mar 2022
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning
Guyue Huang
Haoran Li
Minghai Qin
Fei Sun
Yufei Ding
Yuan Xie
38
18
0
09 Mar 2022
The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks
Xin Yu
Thiago Serra
Srikumar Ramalingam
Shandian Zhe
49
48
0
09 Mar 2022
Pruning Graph Convolutional Networks to select meaningful graph frequencies for fMRI decoding
Yassine El Ouahidi
Hugo Tessier
G. Lioi
Nicolas Farrugia
Bastien Pasdeloup
Vincent Gripon
GNN
46
2
0
09 Mar 2022
Dual Lottery Ticket Hypothesis
Yue Bai
Haiquan Wang
Zhiqiang Tao
Kunpeng Li
Yun Fu
42
38
0
08 Mar 2022