Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.02626
Cited By
Learning both Weights and Connections for Efficient Neural Networks
8 June 2015
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning both Weights and Connections for Efficient Neural Networks"
50 / 1,144 papers shown
Title
Layer-Adaptive State Pruning for Deep State Space Models
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
32
0
0
05 Nov 2024
Efficient Sparse Training with Structured Dropout
Andy Lo
BDL
35
0
0
02 Nov 2024
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
47
0
0
01 Nov 2024
Mutual Information Preserving Neural Network Pruning
Charles Westphal
Stephen Hailes
Mirco Musolesi
59
1
0
31 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
84
5
0
28 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
246
0
0
22 Oct 2024
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
Sjoerd Groot
Qinyu Chen
Jan C. van Gemert
Chang Gao
Mamba
233
0
0
14 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
50
4
0
12 Oct 2024
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Xiang Hao
Chenxiang Ma
Qu Yang
Jibin Wu
Kay Chen Tan
33
0
0
07 Oct 2024
SOI: Scaling Down Computational Complexity by Estimating Partial States of the Model
Grzegorz Stefański
P. Daniluk
Artur Szumaczuk
Jakub Tkaczuk
36
0
0
04 Oct 2024
A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions
Shailja Gupta
Rajesh Ranjan
Surya Narayan Singh
3DV
VLM
AILaw
48
21
0
03 Oct 2024
Investigating the Effect of Network Pruning on Performance and Interpretability
Jonathan von Rad
Florian Seuffert
36
1
0
29 Sep 2024
Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization
Vladimír Boža
Vladimír Macko
45
1
0
27 Sep 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
42
2
0
23 Sep 2024
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds
Zhixing Hou
Maoxu Gao
Hang Yu
Mengyu Yang
Chio-in Ieong
50
1
0
17 Sep 2024
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
Haolin Chen
Philip N. Garner
61
1
0
16 Sep 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Yuezhou Hu
Jun-Jie Zhu
Jianfei Chen
56
0
0
13 Sep 2024
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
51
1
0
12 Sep 2024
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models
Yupeng Su
Ziyi Guan
Xiaoqun Liu
Tianlai Jin
Dongkuan Wu
G. Chesi
Ngai Wong
Hao Yu
47
1
0
20 Aug 2024
Tensor tree learns hidden relational structures in data to construct generative models
Kenji Harada
Tsuyoshi Okubo
Naoki Kawashima
44
3
0
20 Aug 2024
Mask in the Mirror: Implicit Sparsification
Tom Jacobs
R. Burkholz
52
3
0
19 Aug 2024
MoDeGPT: Modular Decomposition for Large Language Model Compression
Chi-Heng Lin
Shangqian Gao
James Seale Smith
Abhishek Patel
Shikhar Tuli
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
71
9
0
19 Aug 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
46
0
0
07 Aug 2024
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
32
0
0
29 Jul 2024
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang
Ruihao Gong
Xianglong Liu
Jing Liu
Yuhang Li
Jiwen Lu
Dacheng Tao
DiffM
MQ
49
0
0
28 Jul 2024
Gaussian Eigen Models for Human Heads
Wojciech Zielonka
Timo Bolkart
Thabo Beeler
Justus Thies
3DGS
55
5
0
05 Jul 2024
Learning Interpretable Differentiable Logic Networks
Chang Yue
N. Jha
NAI
AI4CE
29
0
0
04 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
62
3
0
03 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Min-man Wu
Min Wu
Xiaoli Li
Weisi Lin
ViT
VLM
56
7
0
02 Jul 2024
A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems
Hung Vinh Tran
Tong Chen
Quoc Viet Hung Nguyen
Zi-Rui Huang
Lizhen Cui
Hongzhi Yin
50
1
0
25 Jun 2024
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
Yash Akhauri
Ahmed F. AbouElhamayed
Jordan Dotzel
Zhiru Zhang
Alexander M Rush
Safeen Huda
Mohamed S. Abdelfattah
23
2
0
24 Jun 2024
An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers
Ashim Gupta
Sina Mahdipour Saravani
P. Sadayappan
Vivek Srikumar
37
2
0
17 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
61
4
0
17 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
56
0
0
10 Jun 2024
Optimal Recurrent Network Topologies for Dynamical Systems Reconstruction
Christoph Jürgen Hemmer
Manuel Brenner
Florian Hess
Daniel Durstewitz
43
4
0
07 Jun 2024
GENIE: Watermarking Graph Neural Networks for Link Prediction
Venkata Sai Pranav Bachina
Ankit Gangwal
Aaryan Ajay Sharma
Charu Sharma
53
1
0
07 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
51
6
0
31 May 2024
ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers
Quan Liu
Brandon T. Swartz
Ivan Kravchenko
Jason G Valentine
Yuankai Huo
38
2
0
27 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
55
0
0
13 May 2024
Pruning as a Domain-specific LLM Extractor
Nan Zhang
Yanchi Liu
Xujiang Zhao
Wei Cheng
Runxue Bao
Rui Zhang
Prasenjit Mitra
Haifeng Chen
28
9
0
10 May 2024
Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes
Ruihao Gong
Yang Yong
Zining Wang
Jinyang Guo
Xiuying Wei
Yuqing Ma
Xianglong Liu
59
5
0
09 May 2024
Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
Shuo Shao
Yiming Li
Hongwei Yao
Yiling He
Zhan Qin
Kui Ren
37
14
0
08 May 2024
Iterative Filter Pruning for Concatenation-based CNN Architectures
Svetlana Pavlitska
Oliver Bagge
Federico Nicolás Peccia
Toghrul Mammadov
J. Marius Zöllner
VLM
3DPC
48
2
0
04 May 2024
The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
Victor Quétu
Zhu Liao
Enzo Tartaglione
54
4
0
27 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
45
1
0
24 Apr 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
43
2
0
22 Apr 2024
SparseDM: Toward Sparse Efficient Diffusion Models
Kafeng Wang
Jianfei Chen
He Li
Zhenpeng Mi
Jun-Jie Zhu
DiffM
70
8
0
16 Apr 2024
ONNXPruner: ONNX-Based General Model Pruning Adapter
Dongdong Ren
Wenbin Li
Tianyu Ding
Lei Wang
Qi Fan
Jing Huo
Hongbing Pan
Yang Gao
51
3
0
10 Apr 2024
Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Hongchuan Zeng
Hongshen Xu
Lu Chen
Kai Yu
59
5
0
06 Apr 2024
Are Compressed Language Models Less Subgroup Robust?
Leonidas Gee
Andrea Zugarini
Novi Quadrianto
35
1
0
26 Mar 2024
Previous
1
2
3
4
5
...
21
22
23
Next