ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.02626
  4. Cited By
Learning both Weights and Connections for Efficient Neural Networks

Learning both Weights and Connections for Efficient Neural Networks

8 June 2015
Song Han
Jeff Pool
J. Tran
W. Dally
    CVBM
ArXivPDFHTML

Papers citing "Learning both Weights and Connections for Efficient Neural Networks"

50 / 1,144 papers shown
Title
Layer-Adaptive State Pruning for Deep State Space Models
Layer-Adaptive State Pruning for Deep State Space Models
Minseon Gwak
Seongrok Moon
Joohwan Ko
PooGyeon Park
32
0
0
05 Nov 2024
Efficient Sparse Training with Structured Dropout
Efficient Sparse Training with Structured Dropout
Andy Lo
BDL
35
0
0
02 Nov 2024
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
47
0
0
01 Nov 2024
Mutual Information Preserving Neural Network Pruning
Mutual Information Preserving Neural Network Pruning
Charles Westphal
Stephen Hailes
Mirco Musolesi
59
1
0
31 Oct 2024
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA
Sangmin Bae
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Seungyeon Kim
Tal Schuster
KELM
84
5
0
28 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
246
0
0
22 Oct 2024
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning
Sjoerd Groot
Qinyu Chen
Jan C. van Gemert
Chang Gao
Mamba
233
0
0
14 Oct 2024
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Wenlong Deng
Yize Zhao
V. Vakilian
Minghui Chen
Xiaoxiao Li
Christos Thrampoulidis
50
4
0
12 Oct 2024
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with
  Spiking-FullSubNet
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Xiang Hao
Chenxiang Ma
Qu Yang
Jibin Wu
Kay Chen Tan
33
0
0
07 Oct 2024
SOI: Scaling Down Computational Complexity by Estimating Partial States
  of the Model
SOI: Scaling Down Computational Complexity by Estimating Partial States of the Model
Grzegorz Stefański
P. Daniluk
Artur Szumaczuk
Jakub Tkaczuk
36
0
0
04 Oct 2024
A Comprehensive Survey of Retrieval-Augmented Generation (RAG):
  Evolution, Current Landscape and Future Directions
A Comprehensive Survey of Retrieval-Augmented Generation (RAG): Evolution, Current Landscape and Future Directions
Shailja Gupta
Rajesh Ranjan
Surya Narayan Singh
3DV
VLM
AILaw
48
21
0
03 Oct 2024
Investigating the Effect of Network Pruning on Performance and Interpretability
Investigating the Effect of Network Pruning on Performance and Interpretability
Jonathan von Rad
Florian Seuffert
36
1
0
29 Sep 2024
Two Sparse Matrices are Better than One: Sparsifying Neural Networks
  with Double Sparse Factorization
Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization
Vladimír Boža
Vladimír Macko
45
1
0
27 Sep 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and
  Sub-Network Selection
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
Alireza Ganjdanesh
Yan Kang
Yuchen Liu
Richard Y. Zhang
Zhe Lin
Heng Huang
DiffM
42
2
0
23 Sep 2024
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable
  Channel-Wise Membrane Thresholds
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds
Zhixing Hou
Maoxu Gao
Hang Yu
Mengyu Yang
Chio-in Ieong
50
1
0
17 Sep 2024
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
Haolin Chen
Philip N. Garner
61
1
0
16 Sep 2024
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
Yuezhou Hu
Jun-Jie Zhu
Jianfei Chen
56
0
0
13 Sep 2024
Structured Pruning for Efficient Visual Place Recognition
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
51
1
0
12 Sep 2024
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for
  Large Language Models
LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models
Yupeng Su
Ziyi Guan
Xiaoqun Liu
Tianlai Jin
Dongkuan Wu
G. Chesi
Ngai Wong
Hao Yu
47
1
0
20 Aug 2024
Tensor tree learns hidden relational structures in data to construct generative models
Tensor tree learns hidden relational structures in data to construct generative models
Kenji Harada
Tsuyoshi Okubo
Naoki Kawashima
44
3
0
20 Aug 2024
Mask in the Mirror: Implicit Sparsification
Mask in the Mirror: Implicit Sparsification
Tom Jacobs
R. Burkholz
52
3
0
19 Aug 2024
MoDeGPT: Modular Decomposition for Large Language Model Compression
MoDeGPT: Modular Decomposition for Large Language Model Compression
Chi-Heng Lin
Shangqian Gao
James Seale Smith
Abhishek Patel
Shikhar Tuli
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
71
9
0
19 Aug 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
46
0
0
07 Aug 2024
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile
  Devices
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
32
0
0
29 Jul 2024
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Temporal Feature Matters: A Framework for Diffusion Model Quantization
Yushi Huang
Ruihao Gong
Xianglong Liu
Jing Liu
Yuhang Li
Jiwen Lu
Dacheng Tao
DiffM
MQ
49
0
0
28 Jul 2024
Gaussian Eigen Models for Human Heads
Gaussian Eigen Models for Human Heads
Wojciech Zielonka
Timo Bolkart
Thabo Beeler
Justus Thies
3DGS
55
5
0
05 Jul 2024
Learning Interpretable Differentiable Logic Networks
Learning Interpretable Differentiable Logic Networks
Chang Yue
N. Jha
NAI
AI4CE
29
0
0
04 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
62
3
0
03 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Min-man Wu
Min Wu
Xiaoli Li
Weisi Lin
ViT
VLM
56
7
0
02 Jul 2024
A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems
A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems
Hung Vinh Tran
Tong Chen
Quoc Viet Hung Nguyen
Zi-Rui Huang
Lizhen Cui
Hongzhi Yin
50
1
0
25 Jun 2024
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models
Yash Akhauri
Ahmed F. AbouElhamayed
Jordan Dotzel
Zhiru Zhang
Alexander M Rush
Safeen Huda
Mohamed S. Abdelfattah
23
2
0
24 Jun 2024
An Empirical Investigation of Matrix Factorization Methods for
  Pre-trained Transformers
An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers
Ashim Gupta
Sina Mahdipour Saravani
P. Sadayappan
Vivek Srikumar
37
2
0
17 Jun 2024
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models
Alireza Ganjdanesh
Reza Shirkavand
Shangqian Gao
Heng Huang
DiffM
VLM
61
4
0
17 Jun 2024
Evaluating Zero-Shot Long-Context LLM Compression
Evaluating Zero-Shot Long-Context LLM Compression
Chenyu Wang
Yihan Wang
Kai Li
56
0
0
10 Jun 2024
Optimal Recurrent Network Topologies for Dynamical Systems
  Reconstruction
Optimal Recurrent Network Topologies for Dynamical Systems Reconstruction
Christoph Jürgen Hemmer
Manuel Brenner
Florian Hess
Daniel Durstewitz
43
4
0
07 Jun 2024
GENIE: Watermarking Graph Neural Networks for Link Prediction
GENIE: Watermarking Graph Neural Networks for Link Prediction
Venkata Sai Pranav Bachina
Ankit Gangwal
Aaryan Ajay Sharma
Charu Sharma
53
1
0
07 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
51
6
0
31 May 2024
ExtremeMETA: High-speed Lightweight Image Segmentation Model by
  Remodeling Multi-channel Metamaterial Imagers
ExtremeMETA: High-speed Lightweight Image Segmentation Model by Remodeling Multi-channel Metamaterial Imagers
Quan Liu
Brandon T. Swartz
Ivan Kravchenko
Jason G Valentine
Yuankai Huo
38
2
0
27 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
55
0
0
13 May 2024
Pruning as a Domain-specific LLM Extractor
Pruning as a Domain-specific LLM Extractor
Nan Zhang
Yanchi Liu
Xujiang Zhao
Wei Cheng
Runxue Bao
Rui Zhang
Prasenjit Mitra
Haifeng Chen
28
9
0
10 May 2024
Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity
  Allocation with Global Constraint in Minutes
Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes
Ruihao Gong
Yang Yong
Zining Wang
Jinyang Guo
Xiuying Wei
Yuqing Ma
Xianglong Liu
59
5
0
09 May 2024
Explanation as a Watermark: Towards Harmless and Multi-bit Model
  Ownership Verification via Watermarking Feature Attribution
Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
Shuo Shao
Yiming Li
Hongwei Yao
Yiling He
Zhan Qin
Kui Ren
37
14
0
08 May 2024
Iterative Filter Pruning for Concatenation-based CNN Architectures
Iterative Filter Pruning for Concatenation-based CNN Architectures
Svetlana Pavlitska
Oliver Bagge
Federico Nicolás Peccia
Toghrul Mammadov
J. Marius Zöllner
VLM
3DPC
48
2
0
04 May 2024
The Simpler The Better: An Entropy-Based Importance Metric To Reduce
  Neural Networks' Depth
The Simpler The Better: An Entropy-Based Importance Metric To Reduce Neural Networks' Depth
Victor Quétu
Zhu Liao
Enzo Tartaglione
54
4
0
27 Apr 2024
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
Lang Qin
Ziming Wang
Runhao Jiang
Rui Yan
Huajin Tang
45
1
0
24 Apr 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at
  Initialization
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
43
2
0
22 Apr 2024
SparseDM: Toward Sparse Efficient Diffusion Models
SparseDM: Toward Sparse Efficient Diffusion Models
Kafeng Wang
Jianfei Chen
He Li
Zhenpeng Mi
Jun-Jie Zhu
DiffM
70
8
0
16 Apr 2024
ONNXPruner: ONNX-Based General Model Pruning Adapter
ONNXPruner: ONNX-Based General Model Pruning Adapter
Dongdong Ren
Wenbin Li
Tianyu Ding
Lei Wang
Qi Fan
Jing Huo
Hongbing Pan
Yang Gao
51
3
0
10 Apr 2024
Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Hongchuan Zeng
Hongshen Xu
Lu Chen
Kai Yu
59
5
0
06 Apr 2024
Are Compressed Language Models Less Subgroup Robust?
Are Compressed Language Models Less Subgroup Robust?
Leonidas Gee
Andrea Zugarini
Novi Quadrianto
35
1
0
26 Mar 2024
Previous
12345...212223
Next