Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.09282
Cited By
A Survey of Model Compression and Acceleration for Deep Neural Networks
23 October 2017
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey of Model Compression and Acceleration for Deep Neural Networks"
50 / 125 papers shown
Title
Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning
Sanghwan Bae
Jiwoo Hong
Min Young Lee
Hanbyul Kim
Jeongyeon Nam
Donghyun Kwak
OffRL
LRM
53
3
0
04 Apr 2025
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
154
0
0
29 Oct 2024
Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging
Deyuan Liu
Zhanyue Qin
Hairu Wang
Zhao Yang
Zecheng Wang
...
Zhao Lv
Zhiying Tu
Dianhui Chu
Bo Li
Dianbo Sui
22
2
0
24 Jun 2024
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
Rickard Brüel-Gabrielsson
Jiacheng Zhu
Onkar Bhardwaj
Leshem Choshen
Kristjan Greenewald
Mikhail Yurochkin
Justin Solomon
45
5
0
17 Jun 2024
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
41
2
0
26 Mar 2024
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Jixuan Leng
Yijiang Li
Haohan Wang
VLM
31
0
0
26 Nov 2023
Federated learning compression designed for lightweight communications
Lucas Grativol Ribeiro
Mathieu Léonardon
Guillaume Muller
Virginie Fresse
Matthieu Arzel
FedML
30
3
0
23 Oct 2023
Language Modeling Is Compression
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
...
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
AI4CE
32
131
0
19 Sep 2023
FedDIP: Federated Learning with Extreme Dynamic Pruning and Incremental Regularization
Qianyu Long
Christos Anagnostopoulos
S. P. Parambath
Daning Bi
AI4CE
FedML
23
2
0
13 Sep 2023
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
Habib Hajimolahoseini
Walid Ahmed
Yang Liu
OffRL
MQ
19
6
0
07 Sep 2023
Neural Networks at a Fraction with Pruned Quaternions
Sahel Mohammad Iqbal
Subhankar Mishra
28
4
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
33
3
0
13 Aug 2023
NUPES : Non-Uniform Post-Training Quantization via Power Exponent Search
Edouard Yvinec
Arnaud Dapogny
Kévin Bailly
MQ
24
6
0
10 Aug 2023
Accelerating Distributed ML Training via Selective Synchronization
S. Tyagi
Martin Swany
FedML
29
3
0
16 Jul 2023
Proximity to Losslessly Compressible Parameters
Matthew Farrugia-Roberts
30
0
0
05 Jun 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
41
91
0
20 Apr 2023
Domain Adaptation for Inertial Measurement Unit-based Human Activity Recognition: A Survey
Avijoy Chakma
A. Faridee
Indrajeet Ghosh
Nirmalya Roy
35
4
0
07 Apr 2023
Learning to Zoom and Unzoom
Chittesh Thavamani
Mengtian Li
Francesco Ferroni
Deva Ramanan
22
8
0
27 Mar 2023
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare
Andreas Triantafyllopoulos
Alexander Kathan
Alice Baird
Lukas Christ
Alexander Gebhard
...
Shahin Amiriparian
K. D. Bartl-Pokorny
A. Batliner
Florian B. Pokorny
Björn W. Schuller
44
7
0
25 Jan 2023
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
23
3
0
17 Dec 2022
Unbiased Knowledge Distillation for Recommendation
Gang Chen
Jiawei Chen
Fuli Feng
Sheng Zhou
Xiangnan He
24
27
0
27 Nov 2022
Efficient Incremental Text-to-Speech on GPUs
Muyang Du
Chuan Liu
Jiaxing Qi
Junjie Lai
16
1
0
25 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL
MLT
AI4CE
29
51
0
24 Nov 2022
Learning to Simulate Realistic LiDARs
Benoît Guillard
Sai H. Vemprala
Jayesh K. Gupta
O. Mikšík
Vibhav Vineet
Pascal Fua
Ashish Kapoor
3DPC
11
18
0
22 Sep 2022
R-WhONet: Recalibrated Wheel Odometry Neural Network for Vehicular Positioning using Transfer Learning
Uche Onyekpe
Alicja Szkolnik
Vasile Palade
S. Kanarachos
M. Fitzpatrick
30
1
0
13 Sep 2022
Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing Methods
Ricards Marcinkevics
Ece Ozkan
Julia E. Vogt
16
18
0
26 Jul 2022
FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification
Xiao-Ze Lin
Seungbae Kim
Jungseock Joo
CVBM
34
38
0
22 Jul 2022
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Ji Liu
Daxiang Dong
Xi Wang
An Qin
Xingjian Li
P. Valduriez
Dejing Dou
Dianhai Yu
31
6
0
14 Jul 2022
CEG4N: Counter-Example Guided Neural Network Quantization Refinement
J. Matos
I. Bessa
Edoardo Manino
Xidan Song
Lucas C. Cordeiro
MQ
40
2
0
09 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
30
2
0
02 Jul 2022
Knowledge Distillation for Oriented Object Detection on Aerial Images
Yicheng Xiao
Junpeng Zhang
ObjD
19
1
0
20 Jun 2022
FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification
Aliaksandra Shysheya
J. Bronskill
Massimiliano Patacchiola
Sebastian Nowozin
Richard Turner
3DH
FedML
38
27
0
17 Jun 2022
Zeroth-Order Topological Insights into Iterative Magnitude Pruning
Aishwarya H. Balwani
J. Krzyston
29
2
0
14 Jun 2022
Blueprint Separable Residual Network for Efficient Image Super-Resolution
Zheyu Li
Yingqi Liu
Xiangyu Chen
Haoming Cai
Jinjin Gu
Yu Qiao
Chao Dong
27
131
0
12 May 2022
Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures
Yongji Wu
Matthew Lentz
Danyang Zhuo
Yao Lu
23
22
0
10 May 2022
Physics Community Needs, Tools, and Resources for Machine Learning
Philip C. Harris
E. Katsavounidis
W. McCormack
D. Rankin
Yongbin Feng
...
De-huai Chen
Mark S. Neubauer
Javier Mauricio Duarte
G. Karagiorgi
Miaoyuan Liu
AI4CE
17
3
0
30 Mar 2022
On Neural Network Equivalence Checking using SMT Solvers
Charis Eleftheriadis
Nikolaos Kekatos
Panagiotis Katsaros
S. Tripakis
AAML
24
12
0
22 Mar 2022
Infrastructure-free, Deep Learned Urban Noise Monitoring at
∼
\sim
∼
100mW
Jihoon Yun
Sangeeta Srivastava
Dhrubojyoti Roy
Nathan Stohs
C. Mydlarz
Mahiny A. Salman
Bea Steers
J. P. Bello
Anish Arora
17
5
0
11 Mar 2022
Update Compression for Deep Neural Networks on the Edge
Bo Chen
A. Bakhshi
Gustavo E. A. P. A. Batista
Brian Ng
Tat-Jun Chin
24
17
0
09 Mar 2022
Online Learning for Orchestration of Inference in Multi-User End-Edge-Cloud Networks
Sina Shahhosseini
Dongjoo Seo
A. Kanduri
Tianyi Hu
Sung-Soo Lim
Bryan Donyanavard
Amir M.Rahmani
N. Dutt
27
17
0
21 Feb 2022
Survey on Graph Neural Network Acceleration: An Algorithmic Perspective
Xin Liu
Yurui Lai
Lei Deng
Guoqi Li
Xiaochun Ye
Xiaochun Ye
Shirui Pan
Yuan Xie
GNN
13
41
0
10 Feb 2022
Comparative assessment of federated and centralized machine learning
Ibrahim Abdul Majeed
Sagar Kaushik
Aniruddha Bardhan
Venkata Siva Kumar Tadi
Hwang-Ki Min
K. Kumaraguru
Rajasekhara Reddy Duvvuru Muni
FedML
20
6
0
03 Feb 2022
Explaining Cognitive Computing Through the Information Systems Lens
S. Elnagar
Manoj A. Thomas
16
2
0
16 Jan 2022
The Effect of Model Compression on Fairness in Facial Expression Recognition
Samuil Stoychev
Hatice Gunes
CVBM
27
19
0
05 Jan 2022
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
32
16
0
14 Dec 2021
Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks
A. Mazayev
F. Al-Tam
N. Correia
21
5
0
07 Dec 2021
Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
Bo-Shiuan Chu
Che-Rung Lee
18
11
0
07 Dec 2021
CondenseNeXt: An Ultra-Efficient Deep Neural Network for Embedded Systems
Priyank Kalgaonkar
M. El-Sharkawy
3DH
17
5
0
01 Dec 2021
Nonlinear Tensor Ring Network
Xiao Peng Li
Qi Liu
Hayden Kwok-Hay So
19
0
0
12 Nov 2021
Gabor filter incorporated CNN for compression
Akihiro Imamura
N. Arizumi
CVBM
25
2
0
29 Oct 2021
1
2
3
Next