Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.06426
Cited By
Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge
16 December 2018
Guangli Li
Lei Liu
Xueying Wang
Xiao-jun Dong
Peng Zhao
Xiaobing Feng
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge"
16 / 16 papers shown
Title
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
41
3
0
18 Apr 2025
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
Xiaolin Li
Binhua Huang
B. Cardiff
Deepu John
41
0
0
31 Jan 2025
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
43
0
0
01 Nov 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
40
3
0
25 Mar 2024
I-SPLIT: Deep Network Interpretability for Split Computing
Federico Cunico
Luigi Capogrosso
Francesco Setti
D. Carra
Franco Fummi
Marco Cristani
35
14
0
23 Sep 2022
Neural Architecture Search for Improving Latency-Accuracy Trade-off in Split Computing
Shoma Shimizu
Takayuki Nishio
Shota Saito
Yoichi Hirose
Yen-Hsiu Chen
Shinichi Shirakawa
34
3
0
30 Aug 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications
Deniz Gunduz
Zhijin Qin
Iñaki Estella Aguerri
Harpreet S. Dhillon
Zhaohui Yang
Aylin Yener
Kai‐Kit Wong
C. Chae
27
432
0
19 Jul 2022
Romanus: Robust Task Offloading in Modular Multi-Sensor Autonomous Driving Systems
Luke Chen
Mohanad Odema
Mohammad Abdullah Al Faruque
27
4
0
18 Jul 2022
Distributed Training for Deep Learning Models On An Edge Computing Network Using ShieldedReinforcement Learning
Tanmoy Sen
Haiying Shen
OffRL
11
5
0
01 Jun 2022
Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Learning
Zhiwei Hao
Guanyu Xu
Yong Luo
Han Hu
Jianping An
Shiwen Mao
24
22
0
24 May 2022
SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems
Xin Dong
B. D. Salvo
Meng Li
Chiao Liu
Zhongnan Qu
H. T. Kung
Ziyun Li
3DGS
26
20
0
10 Apr 2022
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
28
86
0
04 May 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
33
199
0
08 Mar 2021
Split Computing for Complex Object Detectors: Challenges and Preliminary Results
Yoshitomo Matsubara
Marco Levorato
46
24
0
27 Jul 2020
Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence
Shuiguang Deng
Hailiang Zhao
Weijia Fang
Jianwei Yin
Schahram Dustdar
Albert Y. Zomaya
74
605
0
02 Sep 2019
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
337
1,049
0
10 Feb 2017
1