1709.01686
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
6 September 2017
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
Papers citing
"BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks"
50 / 251 papers shown
Title
The Case for Hierarchical Deep Learning Inference at the Network Edge
Ghina Al-Atat
Andrea Fresa
Adarsh Prasad Behera
Vishnu Narayanan Moothedath
James Gross
J. Champati
73
8
0
23 Apr 2023
Towards Carbon-Neutral Edge Computing: Greening Edge AI by Harnessing Spot and Future Carbon Markets
Huirong Ma
Zhi Zhou
Xiaoxi Zhang
Xu Chen
72
12
0
22 Apr 2023
DynamicDet: A Unified Dynamic Architecture for Object Detection
Zhi-Hao Lin
Yongtao Wang
Jinhe Zhang
Xiaojie Chu
ObjD
83
31
0
12 Apr 2023
Revisiting Single-gated Mixtures of Experts
Amelie Royer
I. Karmanov
Andrii Skliar
B. Bejnordi
Tijmen Blankevoort
MoE
MoMe
73
6
0
11 Apr 2023
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
Yuhang Li
Tamar Geller
Youngeun Kim
Priyadarshini Panda
102
41
0
02 Apr 2023
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction
Xiaotao Hu
Zhewei Huang
Ailin Huang
Jun Xu
Shuchang Zhou
VGen
102
71
0
17 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
76
2
0
15 Mar 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
91
13
0
16 Feb 2023
Fixing Overconfidence in Dynamic Neural Networks
Lassi Meronen
Martin Trapp
Andrea Pilzer
Le Yang
Arno Solin
BDL
127
16
0
13 Feb 2023
Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits
Simone Sarti
Eugenio Lomurno
Matteo Matteucci
57
4
0
28 Jan 2023
Adaptive Deep Neural Network Inference Optimization with EENet
Fatih Ilhan
Ka-Ho Chow
Sihao Hu
Tiansheng Huang
Selim Tekin
...
Myungjin Lee
Ramana Rao Kompella
Hugo Latapie
Gan Liu
Ling Liu
82
11
0
15 Jan 2023
Fair Multi-Exit Framework for Facial Attribute Classification
Ching-Hao Chiu
Hao-Wei Chung
Yu-Jen Chen
Yiyu Shi
Tsung-Yi Ho
CVBM
68
4
0
08 Jan 2023
AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction
Yachen Yan
Liubo Li
84
3
0
06 Jan 2023
Holistic Network Virtualization and Pervasive Network Intelligence for 6G
Xuemin Shen
Jie Gao
Wen Wu
Mushu Li
Conghao Zhou
W. Zhuang
106
238
0
02 Jan 2023
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
Wen Wu
Peng Yang
Weiting Zhang
Conghao Zhou
Xuemin Shen
124
108
0
31 Dec 2022
QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures
Devdhar Patel
H. Siegelmann
OnRL
83
1
0
25 Dec 2022
SplitGP: Achieving Both Generalization and Personalization in Federated Learning
Dong-Jun Han
Do-Yeon Kim
Minseok Choi
Christopher G. Brinton
Jaekyun Moon
FedML
73
34
0
16 Dec 2022
Slimmable Pruned Neural Networks
Hideaki Kuratsu
Atsuyoshi Nakamura
104
2
0
07 Dec 2022
HADAS: Hardware-Aware Dynamic Neural Architecture Search for Edge Performance Scaling
Halima Bouzidi
Mohanad Odema
Hamza Ouarnoughi
Mohammad Abdullah Al Faruque
Smail Niar
82
19
0
06 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
81
2
0
06 Dec 2022
Understanding the Robustness of Multi-Exit Models under Common Corruptions
Akshay Mehra
Skyler Seto
Navdeep Jaitly
B. Theobald
AAML
86
4
0
03 Dec 2022
Boosted Dynamic Neural Networks
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
103
8
0
30 Nov 2022
Edge Video Analytics: A Survey on Applications, Systems and Enabling Techniques
Renjie Xu
S. Razavi
Rong Zheng
114
21
0
28 Nov 2022
Layer-Stack Temperature Scaling
Amr Khalifa
Michael C. Mozer
Hanie Sedghi
Behnam Neyshabur
Ibrahim Alabdulmohsin
146
2
0
18 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
49
1
0
16 Nov 2022
Personalized Federated Learning with Multi-branch Architecture
Junki Mori
T. Yoshiyama
Ryo Furukawa
Isamu Teranishi
FedML
103
2
0
15 Nov 2022
Enabling AI Quality Control via Feature Hierarchical Edge Inference
Jinhyuk Choi
Seongun Kim
Seung-Woo Ko
45
0
0
15 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLM
LRM
61
6
0
13 Nov 2022
Efficient Graph Neural Network Inference at Large Scale
Xin-pu Gao
Wentao Zhang
Yingxia Shao
Quoc Viet Hung Nguyen
Tengjiao Wang
Hongzhi Yin
AI4CE
GNN
119
8
0
01 Nov 2022
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
Bowen Shen
Zheng Lin
Yuanxin Liu
Zhengxiao Liu
Lei Wang
Weiping Wang
VLM
77
5
0
27 Oct 2022
Efficiently Controlling Multiple Risks with Pareto Testing
Bracha Laufer-Goldshtein
Adam Fisch
Regina Barzilay
Tommi Jaakkola
196
16
0
14 Oct 2022
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning
Tiannan Wang
Wangchunshu Zhou
Yan Zeng
Xinsong Zhang
VLM
82
44
0
14 Oct 2022
Bandwidth-efficient distributed neural network architectures with application to body sensor networks
Thomas Strypsteen
Alexander Bertrand
35
1
0
14 Oct 2022
DeepPerform: An Efficient Approach for Performance Testing of Resource-Constrained Neural Networks
Simin Chen
Mirazul Haque
Cong Liu
Wei Yang
110
22
0
10 Oct 2022
In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks
Kaibin Huang
Hai Wu
Zhiyan Liu
Xiaojuan Qi
72
10
0
07 Oct 2022
Tuning of Mixture-of-Experts Mixed-Precision Neural Networks
Fabian Tschopp
FedML
MoE
11
0
0
29 Sep 2022
Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural Networks on Edge NPUs
Alexandros Kouris
Stylianos I. Venieris
Stefanos Laskaridis
Nicholas D. Lane
101
8
0
27 Sep 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
155
0
0
24 Sep 2022
Unsupervised Early Exit in DNNs with Multiple Exits
U. HariNarayanN
M. Hanawal
Avinash Bhardwaj
70
11
0
20 Sep 2022
Human Activity Recognition on Microcontrollers with Quantized and Adaptive Deep Neural Networks
Francesco Daghero
Luca Bompani
Chen Xie
Marco Castellano
Luca Gandolfi
A. Calimera
Enrico Macii
Massimo Poncino
Daniele Jahier Pagliari
BDL
HAI
63
24
0
02 Sep 2022
Generalization In Multi-Objective Machine Learning
Peter Súkeník
Christoph H. Lampert
AI4CE
87
6
0
29 Aug 2022
Auditing Membership Leakages of Multi-Exit Networks
Zheng Li
Yiyong Liu
Xinlei He
Ning Yu
Michael Backes
Yang Zhang
AAML
73
34
0
23 Aug 2022
Edge-centric Optimization of Multi-modal ML-driven eHealth Applications
A. Kanduri
Sina Shahhosseini
Emad Kasaeyan Naeini
Hamid Alikhani
P. Liljeberg
N. Dutt
Amir M. Rahmani
93
7
0
04 Aug 2022
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers
Ji Xin
Raphael Tang
Zhiying Jiang
Yaoliang Yu
Jimmy J. Lin
45
1
0
31 Jul 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
70
19
0
20 Jul 2022
A Survey on Collaborative DNN Inference for Edge Intelligence
Weiqing Ren
Yuben Qu
Chao Dong
Yuqian Jing
Hao Sun
Qihui Wu
Song Guo
87
54
0
16 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
74
3
0
05 Jul 2022
A Feature Memory Rearrangement Network for Visual Inspection of Textured Surface Defects Toward Edge Intelligent Manufacturing
Haiming Yao
Wen-yong Yu
Xue Wang
59
41
0
22 Jun 2022
Binary Early-Exit Network for Adaptive Inference on Low-Resource Devices
Aaqib Saeed
MQ
25
1
0
17 Jun 2022
Switchable Representation Learning Framework with Self-compatibility
Shengsen Wu
Yan Bai
Yihang Lou
Xiongkun Linghu
Jianzhong He
Ling-yu Duan
109
1
0
16 Jun 2022