Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.01686
Cited By
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
6 September 2017
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks"
50 / 172 papers shown
Title
Fixing Overconfidence in Dynamic Neural Networks
Lassi Meronen
Martin Trapp
Andrea Pilzer
Le Yang
Arno Solin
BDL
37
16
0
13 Feb 2023
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
29
12
0
29 Jan 2023
Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits
Simone Sarti
Eugenio Lomurno
Matteo Matteucci
27
4
0
28 Jan 2023
Adaptive Deep Neural Network Inference Optimization with EENet
Fatih Ilhan
Ka-Ho Chow
Sihao Hu
Tiansheng Huang
Selim Tekin
...
Myungjin Lee
Ramana Rao Kompella
Hugo Latapie
Gan Liu
Ling Liu
41
11
0
15 Jan 2023
AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction
Yachen Yan
Liubo Li
22
3
0
06 Jan 2023
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
Wen Wu
Peng Yang
Weiting Zhang
Conghao Zhou
Xuemin
X. Shen
24
103
0
31 Dec 2022
SplitGP: Achieving Both Generalization and Personalization in Federated Learning
Dong-Jun Han
Do-Yeon Kim
Minseok Choi
Christopher G. Brinton
Jaekyun Moon
FedML
26
31
0
16 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
32
2
0
06 Dec 2022
Understanding the Robustness of Multi-Exit Models under Common Corruptions
Akshay Mehra
Skyler Seto
Navdeep Jaitly
B. Theobald
AAML
24
3
0
03 Dec 2022
Boosted Dynamic Neural Networks
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
35
7
0
30 Nov 2022
Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing
Kunjal Panchal
Sunav Choudhary
Nisarg Parikh
Lijun Zhang
Hui Guan
37
5
0
28 Nov 2022
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model
Sheng Tang
Yaqing Wang
Zhenglun Kong
Tianchi Zhang
Yao Li
Caiwen Ding
Yanzhi Wang
Yi Liang
Dongkuan Xu
33
32
0
21 Nov 2022
Layer-Stack Temperature Scaling
Amr Khalifa
Michael C. Mozer
Hanie Sedghi
Behnam Neyshabur
Ibrahim M. Alabdulmohsin
81
2
0
18 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
29
1
0
16 Nov 2022
Personalized Federated Learning with Multi-branch Architecture
Junki Mori
T. Yoshiyama
Ryo Furukawa
Isamu Teranishi
FedML
31
2
0
15 Nov 2022
Enabling AI Quality Control via Feature Hierarchical Edge Inference
Jinhyuk Choi
Seongun Kim
Seung-Woo Ko
15
0
0
15 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLM
LRM
35
6
0
13 Nov 2022
Avoid Overthinking in Self-Supervised Models for Speech Recognition
Dan Berrebbi
Brian Yan
Shinji Watanabe
LRM
26
4
0
01 Nov 2022
Efficient Graph Neural Network Inference at Large Scale
Xin-pu Gao
Wentao Zhang
Yingxia Shao
Quoc Viet Hung Nguyen
Bin Cui
Hongzhi Yin
AI4CE
GNN
62
8
0
01 Nov 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
Alperen Görmez
Erdem Koyuncu
23
5
0
27 Oct 2022
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
Bowen Shen
Zheng Lin
Yuanxin Liu
Zhengxiao Liu
Lei Wang
Weiping Wang
VLM
52
4
0
27 Oct 2022
Efficiently Controlling Multiple Risks with Pareto Testing
Bracha Laufer-Goldshtein
Adam Fisch
Regina Barzilay
Tommi Jaakkola
38
16
0
14 Oct 2022
Edge-Cloud Cooperation for DNN Inference via Reinforcement Learning and Supervised Learning
Tinghao Zhang
Zhijun Li
Yongrui Chen
Kwok-Yan Lam
Jun Zhao
13
4
0
11 Oct 2022
In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks
Kaibin Huang
Hai Wu
Zhiyan Liu
Xiaojuan Qi
19
9
0
07 Oct 2022
Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural Networks on Edge NPUs
Alexandros Kouris
Stylianos I. Venieris
Stefanos Laskaridis
Nicholas D. Lane
42
8
0
27 Sep 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
25
0
0
24 Sep 2022
Unsupervised Early Exit in DNNs with Multiple Exits
U. HariNarayanN
M. Hanawal
Avinash Bhardwaj
29
10
0
20 Sep 2022
Edge-centric Optimization of Multi-modal ML-driven eHealth Applications
A. Kanduri
Sina Shahhosseini
Emad Kasaeyan Naeini
Hamid Alikhani
P. Liljeberg
N. Dutt
Amir M. Rahmani
38
7
0
04 Aug 2022
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers
Ji Xin
Raphael Tang
Zhiying Jiang
Yaoliang Yu
Jimmy J. Lin
20
1
0
31 Jul 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
27
14
0
20 Jul 2022
A Survey on Collaborative DNN Inference for Edge Intelligence
Weiqing Ren
Yuben Qu
Chao Dong
Yuqian Jing
Hao Sun
Qihui Wu
Song Guo
36
49
0
16 Jul 2022
Pruning Early Exit Networks
Alperen Görmez
Erdem Koyuncu
41
5
0
08 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
26
3
0
05 Jul 2022
PICO: Pipeline Inference Framework for Versatile CNNs on Diverse Mobile Devices
Xiang Yang
Zikang Xu
Q. Qi
Jingyu Wang
Haifeng Sun
J. Liao
Song Guo
21
11
0
17 Jun 2022
Switchable Representation Learning Framework with Self-compatibility
Shengsen Wu
Yan Bai
Yihang Lou
Xiongkun Linghu
Jianzhong He
Ling-yu Duan
24
1
0
16 Jun 2022
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework
Jani Boutellier
Bo Tan
J. Nurmi
24
2
0
16 Jun 2022
Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference
Xiangjie Li
Chen Lou
Zhengping Zhu
Yuchi Chen
Yingtao Shen
Yehan Ma
An Zou
27
21
0
09 Jun 2022
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Y. Fu
Haichuan Yang
Jiayi Yuan
Meng Li
Cheng Wan
Raghuraman Krishnamoorthi
Vikas Chandra
Yingyan Lin
36
19
0
02 Jun 2022
Transkimmer: Transformer Learns to Layer-wise Skim
Yue Guan
Zhengyi Li
Jingwen Leng
Zhouhan Lin
Minyi Guo
80
38
0
15 May 2022
Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection
Mingyu Yang
Yu Chen
Hun-Seok Kim
44
27
0
12 May 2022
A Safety Assurable Human-Inspired Perception Architecture
Rick Salay
Krzysztof Czarnecki
AAML
AI4CE
26
1
0
10 May 2022
A Closer Look at Branch Classifiers of Multi-exit Architectures
Shaohui Lin
Bo Ji
Rongrong Ji
Angela Yao
14
4
0
28 Apr 2022
CONTINUER: Maintaining Distributed DNN Services During Edge Failures
A. Majeed
Peter Kilpatrick
I. Spence
Blesson Varghese
16
0
0
25 Apr 2022
Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi
Mohammed Hasanuzzaman
Chandra Thapa
Haithem Afli
T. Scully
48
22
0
07 Apr 2022
PALBERT: Teaching ALBERT to Ponder
Nikita Balagansky
Daniil Gavrilov
MoE
29
6
0
07 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
35
13
0
05 Apr 2022
Dynamic Multimodal Fusion
Zihui Xue
R. Marculescu
43
48
0
31 Mar 2022
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning
Utku Evci
Vincent Dumoulin
Hugo Larochelle
Michael C. Mozer
30
83
0
10 Jan 2022
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
Chris Rohlfs
31
4
0
05 Jan 2022
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
49
0
0
21 Dec 2021
Previous
1
2
3
4
Next