BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

6 September 2017
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
    UQCV

Papers citing "BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks"

50 / 172 papers shown
Fixing Overconfidence in Dynamic Neural Networks
Lassi Meronen
Martin Trapp
Andrea Pilzer
Le Yang
Arno Solin
BDL
37
16
0
13 Feb 2023
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
29
12
0
29 Jan 2023
Anticipate, Ensemble and Prune: Improving Convolutional Neural Networks via Aggregated Early Exits
Simone Sarti
Eugenio Lomurno
Matteo Matteucci
27
4
0
28 Jan 2023
Adaptive Deep Neural Network Inference Optimization with EENet
Fatih Ilhan
Ka-Ho Chow
Sihao Hu
Tiansheng Huang
Selim Tekin
...
Myungjin Lee
Ramana Rao Kompella
Hugo Latapie
Gan Liu
Ling Liu
41
11
0
15 Jan 2023
AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction
Yachen Yan
Liubo Li
22
3
0
06 Jan 2023
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
Wen Wu
Peng Yang
Weiting Zhang
Conghao Zhou
Xuemin Shen
24
103
0
31 Dec 2022
SplitGP: Achieving Both Generalization and Personalization in Federated Learning
Dong-Jun Han
Do-Yeon Kim
Minseok Choi
Christopher G. Brinton
Jaekyun Moon
FedML
26
31
0
16 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
32
2
0
06 Dec 2022
Understanding the Robustness of Multi-Exit Models under Common Corruptions
Akshay Mehra
Skyler Seto
Navdeep Jaitly
B. Theobald
AAML
24
3
0
03 Dec 2022
Boosted Dynamic Neural Networks
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
35
7
0
30 Nov 2022
Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing
Kunjal Panchal
Sunav Choudhary
Nisarg Parikh
Lijun Zhang
Hui Guan
37
5
0
28 Nov 2022
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model
Sheng Tang
Yaqing Wang
Zhenglun Kong
Tianchi Zhang
Yao Li
Caiwen Ding
Yanzhi Wang
Yi Liang
Dongkuan Xu
33
32
0
21 Nov 2022
Layer-Stack Temperature Scaling
Amr Khalifa
Michael C. Mozer
Hanie Sedghi
Behnam Neyshabur
Ibrahim M. Alabdulmohsin
81
2
0
18 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
29
1
0
16 Nov 2022
Personalized Federated Learning with Multi-branch Architecture
Junki Mori
T. Yoshiyama
Ryo Furukawa
Isamu Teranishi
FedML
31
2
0
15 Nov 2022
Enabling AI Quality Control via Feature Hierarchical Edge Inference
Jinhyuk Choi
Seongun Kim
Seung-Woo Ko
15
0
0
15 Nov 2022
FPT: Improving Prompt Tuning Efficiency via Progressive Training
Yufei Huang
Yujia Qin
Huadong Wang
Yichun Yin
Maosong Sun
Zhiyuan Liu
Qun Liu
VLM
LRM
35
6
0
13 Nov 2022
Avoid Overthinking in Self-Supervised Models for Speech Recognition
Dan Berrebbi
Brian Yan
Shinji Watanabe
LRM
26
4
0
01 Nov 2022
Efficient Graph Neural Network Inference at Large Scale
Xin-pu Gao
Wentao Zhang
Yingxia Shao
Quoc Viet Hung Nguyen
Bin Cui
Hongzhi Yin
AI4CE
GNN
62
8
0
01 Nov 2022
Class Based Thresholding in Early Exit Semantic Segmentation Networks
Alperen Görmez
Erdem Koyuncu
23
5
0
27 Oct 2022
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
Bowen Shen
Zheng Lin
Yuanxin Liu
Zhengxiao Liu
Lei Wang
Weiping Wang
VLM
52
4
0
27 Oct 2022
Efficiently Controlling Multiple Risks with Pareto Testing
Bracha Laufer-Goldshtein
Adam Fisch
Regina Barzilay
Tommi Jaakkola
38
16
0
14 Oct 2022
Edge-Cloud Cooperation for DNN Inference via Reinforcement Learning and Supervised Learning
Tinghao Zhang
Zhijun Li
Yongrui Chen
Kwok-Yan Lam
Jun Zhao
13
4
0
11 Oct 2022
In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks
Kaibin Huang
Hai Wu
Zhiyan Liu
Xiaojuan Qi
19
9
0
07 Oct 2022
Fluid Batching: Exit-Aware Preemptive Serving of Early-Exit Neural Networks on Edge NPUs
Alexandros Kouris
Stylianos I. Venieris
Stefanos Laskaridis
Nicholas D. Lane
42
8
0
27 Sep 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
25
0
0
24 Sep 2022
Unsupervised Early Exit in DNNs with Multiple Exits
Hari Narayan N U
M. Hanawal
Avinash Bhardwaj
29
10
0
20 Sep 2022
Edge-centric Optimization of Multi-modal ML-driven eHealth Applications
A. Kanduri
Sina Shahhosseini
Emad Kasaeyan Naeini
Hamid Alikhani
P. Liljeberg
N. Dutt
Amir M. Rahmani
38
7
0
04 Aug 2022
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers
Ji Xin
Raphael Tang
Zhiying Jiang
Yaoliang Yu
Jimmy J. Lin
20
1
0
31 Jul 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
27
14
0
20 Jul 2022
A Survey on Collaborative DNN Inference for Edge Intelligence
Weiqing Ren
Yuben Qu
Chao Dong
Yuqian Jing
Hao Sun
Qihui Wu
Song Guo
36
49
0
16 Jul 2022
Pruning Early Exit Networks
Alperen Görmez
Erdem Koyuncu
41
5
0
08 Jul 2022
Learning to Accelerate Approximate Methods for Solving Integer Programming via Early Fixing
Longkang Li
Baoyuan Wu
26
3
0
05 Jul 2022
PICO: Pipeline Inference Framework for Versatile CNNs on Diverse Mobile Devices
Xiang Yang
Zikang Xu
Q. Qi
Jingyu Wang
Haifeng Sun
J. Liao
Song Guo
21
11
0
17 Jun 2022
Switchable Representation Learning Framework with Self-compatibility
Shengsen Wu
Yan Bai
Yihang Lou
Xiongkun Linghu
Jianzhong He
Ling-yu Duan
24
1
0
16 Jun 2022
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework
Jani Boutellier
Bo Tan
J. Nurmi
24
2
0
16 Jun 2022
Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference
Xiangjie Li
Chen Lou
Zhengping Zhu
Yuchi Chen
Yingtao Shen
Yehan Ma
An Zou
27
21
0
09 Jun 2022
DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks
Y. Fu
Haichuan Yang
Jiayi Yuan
Meng Li
Cheng Wan
Raghuraman Krishnamoorthi
Vikas Chandra
Yingyan Lin
36
19
0
02 Jun 2022
Transkimmer: Transformer Learns to Layer-wise Skim
Yue Guan
Zhengyi Li
Jingwen Leng
Zhouhan Lin
Minyi Guo
80
38
0
15 May 2022
Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection
Mingyu Yang
Yu Chen
Hun-Seok Kim
44
27
0
12 May 2022
A Safety Assurable Human-Inspired Perception Architecture
Rick Salay
Krzysztof Czarnecki
AAML
AI4CE
26
1
0
10 May 2022
A Closer Look at Branch Classifiers of Multi-exit Architectures
Shaohui Lin
Bo Ji
Rongrong Ji
Angela Yao
14
4
0
28 Apr 2022
CONTINUER: Maintaining Distributed DNN Services During Edge Failures
A. Majeed
Peter Kilpatrick
I. Spence
Blesson Varghese
16
0
0
25 Apr 2022
Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi
Mohammed Hasanuzzaman
Chandra Thapa
Haithem Afli
T. Scully
48
22
0
07 Apr 2022
PALBERT: Teaching ALBERT to Ponder
Nikita Balagansky
Daniil Gavrilov
MoE
29
6
0
07 Apr 2022
OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses
Robik Shrestha
Kushal Kafle
Christopher Kanan
CML
35
13
0
05 Apr 2022
Dynamic Multimodal Fusion
Zihui Xue
R. Marculescu
43
48
0
31 Mar 2022
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning
Utku Evci
Vincent Dumoulin
Hugo Larochelle
Michael C. Mozer
30
83
0
10 Jan 2022
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
Chris Rohlfs
31
4
0
05 Jan 2022
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
49
0
0
21 Dec 2021