Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1908.09791
Cited By
Once-for-All: Train One Network and Specialize it for Efficient Deployment
26 August 2019
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Once-for-All: Train One Network and Specialize it for Efficient Deployment"
50 / 257 papers shown
Title
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Chong Yu
Tao Chen
Zhongxue Gan
Jiayuan Fan
MQ
ViT
30
23
0
18 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
36
0
0
13 May 2023
Explainable Knowledge Distillation for On-device Chest X-Ray Classification
C. Termritthikun
Ayaz Umer
Suwichaya Suwanwimolkul
Feng Xia
Ivan Lee
24
13
0
10 May 2023
Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence
Y. Fu
Yuecheng Li
Chenghui Li
Jason M. Saragih
Peizhao Zhang
Xiaoliang Dai
Yingyan Lin
3DH
44
2
0
24 Apr 2023
Device management and network connectivity as missing elements in TinyML landscape
T. Szydlo
M. Nagy
27
2
0
23 Apr 2023
SSS3D: Fast Neural Architecture Search For Efficient Three-Dimensional Semantic Segmentation
O. Therrien
Marihan Amein
Zhuoran Xiong
W. Gross
B. Meyer
3DPC
31
0
0
21 Apr 2023
Small-footprint slimmable networks for keyword spotting
Zuhaib Akhtar
Mohammad Omar Khursheed
Dongsu Du
Yuzong Liu
30
2
0
21 Apr 2023
Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent Transportation
Yifeng Shi
Feng Lv
Xinliang Wang
Chunlong Xia
Shaojie Li
Shu-Zhen Yang
Teng Xi
Gang Zhang
VLM
43
13
0
12 Apr 2023
MemeFier: Dual-stage Modality Fusion for Image Meme Classification
C. Koutlis
Manos Schinas
Symeon Papadopoulos
19
12
0
06 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
26
46
0
30 Mar 2023
System-status-aware Adaptive Network for Online Streaming Video Understanding
Lin Geng Foo
Jia Gong
Zhipeng Fan
Xiaozhong Liu
AI4TS
32
15
0
28 Mar 2023
DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter
Yuiko Sakuma
Masato Ishii
T. Narihira
36
2
0
23 Mar 2023
ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement
Chaojian Li
Wenwan Chen
Jiayi Yuan
Yingyan Lin
Ashutosh Sabharwal
25
0
0
19 Mar 2023
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models
Yucheng Ding
Chaoyue Niu
Fan Wu
Shaojie Tang
Chengfei Lyu
Guihai Chen
24
6
0
18 Mar 2023
Efficient Transformer-based 3D Object Detection with Dynamic Token Halting
Mao Ye
Gregory P. Meyer
Yuning Chai
Qiang Liu
32
8
0
09 Mar 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
101
0
27 Feb 2023
Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training
Xiaoying Zhi
Varun Babbar
P. Sun
Fran Silavong
Ruibo Shi
Sean J. Moran
Sean Moran
42
1
0
17 Feb 2023
XploreNAS: Explore Adversarially Robust & Hardware-efficient Neural Architectures for Non-ideal Xbars
Abhiroop Bhattacharjee
Abhishek Moitra
Priyadarshini Panda
AAML
20
1
0
15 Feb 2023
Q-Diffusion: Quantizing Diffusion Models
Xiuyu Li
Yijia Liu
Long Lian
Hua Yang
Zhen Dong
Daniel Kang
Shanghang Zhang
Kurt Keutzer
DiffM
MQ
38
152
0
08 Feb 2023
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits
Simone Sarti
Eugenio Lomurno
Andrea Falanti
Matteo Matteucci
23
3
0
03 Feb 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
36
60
0
26 Jan 2023
Enabling Hard Constraints in Differentiable Neural Network and Accelerator Co-Exploration
Deokki Hong
Kanghyun Choi
Hyeyoon Lee
Joonsang Yu
Noseong Park
Youngsok Kim
Jinho Lee
19
3
0
23 Jan 2023
β
β
β
-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search
Peng Ye
Tong He
Baopu Li
Tao Chen
Lei Bai
Wanli Ouyang
OOD
46
7
0
16 Jan 2023
Efficient Evaluation Methods for Neural Architecture Search: A Survey
Xiangning Xie
Xiaotian Song
Zeqiong Lv
Gary G. Yen
Weiping Ding
Yizhou Sun
32
12
0
14 Jan 2023
High-Throughput, High-Performance Deep Learning-Driven Light Guide Plate Surface Visual Quality Inspection Tailored for Real-World Manufacturing Environments
Carol Xu
M. Famouri
Gautam Bathla
M. Shafiee
Alexander Wong
23
2
0
20 Dec 2022
A Study on the Intersection of GPU Utilization and CNN Inference
J. Kosaian
Amar Phanishayee
23
3
0
15 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
24
2
0
10 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
26
2
0
06 Dec 2022
GENNAPE: Towards Generalized Neural Architecture Performance Estimators
Keith G. Mills
Fred X. Han
Jialin Zhang
Fabián A. Chudak
A. Mamaghani
Mohammad Salameh
Wei Lu
Shangling Jui
Di Niu
24
4
0
30 Nov 2022
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
Jiaqi Gu
Ben Keller
Jean Kossaifi
Anima Anandkumar
Brucek Khailany
David Z. Pan
ViT
35
8
0
30 Nov 2022
SteppingNet: A Stepping Neural Network with Incremental Accuracy Enhancement
Wenhao Sun
Grace Li Zhang
Xunzhao Yin
Cheng Zhuo
Huaxi Gu
Bing Li
Ulf Schlichtmann
15
1
0
27 Nov 2022
GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Chaoting Xu
Yunhe Wang
20
270
0
23 Nov 2022
NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction
Yun Yi
Haokui Zhang
Wenze Hu
Nannan Wang
Xiaoyu Wang
AI4TS
AI4CE
32
8
0
15 Nov 2022
Multi-Objective Evolutionary for Object Detection Mobile Architectures Search
Haichao Zhang
Jiashi Li
Xin Xia
K. Hao
Xuefeng Xiao
39
2
0
05 Nov 2022
Once-for-All Sequence Compression for Self-Supervised Speech Models
Hsuan-Jui Chen
Yen Meng
Hung-yi Lee
30
4
0
04 Nov 2022
QuaLA-MiniLM: a Quantized Length Adaptive MiniLM
Shira Guskin
Moshe Wasserblat
Chang Wang
Haihao Shen
MQ
11
2
0
31 Oct 2022
PredNAS: A Universal and Sample Efficient Neural Architecture Search Framework
Liuchun Yuan
Zehao Huang
Naiyan Wang
29
0
0
26 Oct 2022
Efficiently Controlling Multiple Risks with Pareto Testing
Bracha Laufer-Goldshtein
Adam Fisch
Regina Barzilay
Tommi Jaakkola
36
16
0
14 Oct 2022
Pareto-aware Neural Architecture Generation for Diverse Computational Budgets
Yong Guo
Yaofo Chen
Yin Zheng
Qi Chen
P. Zhao
Jian Chen
Junzhou Huang
Mingkui Tan
28
5
0
14 Oct 2022
Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing
Peng Ye
Shengji Tang
Baopu Li
Tao Chen
Wanli Ouyang
31
13
0
09 Oct 2022
Demystifying Map Space Exploration for NPUs
Sheng-Chun Kao
A. Parashar
Po-An Tsai
T. Krishna
38
11
0
07 Oct 2022
In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks
Kaibin Huang
Hai Wu
Zhiyan Liu
Xiaojuan Qi
11
9
0
07 Oct 2022
Designing and Training of Lightweight Neural Networks on Edge Devices using Early Halting in Knowledge Distillation
Rahul Mishra
Hari Prabhat Gupta
40
8
0
30 Sep 2022
Slimmable Networks for Contrastive Self-supervised Learning
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yi Yang
35
1
0
30 Sep 2022
Towards Regression-Free Neural Networks for Diverse Compute Platforms
Rahul Duggal
Hao Zhou
Shuo Yang
Jun Fang
Yuanjun Xiong
Wei Xia
UQCV
31
1
0
27 Sep 2022
Searching a High-Performance Feature Extractor for Text Recognition Network
Hui Zhang
Quanming Yao
James T. Kwok
X. Bai
28
7
0
27 Sep 2022
Tiered Pruning for Efficient Differentialble Inference-Aware Neural Architecture Search
Slawomir Kierat
Mateusz Sieniawski
Denys Fridman
Chendi Yu
Szymon Migacz
Pawel M. Morkisz
A. Fit-Florea
3DPC
19
0
0
23 Sep 2022
EZNAS: Evolving Zero Cost Proxies For Neural Architecture Scoring
Yash Akhauri
J. P. Muñoz
Nilesh Jain
Ravi Iyer
46
13
0
15 Sep 2022
You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms
Xiangzhong Luo
Di Liu
Hao Kong
Shuo Huai
Hui Chen
Weichen Liu
36
9
0
30 Aug 2022
Hardware-aware mobile building block evaluation for computer vision
Maxim Bonnaerens
Matthias Anton Freiberger
Marian Verhelst
J. Dambre
27
1
0
26 Aug 2022
Previous
1
2
3
4
5
6
Next