Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.03443
Cited By
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
9 December 2018
Bichen Wu
Xiaoliang Dai
Peizhao Zhang
Yanghan Wang
Fei Sun
Yiming Wu
Yuandong Tian
Peter Vajda
Yangqing Jia
Kurt Keutzer
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search"
50 / 297 papers shown
Title
Differentiable Channel Selection in Self-Attention For Person Re-Identification
Yancheng Wang
Nebojsa Jojic
Yingzhen Yang
29
0
0
13 May 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
54
1
0
08 Mar 2025
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
Xingrun Xing
Zheng Liu
Shitao Xiao
Boyan Gao
Yiming Liang
Wanpeng Zhang
Haokun Lin
Guoqi Li
Jiajun Zhang
LRM
64
1
0
10 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
72
0
0
26 Jan 2025
Improving Accuracy and Generalization for Efficient Visual Tracking
Ram J. Zaveri
Shivang Patel
Yu Gu
Gianfranco Doretto
VLM
86
0
0
28 Nov 2024
NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models
Yang Xu
Huihong Shi
Zhongfeng Wang
45
0
0
07 Sep 2024
Combining Neural Architecture Search and Automatic Code Optimization: A Survey
Inas Bachiri
Hadjer Benmeziane
Smail Niar
Riyadh Baghdadi
Hamza Ouarnoughi
Abdelkrime Aries
40
0
0
07 Aug 2024
DεpS: Delayed ε-Shrinking for Faster Once-For-All Training
Aditya Annavajjala
Alind Khare
Animesh Agrawal
Igor Fedorov
Hugo Latapie
Myungjin Lee
Alexey Tumanov
CLL
42
0
0
08 Jul 2024
P
2
^2
2
-ViT: Power-of-Two Post-Training Quantization and Acceleration for Fully Quantized Vision Transformer
Huihong Shi
Xin Cheng
Wendong Mao
Zhongfeng Wang
MQ
48
3
0
30 May 2024
FR-NAS: Forward-and-Reverse Graph Predictor for Efficient Neural Architecture Search
Haoming Zhang
Ran Cheng
AI4CE
GNN
37
0
0
24 Apr 2024
Unsupervised Domain Adaptation Architecture Search with Self-Training for Land Cover Mapping
Clifford Broni-Bediako
Junshi Xia
Naoto Yokoya
AI4CE
34
3
0
23 Apr 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
43
17
0
29 Mar 2024
METER: a mobile vision transformer architecture for monocular depth estimation
Lorenzo Papa
Paolo Russo
Irene Amerini
MDE
27
18
0
13 Mar 2024
G-EvoNAS: Evolutionary Neural Architecture Search Based on Network Growth
Juan Zou
Weiwei Jiang
Yizhang Xia
Yuan Liu
Zhanglu Hou
26
0
0
05 Mar 2024
Multi-objective Differentiable Neural Architecture Search
R. Sukthanker
Arber Zela
B. Staffler
Samuel Dooley
Josif Grabocka
Frank Hutter
47
1
0
28 Feb 2024
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Guohao Li
Pablo Arbelaez
Ali K. Thabet
30
12
0
19 Dec 2023
Masked Autoencoders Are Robust Neural Architecture Search Learners
Yiming Hu
Xiangxiang Chu
Bo-Wen Zhang
OOD
40
0
0
20 Nov 2023
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Jianlei Yang
Jiacheng Liao
Fanding Lei
Meichen Liu
Junyi Chen
Lingkun Long
Han Wan
Bei Yu
Weisheng Zhao
MoE
35
2
0
03 Nov 2023
DONNAv2 -- Lightweight Neural Architecture Search for Vision tasks
Sweta Priyadarshi
Tianyu Jiang
Hsin-Pai Cheng
S. Rama Krishna
Viswanath Ganapathy
C. Patel
44
0
0
26 Sep 2023
Distributionally Robust Classification on a Data Budget
Ben Feuer
Ameya Joshi
Minh Pham
C. Hegde
OOD
37
2
0
07 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
27
1
0
02 Aug 2023
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search
Bhavna Gopal
Arjun Sridhar
Tunhou Zhang
Yiran Chen
18
3
0
06 Jul 2023
Robustifying DARTS by Eliminating Information Bypass Leakage via Explicit Sparse Regularization
Jiuling Zhang
Zhiming Ding
AAML
27
3
0
12 Jun 2023
Performance-optimized deep neural networks are evolving into worse models of inferotemporal visual cortex
Drew Linsley
I. F. Rodriguez
Thomas Fel
Michael Arcaro
Saloni Sharma
Margaret Livingstone
Thomas Serre
35
19
0
06 Jun 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
43
9
0
26 May 2023
Auto-CARD: Efficient and Robust Codec Avatar Driving for Real-time Mobile Telepresence
Y. Fu
Yuecheng Li
Chenghui Li
Jason M. Saragih
Peizhao Zhang
Xiaoliang Dai
Yingyan Lin
3DH
44
2
0
24 Apr 2023
ALiSNet: Accurate and Lightweight Human Segmentation Network for Fashion E-Commerce
Amrollah Seifoddini
K. Vernooij
Timon Künzle
A. Canopoli
Malte F. Alf
Anna Volokitin
Reza Shirvany
3DH
26
0
0
15 Apr 2023
TinyDet: Accurate Small Object Detection in Lightweight Generic Detectors
Shaoyu Chen
Tianheng Cheng
Jiemin Fang
Qian Zhang
Yuan Li
Wenyu Liu
Xinggang Wang
ObjD
24
5
0
07 Apr 2023
ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement
Chaojian Li
Wenwan Chen
Jiayi Yuan
Yingyan Lin
Ashutosh Sabharwal
25
0
0
19 Mar 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
36
101
0
27 Feb 2023
Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search
Guangliang Cheng
Peng Sun
Ting-Bing Xu
Shuchang Lyu
Peiwen Lin
26
1
0
16 Feb 2023
The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment
Jared Fernandez
Jacob Kahn
Clara Na
Yonatan Bisk
Emma Strubell
FedML
33
10
0
13 Feb 2023
Oscillation-free Quantization for Low-bit Vision Transformers
Shi Liu
Zechun Liu
Kwang-Ting Cheng
MQ
23
34
0
04 Feb 2023
Enhancing Once-For-All: A Study on Parallel Blocks, Skip Connections and Early Exits
Simone Sarti
Eugenio Lomurno
Andrea Falanti
Matteo Matteucci
23
3
0
03 Feb 2023
ZiCo: Zero-shot NAS via Inverse Coefficient of Variation on Gradients
Guihong Li
Yuedong Yang
Kartikeya Bhardwaj
R. Marculescu
36
61
0
26 Jan 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
17
1
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
40
2
0
25 Jan 2023
HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks
Jinqi Xiao
Chengming Zhang
Yu Gong
Miao Yin
Yang Sui
Lizhi Xiang
Dingwen Tao
Bo Yuan
29
19
0
20 Jan 2023
β
β
β
-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search
Peng Ye
Tong He
Baopu Li
Tao Chen
Lei Bai
Wanli Ouyang
OOD
46
7
0
16 Jan 2023
Efficient Evaluation Methods for Neural Architecture Search: A Survey
Xiangning Xie
Xiaotian Song
Zeqiong Lv
Gary G. Yen
Weiping Ding
Yizhou Sun
32
12
0
14 Jan 2023
Pruning Compact ConvNets for Efficient Inference
Sayan Ghosh
Karthik Prasad
Xiaoliang Dai
Peizhao Zhang
Bichen Wu
Graham Cormode
Peter Vajda
VLM
19
4
0
11 Jan 2023
OVO: One-shot Vision Transformer Search with Online distillation
Zimian Wei
H. Pan
Xin-Yi Niu
Dongsheng Li
ViT
29
1
0
28 Dec 2022
A Study on the Intersection of GPU Utilization and CNN Inference
J. Kosaian
Amar Phanishayee
23
3
0
15 Dec 2022
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression
Jiaqi Gu
Ben Keller
Jean Kossaifi
Anima Anandkumar
Brucek Khailany
David Z. Pan
ViT
35
8
0
30 Nov 2022
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
Wenyuan Zeng
Meng Li
Wenjie Xiong
Tong Tong
Wen-jie Lu
Jin Tan
Runsheng Wang
Ru Huang
24
20
0
25 Nov 2022
GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Chaoting Xu
Yunhe Wang
20
270
0
23 Nov 2022
RepGhost: A Hardware-Efficient Ghost Module via Re-parameterization
Chengpeng Chen
Zichao Guo
Haien Zeng
Pengfei Xiong
Jian Dong
26
37
0
11 Nov 2022
NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators
Aditya Manglik
Minesh Patel
Haiyu Mao
Behzad Salami
Jisung Park
Lois Orosa
O. Mutlu
20
1
0
10 Nov 2022
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Jin Zhang
Feng Zhang
G. Yu
...
Mingyang Qian
Huixin Ma
Yanan Li
Xiaotao Wang
Lei Lei
15
10
0
07 Nov 2022
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Maurizio Denna
Abdelbadie Younes
Ganzorig Gankhuyag
...
Jing Liu
Garas Gendy
Nabil Sabor
J. Hou
Guanghui He
SupR
MQ
23
31
0
07 Nov 2022
1
2
3
4
5
6
Next