2110.02178
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
5 October 2021
Sachin Mehta
Mohammad Rastegari
ViT
Papers citing
"MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
50 / 419 papers shown
Title
End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning
Jinrong Zhang
Wu Wen
Sheng-lan Liu
Yunheng Li
Qifeng Li
Lin Feng
29
0
0
27 Sep 2023
Decision Fusion Network with Perception Fine-tuning for Defect Classification
Xiaoheng Jiang
Shilong Tian
Zhiwen Zhu
Yang Lu
Hao Liu
Li Chen
Shupan Li
Mingliang Xu
11
1
0
22 Sep 2023
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
Zhenzhen Chu
Jiayu Chen
Cen Chen
Chengyu Wang
Ziheng Wu
Jun Huang
Weining Qian
ViT
11
2
0
21 Sep 2023
A Machine Learning-oriented Survey on Tiny Machine Learning
Luigi Capogrosso
Federico Cunico
D. Cheng
Franco Fummi
Marco Cristani
SyDa
MU
26
33
0
21 Sep 2023
EPTQ: Enhanced Post-Training Quantization via Label-Free Hessian
Ofir Gordon
H. Habi
Arnon Netzer
MQ
33
1
0
20 Sep 2023
PseudoCal: Towards Initialisation-Free Deep Learning-Based Camera-LiDAR Self-Calibration
Mathieu Cocheteux
Julien Moreau
Franck Davoine
13
3
0
18 Sep 2023
RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework
Yuelei Wang
Ting Zhang
Liangjin Zhao
Lin Hu
Zhechao Wang
...
Kaiqiang Chen
Xuan Zeng
Zhirui Wang
Hongqi Wang
Xian Sun
22
4
0
16 Sep 2023
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool
Josiah W. Smith
Murat Torlak
21
1
0
16 Sep 2023
Complex-Valued Neural Networks for Data-Driven Signal Processing and Signal Understanding
Josiah W. Smith
21
9
0
14 Sep 2023
Mitigating Adversarial Attacks in Federated Learning with Trusted Execution Environments
Simon Queyrut
V. Schiavoni
Pascal Felber
AAML
FedML
18
6
0
13 Sep 2023
Mobile Vision Transformer-based Visual Object Tracking
Goutam Yelluru Gopal
Maria A. Amer
19
5
0
11 Sep 2023
Multimodal Fish Feeding Intensity Assessment in Aquaculture
Meng Cui
Xubo Liu
Haohe Liu
Zhuangzhuang Du
Tao Chen
Guoping Lian
Daoliang Li
Wenwu Wang
26
5
0
10 Sep 2023
DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices
Guanyu Xu
Zhiwei Hao
Yong Luo
Han Hu
J. An
Shiwen Mao
ViT
37
14
0
10 Sep 2023
Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts
Erik Daxberger
Floris Weers
Bowen Zhang
Tom Gunter
Ruoming Pang
Marcin Eichner
Michael Emmersberger
Yinfei Yang
Alexander Toshev
Xianzhi Du
MoE
9
9
0
08 Sep 2023
On the Efficacy of Multi-scale Data Samplers for Vision Applications
Elvis Nunez
Thomas Merth
Anish K. Prabhu
Mehrdad Farajtabar
Mohammad Rastegari
Sachin Mehta
Maxwell Horton
23
1
0
08 Sep 2023
Separable Self and Mixed Attention Transformers for Efficient Object Tracking
Goutam Yelluru Gopal
Maria A. Amer
VOT
ViT
25
25
0
07 Sep 2023
Compressing Vision Transformers for Low-Resource Visual Learning
Eric Youn
J. SaiMitheran
Sanjana Prabhu
Siyuan Chen
ViT
24
2
0
05 Sep 2023
ExMobileViT: Lightweight Classifier Extension for Mobile Vision Transformer
Gyeongdong Yang
Yungwook Kwon
Hyunjin Kim
ViT
16
1
0
04 Sep 2023
U-SEANNet: A Simple, Efficient and Applied U-Shaped Network for Diagnosis of Nasal Diseases on Nasal Endoscopic Images
Yubiao Yue
Jun Xue
Chao Wang
Haihua Liang
Zhenzhang Li
16
0
0
27 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
28
20
0
27 Aug 2023
GRASP: A Rehearsal Policy for Efficient Online Continual Learning
Md Yousuf Harun
Jhair Gallardo
Junyu Chen
Christopher Kanan
CLL
28
9
0
25 Aug 2023
Ultrafast-and-Ultralight ConvNet-Based Intelligent Monitoring System for Diagnosing Early-Stage Mpox Anytime and Anywhere
Yubiao Yue
X. Shi
Li-Xia Qin
Xinyue Zhang
Jia-lin Xu
Zipei Zheng
Zhenzhang Li
Y. Li
13
4
0
25 Aug 2023
TurboViT: Generating Fast Vision Transformers via Generative Architecture Search
Alexander Wong
Saad Abbasi
Saeejith Nair
ViT
27
1
0
22 Aug 2023
Ear-Keeper: Real-time Diagnosis of Ear Lesions Utilizing Ultralight-Ultrafast ConvNet and Large-scale Ear Endoscopic Dataset
Yubiao Yue
Xinyu Zeng
X. Shi
Mei Zhang
Fan Zhang
Haihua Liang
Yanmei Chen
Zhenzhang Li
Zefeng Xie
20
1
0
21 Aug 2023
Large Transformers are Better EEG Learners
Bingxin Wang
Xiao-Ying Fu
Yuan Lan
Luchan Zhang
Wei Zheng
Yang Xiang
15
4
0
20 Aug 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
Ben Kang
Xin Chen
D. Wang
Houwen Peng
Huchuan Lu
12
46
0
14 Aug 2023
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
Yichen Yuan
Yifan Wang
Lijun Wang
Xiaoqi Zhao
Huchuan Lu
Yu Wang
Wei Su
Lei Zhang
VOS
19
7
0
13 Aug 2023
Semantic-embedded Similarity Prototype for Scene Recognition
Chuanxin Song
Hanbo Wu
X. Ma
Yibin Li
24
3
0
11 Aug 2023
LEFormer: A Hybrid CNN-Transformer Architecture for Accurate Lake Extraction from Remote Sensing Imagery
Ben Chen
Xuechao Zou
Yu-an Zhang
Jiayu Li
Kaihang Li
Junliang Xing
Pin Tao
ViT
11
10
0
08 Aug 2023
Frustratingly Easy Model Generalization by Dummy Risk Minimization
Juncheng Wang
Jindong Wang
Xixu Hu
Shujun Wang
Xingxu Xie
10
1
0
04 Aug 2023
CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion
Fenghe Tang
Jianrui Ding
Lingtao Wang
C. Ning
S. Kevin Zhou
MedIm
15
37
0
02 Aug 2023
Improved Prognostic Prediction of Pancreatic Cancer Using Multi-Phase CT by Integrating Neural Distance and Texture-Aware Transformer
Hexin Dong
Jiawen Yao
Yuxing Tang
Ming Yuan
Yingda Xia
...
Le Lu
Li Zhang
Zai-De Liu
Yu Shi
Ling Zhang
MedIm
14
2
0
01 Aug 2023
FLatten Transformer: Vision Transformer using Focused Linear Attention
Dongchen Han
Xuran Pan
Yizeng Han
Shiji Song
Gao Huang
23
154
0
01 Aug 2023
LGViT: Dynamic Early Exiting for Accelerating Vision Transformer
Guanyu Xu
Jiawei Hao
Li Shen
Han Hu
Yong Luo
Hui Lin
J. Shen
24
15
0
01 Aug 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
17
0
0
29 Jul 2023
Adaptive Frequency Filters As Efficient Global Token Mixers
Zhipeng Huang
Zhizheng Zhang
Cuiling Lan
Zhengjun Zha
Yan Lu
B. Guo
25
36
0
26 Jul 2023
CLIP-KD: An Empirical Study of CLIP Model Distillation
Chuanguang Yang
Zhulin An
Libo Huang
Junyu Bi
Xinqiang Yu
Hansheng Yang
Boyu Diao
Yongjun Xu
VLM
21
27
0
24 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
34
175
0
18 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
16
2
0
18 Jul 2023
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
22
66
0
17 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
37
62
0
16 Jul 2023
Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar
Runwei Guan
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yong Yue
Yutao Yue
VOS
29
16
0
14 Jul 2023
WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces
Shanliang Yao
Runwei Guan
Zhaodong Wu
Yi Ni
Zile Huang
...
H. Seo
Ka Lok Man
Jieming Ma
Xiaohui Zhu
Yutao Yue
26
23
0
13 Jul 2023
Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU
Zhihe Zhao
Neiwen Ling
Nan Guan
Guoliang Xing
20
11
0
10 Jul 2023
EdgeFace: Efficient Face Recognition Model for Edge Devices
Anjith George
Christophe Ecabert
Hatef Otroshi-Shahreza
Ketan Kotwal
Sébastien Marcel
CVBM
18
23
0
04 Jul 2023
Fourier-Mixed Window Attention: Accelerating Informer for Long Sequence Time-Series Forecasting
Nhat Tran
Jack Xin
AI4TS
31
6
0
02 Jul 2023
MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications
Mustafa Munir
William Avery
R. Marculescu
ViT
GNN
31
33
0
01 Jul 2023
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Chaoning Zhang
Dongshen Han
Yu Qiao
Jung Uk Kim
Sung-Ho Bae
Seungkyu Lee
Choong Seon Hong
VLM
31
327
0
25 Jun 2023
Dynamic Perceiver for Efficient Visual Recognition
Yizeng Han
Dongchen Han
Zeyu Liu
Yulin Wang
Xuran Pan
Yifan Pu
Chaorui Deng
Junlan Feng
S. Song
Gao Huang
16
29
0
20 Jun 2023
NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning
Yun Yi
Haokui Zhang
Rong Xiao
Nan Wang
Xiaoyu Wang
GNN
24
2
0
19 Jun 2023