Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.02178
Cited By
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
5 October 2021
Sachin Mehta
Mohammad Rastegari
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
50 / 419 papers shown
Title
Calibration and Uncertainty for multiRater Volume Assessment in multiorgan Segmentation (CURVAS) challenge results
Meritxell Riera-Marin
S. Ko
Julia Rodriguez-Comas
Matthias Stefan May
Zhaohong Pan
...
Anton Aubanell
Andreu Antolin
Javier Garcia-Lopez
M. A. G. Ballester
Adrian Galdran
UQCV
43
0
0
13 May 2025
MUBox: A Critical Evaluation Framework of Deep Machine Unlearning
Xiang Li
Bhavani Thuraisingham
Wenqi Wei
MU
AILaw
ELM
37
0
0
13 May 2025
The Application of Deep Learning for Lymph Node Segmentation: A Systematic Review
Jingguo Qu
Xinyang Han
Man-Lik Chui
Yao Pu
Simon Takadiyi Gunda
...
Jing Qin
Ann Dorothy King
Winnie Chiu-Wing Chu
J. Cai
Michael Tin-Cheung Ying
31
0
0
09 May 2025
OcularAge: A Comparative Study of Iris and Periocular Images for Pediatric Age Estimation
Naveenkumar G. Venkataswamy
Poorna Ravi
Stephanie Schuckers
Masudul H Imtiaz
51
0
0
08 May 2025
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer
Sainath Dey
Mitul Goswami
Jashika Sethi
Prasant Kumar Pattnaik
ViT
30
0
0
07 May 2025
Image Recognition with Online Lightweight Vision Transformer: A Survey
Zherui Zhang
Rongtao Xu
Jie Zhou
Changwei Wang
Xingtian Pei
...
Jiguang Zhang
Li Guo
Longxiang Gao
W. Xu
Shibiao Xu
ViT
139
0
0
06 May 2025
Vision Transformers in Precision Agriculture: A Comprehensive Survey
Saber Mehdipour
Seyed Abolghasem Mirroshandel
Seyed Amirhossein Tabatabaei
34
0
0
30 Apr 2025
Hybrid Knowledge Transfer through Attention and Logit Distillation for On-Device Vision Systems in Agricultural IoT
Stanley Mugisha
Rashid Kisitu
Florence Tushabe
17
0
0
21 Apr 2025
Lightweight Road Environment Segmentation using Vector Quantization
Jiyong Kwag
Alper Yilmaz
Charles Toth
24
0
0
19 Apr 2025
GFT: Gradient Focal Transformer
Boris Kriuk
Simranjit Kaur Gill
Shoaib Aslam
Amir Fakhrutdinov
31
0
0
14 Apr 2025
EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively
Bingyang Wang
Kaer Huang
Bin Li
Yiqiang Yan
L. Zhang
Huchuan Lu
You He
VLM
37
0
0
07 Apr 2025
Contour Integration Underlies Human-Like Vision
Ben Lonnqvist
Elsa Scialom
Abdülkadir Gökce
Zehra Merchant
Michael H. Herzog
Martin Schrimpf
VLM
33
0
0
07 Apr 2025
Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents
Zichuan Li
Jian Cui
Xiaojing Liao
Luyi Xing
LLMAG
40
0
0
04 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
66
1
0
03 Apr 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
42
0
0
29 Mar 2025
Towards Long-Range ENSO Prediction with an Explainable Deep Learning Model
Qi Chen
Yinghao Cui
Guobin Hong
Karumuri Ashok
Yuchun Pu
Xiaogu Zheng
Xuanze Zhang
Wei Zhong
Peng Zhan
Z. Wang
AI4Cl
35
0
0
25 Mar 2025
MobilePlantViT: A Mobile-friendly Hybrid ViT for Generalized Plant Disease Image Classification
Moshiur Rahman Tonmoy
Md. Mithun Hossain
Nilanjan Dey
M. Mridha
42
1
0
20 Mar 2025
RETHINED: A New Benchmark and Baseline for Real-Time High-Resolution Image Inpainting On Edge Devices
Marcelo Sanchez
G. Triginer
Ignacio Sarasua
Lara Raad
C. Ballester
63
0
0
18 Mar 2025
Context-guided Responsible Data Augmentation with Diffusion Models
Khawar Islam
Naveed Akhtar
48
1
0
12 Mar 2025
SSVQ: Unleashing the Potential of Vector Quantization with Sign-Splitting
Shuaiting Li
Juncan Deng
Chenxuan Wang
Kedong Xu
Rongtao Deng
Hong Gu
Haibin Shen
Kejie Huang
MQ
53
0
0
11 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
50
0
0
08 Mar 2025
ColFigPhotoAttnNet: Reliable Finger Photo Presentation Attack Detection Leveraging Window-Attention on Color Spaces
Anudeep Vurity
Emanuela Marasco
Raghavendra Ramachandra
Jongwoo Park
AAML
41
1
0
07 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
171
0
0
05 Mar 2025
Real-Time Aerial Fire Detection on Resource-Constrained Devices Using Knowledge Distillation
Sabina Jangirova
Branislava Jankovic
Waseem Ullah
Latif U. Khan
Mohsen Guizani
48
0
0
28 Feb 2025
Simpler Fast Vision Transformers with a Jumbo CLS Token
A. Fuller
Yousef Yassin
Daniel G. Kyrollos
Evan Shelhamer
James R. Green
69
0
0
24 Feb 2025
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Linyi Jiang
Silvery Fu
Yifei Zhu
Bo Li
ViT
150
0
0
14 Feb 2025
MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks
Lotfi Abdelkrim Mecharbat
Alberto Marchisio
Muhammad Shafique
M. Ghassemi
Tuka Alhanai
84
0
0
11 Feb 2025
MicroViT: A Vision Transformer with Low Complexity Self Attention for Edge Device
Novendra Setyawan
Chi-Chia Sun
Mao-Hsiu Hsu
W. Kuo
Jun-Wei Hsieh
ViT
49
2
0
09 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
72
0
0
26 Jan 2025
Towards Robust Unsupervised Attention Prediction in Autonomous Driving
Mengshi Qi
Xiaoyang Bi
Pengfei Zhu
Huadong Ma
50
0
0
25 Jan 2025
Rethinking Knowledge in Distillation: An In-context Sample Retrieval Perspective
Jinjing Zhu
Songze Li
Lin Wang
47
0
0
13 Jan 2025
Soft Vision-Based Tactile-Enabled SixthFinger: Advancing Daily Objects Manipulation for Stroke Survivors
Basma B. Hasanen
Mashood M. Mohsan
Abdulaziz Alkayas
F. Renda
Irfan Hussain
36
0
0
12 Jan 2025
START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation
Jintao Guo
Lei Qi
Yinghuan Shi
Yang Gao
31
1
0
08 Jan 2025
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Xubin Wang
Weijia Jia
36
0
0
08 Jan 2025
Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain
Juntao Zhang
Kun Bian
Peng Cheng
You Zhou
Jianning Liu
Wenbo An
Jun Zhou
Kun Shao
Mamba
52
2
0
08 Jan 2025
Boosting Adversarial Transferability with Spatial Adversarial Alignment
Zhaoyu Chen
Haijing Guo
Kaixun Jiang
Jiyuan Fu
Xinyu Zhou
Dingkang Yang
H. Tang
Bo-wen Li
Wenqiang Zhang
AAML
38
0
0
03 Jan 2025
FAST: Fast Audio Spectrogram Transformer
Anugunj Naman
Gaibo Zhang
26
0
0
03 Jan 2025
Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers
Bohang Sun
Pietro Liò
ViT
AAML
40
1
0
02 Jan 2025
Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers
Yunshan Zhong
Yuyao Zhou
Yuxin Zhang
Shen Li
Yong Li
Fei Chao
Zhanpeng Zeng
Rongrong Ji
MQ
94
0
0
31 Dec 2024
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
38
0
0
27 Dec 2024
HyperCLIP: Adapting Vision-Language models with Hypernetworks
Victor Akinwande
Mohammad Sadegh Norouzzadeh
Devin Willmott
Anna Bair
Madan Ravi Ganesh
J. Zico Kolter
CLIP
VLM
88
0
0
21 Dec 2024
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Seungdong Yoa
Seungjun Lee
Hyeseung Cho
Bumsoo Kim
Woohyung Lim
ViT
70
0
0
21 Dec 2024
CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices
Andrei Znobishchev
Valerii Filev
Oleg Kudashev
Nikita Orlov
Humphrey Shi
72
0
0
17 Dec 2024
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone
Mustafa Munir
Md Mostafijur Rahman
R. Marculescu
MedIm
ViT
74
0
0
14 Dec 2024
MultiTASC++: A Continuously Adaptive Scheduler for Edge-Based Multi-Device Cascade Inference
Sokratis Nikolaidis
Stylianos I. Venieris
I. Venieris
81
0
0
05 Dec 2024
CubeFormer: A Simple yet Effective Baseline for Lightweight Image Super-Resolution
Jikai Wang
Huan Zheng
Jianbing Shen
SupR
73
0
0
03 Dec 2024
Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images
Xiangyong Lu
Masanori Suganuma
Takayuki Okatani
72
0
0
03 Dec 2024
Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Wei Zhou
Lei Zhao
Runyu Zhang
Yifan Cui
Hongpu Huang
Kun Qie
Chen Wang
AI4TS
73
0
0
30 Nov 2024
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Haoyang He
J. Zhang
Yuxuan Cai
Hongxu Chen
Xiaobin Hu
Zhenye Gan
Y. Wang
Chengjie Wang
Yunsheng Wu
Lei Xie
Mamba
88
3
0
24 Nov 2024
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
110
3
0
22 Nov 2024
1
2
3
4
5
6
7
8
9
Next