ResearchTrend.AI
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer

5 October 2021
Sachin Mehta
Mohammad Rastegari
    ViT

Papers citing "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

50 / 419 papers shown
Title
A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends
Abolfazl Younesi
Mohsen Ansari
Mohammadamin Fazli
A. Ejlali
Muhammad Shafique
Joerg Henkel
3DV
44
44
0
23 Feb 2024
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Zekang Yang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
MLLM
VLM
53
8
0
23 Feb 2024
Attention-aware Semantic Communications for Collaborative Inference
Jiwoong Im
Nayoung Kwon
Taewoo Park
Jiheon Woo
Jaeho Lee
Yongjune Kim
46
2
0
23 Feb 2024
EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization
Zhendong Xiao
Changhao Chen
Shan Yang
Wu Wei
33
1
0
21 Feb 2024
YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection
Xiaoyu Tang
Xingming Chen
Jintao Cheng
Jin Wu
Rui Fan
Chengxi Zhang
Zebo Zhou
23
4
0
20 Feb 2024
Switch EMA: A Free Lunch for Better Flatness and Sharpness
Siyuan Li
Zicheng Liu
Juanxi Tian
Ge Wang
Zedong Wang
...
Cheng Tan
Tao Lin
Yang Liu
Baigui Sun
Stan Z. Li
30
6
0
14 Feb 2024
Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies
Zhixuan Chu
Yan Wang
Feng Zhu
Lu Yu
Longfei Li
Jinjie Gu
LLMAG
18
8
0
06 Feb 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
31
27
0
05 Feb 2024
Exploiting Low-level Representations for Ultra-Fast Road Segmentation
Huan Zhou
Feng Xue
Yucong Li
Shi Gong
Yiqun Li
Yu Zhou
AI4TS
SSeg
21
3
0
04 Feb 2024
Lightweight Pixel Difference Networks for Efficient Visual Representation Learning
Z. Su
Jiehua Zhang
Longguang Wang
Hua Zhang
Zhen Liu
M. Pietikäinen
Li Liu
32
21
0
01 Feb 2024
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression
Dong Chen
Ning Liu
Yichen Zhu
Zhengping Che
Rui Ma
Fachao Zhang
Xiaofeng Mou
Yi Chang
Jian Tang
29
3
0
31 Jan 2024
SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
Seokju Yun
Youngmin Ro
ViT
38
29
0
29 Jan 2024
OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning
Chu Myaet Thwal
Minh N. H. Nguyen
Ye Lin Tun
Seongjin Kim
My T. Thai
Choong Seon Hong
51
5
0
22 Jan 2024
Datasets, Clues and State-of-the-Arts for Multimedia Forensics: An Extensive Review
Ankit Yadav
Dinesh Kumar Vishwakarma
AAML
25
6
0
13 Jan 2024
SPFormer: Enhancing Vision Transformer with Superpixel Representation
Jieru Mei
Liang-Chieh Chen
Alan L. Yuille
Cihang Xie
ViT
MDE
21
4
0
05 Jan 2024
PanGu-π: Enhancing Language Model Architectures via Nonlinearity Compensation
Yunhe Wang
Hanting Chen
Yehui Tang
Tianyu Guo
Kai Han
...
Qinghua Xu
Qun Liu
Jun Yao
Chao Xu
Dacheng Tao
67
15
0
27 Dec 2023
Synthesizing Black-box Anti-forensics DeepFakes with High Visual Quality
Bing Fan
Shu Hu
Feng Ding
AAML
35
17
0
17 Dec 2023
ResoNet: Robust and Explainable ENSO Forecasts with Hybrid Convolution and Transformer Networks
Pumeng Lyu
Tao Tang
Fenghua Ling
Jing-Jia Luo
Niklas Boers
Wanli Ouyang
Lei Bai
12
5
0
16 Dec 2023
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities
Runwei Guan
Haocheng Zhao
Shanliang Yao
Ka Lok Man
Xiaohui Zhu
...
Yong Yue
Jeremy S. Smith
Eng Gee Lim
Weiping Ding
Yutao Yue
17
4
0
14 Dec 2023
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
Chong Zhou
Xiangtai Li
Chen Change Loy
Bo Dai
VLM
30
44
0
11 Dec 2023
Building Variable-sized Models via Learngene Pool
Boyu Shi
Shiyu Xia
Xu Yang
Haokun Chen
Zhi Kou
Xin Geng
18
1
0
10 Dec 2023
F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis
Sitong Su
Jianzhi Liu
Lianli Gao
Jingkuan Song
DiffM
VGen
17
4
0
06 Dec 2023
Developing a Resource-Constraint EdgeAI model for Surface Defect Detection
Atah Nuh Mih
Hung Cao
Asfia Kawnine
Monica Wachowicz
26
0
0
04 Dec 2023
SRSNetwork: Siamese Reconstruction-Segmentation Networks based on Dynamic-Parameter Convolution
Bingkun Nian
Fenghe Tang
Jianrui Ding
Pingping Zhang
Jie-jin Yang
S.Kevin Zhou
Wei Liu
29
0
0
04 Dec 2023
MobileUtr: Revisiting the relationship between light-weight CNN and Transformer for efficient medical image segmentation
Fenghe Tang
Bingkun Nian
Jianrui Ding
Quan Quan
Jie-jin Yang
Wei Liu
S.Kevin Zhou
ViT
MedIm
23
3
0
04 Dec 2023
Token Fusion: Bridging the Gap between Token Pruning and Token Merging
Minchul Kim
Shangqian Gao
Yen-Chang Hsu
Yilin Shen
Hongxia Jin
23
29
0
02 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
40
139
0
01 Dec 2023
QuadraNet: Improving High-Order Neural Interaction Efficiency with Hardware-Aware Quadratic Neural Networks
Chenhui Xu
Fuxun Yu
Zirui Xu
Chenchen Liu
Jinjun Xiong
Xiang Chen
30
4
0
29 Nov 2023
Cross-level Attention with Overlapped Windows for Camouflaged Object Detection
Jiepan Li
Fangxiao Lu
Nan Xue
Zhuo Li
Hongyan Zhang
Wei He
25
2
0
28 Nov 2023
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Li Yuan
Jiangliu Wang
Yibing Song
Ping Luo
112
16
0
26 Nov 2023
Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots
Youqi Liao
Shuhao Kang
Jianping Li
Yang Liu
Yun Liu
Zhen Dong
Bisheng Yang
Xieyuanli Chen
19
10
0
21 Nov 2023
Double-Condensing Attention Condenser: Leveraging Attention in Deep Learning to Detect Skin Cancer from Skin Lesion Images
Chi-en Amy Tai
Elizabeth Janes
Chris Czarnecki
Alexander Wong
MedIm
26
2
0
20 Nov 2023
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
Yunshan Zhong
Jiawei Hu
Mingbao Lin
Mengzhao Chen
Rongrong Ji
MQ
28
3
0
16 Nov 2023
MARformer: An Efficient Metal Artifact Reduction Transformer for Dental CBCT Images
Yuxuan Shi
Jun Xu
Dinggang Shen
MedIm
25
0
0
16 Nov 2023
FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer
Chi-Chih Chang
Yuan-Yao Sung
Shixing Yu
N. Huang
Diana Marculescu
Kai-Chiang Wu
ViT
13
1
0
07 Nov 2023
SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers
Xiangyong Lu
Masanori Suganuma
Takayuki Okatani
35
10
0
07 Nov 2023
Dual-Stream Attention Transformers for Sewer Defect Classification
Abdullah Al Redwan Newaz
Mahdi Abdelguerfi
Kendall N. Niles
Joe Tom
ViT
39
0
0
07 Nov 2023
TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices
Jianlei Yang
Jiacheng Liao
Fanding Lei
Meichen Liu
Junyi Chen
Lingkun Long
Han Wan
Bei Yu
Weisheng Zhao
MoE
33
2
0
03 Nov 2023
StairNet: Visual Recognition of Stairs for Human-Robot Locomotion
Andrew Garrett Kurbis
Dmytro Kuzmenko
Bogdan Ivanyuk-Skulskiy
Alex Mihailidis
Brokoslaw Laschowski
15
0
0
31 Oct 2023
Triplet Attention Transformer for Spatiotemporal Predictive Learning
Xuesong Nie
Xi Chen
Haoyuan Jin
Zhihang Zhu
Yunfeng Yan
Donglian Qi
ViT
14
10
0
28 Oct 2023
Location-Aware Visual Question Generation with Lightweight Models
Nicholas Collin Suwono
Justin Chih-Yao Chen
Tun-Min Hung
T. Huang
I-Bin Liao
Yung-Hui Li
Lun-Wei Ku
Shao-Hua Sun
13
4
0
23 Oct 2023
Domain-Generalized Face Anti-Spoofing with Unknown Attacks
Zong-Wei Hong
Yu-Chen Lin
Hsuan-Tung Liu
Yi-Ren Yeh
Chu-Song Chen
CVBM
AAML
25
4
0
18 Oct 2023
CLIP for Lightweight Semantic Segmentation
Ke Jin
Wankou Yang
VLM
21
1
0
11 Oct 2023
No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling
Xuwei Xu
Changlin Li
Yudong Chen
Xiaojun Chang
Jiajun Liu
Sen Wang
ViT
21
5
0
09 Oct 2023
Plug n' Play: Channel Shuffle Module for Enhancing Tiny Vision Transformers
Xuwei Xu
Sen Wang
Yudong Chen
Jiajun Liu
ViT
21
1
0
09 Oct 2023
Enhancing Representations through Heterogeneous Self-Supervised Learning
Zhongyu Li
Bo-Wen Yin
Yongxiang Liu
Li Liu
Ming-Ming Cheng
SSL
23
2
0
08 Oct 2023
Entropic Score metric: Decoupling Topology and Size in Training-free NAS
Niccolò Cavagnero
Luc Robbiano
Francesca Pistilli
Barbara Caputo
Giuseppe Averta
18
2
0
06 Oct 2023
Diffusion Models as Masked Audio-Video Learners
Elvis Nunez
Yanzi Jin
Mohammad Rastegari
Sachin Mehta
Maxwell Horton
20
2
0
05 Oct 2023
Improving Drumming Robot Via Attention Transformer Network
Yang Yi
Zonghan Li
26
0
0
04 Oct 2023
Distilling Inductive Bias: Knowledge Distillation Beyond Model Compression
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
VLM
25
0
0
30 Sep 2023