MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer

5 October 2021
Sachin Mehta
Mohammad Rastegari
    ViT

Papers citing "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

50 / 419 papers shown
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
Xuan Shen
Yaohua Wang
Ming Lin
Yi-Li Huang
Hao Tang
Xiuyu Sun
Yanzhi Wang
67
33
0
05 Mar 2023
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications
Zikang Xiong
Daniel Lawson
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
11
0
0
02 Mar 2023
Generic-to-Specific Distillation of Masked Autoencoders
Wei Huang
Zhiliang Peng
Li Dong
Furu Wei
Jianbin Jiao
QiXiang Ye
30
22
0
28 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
33
101
0
27 Feb 2023
Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution
Long Sun
Jiangxin Dong
Jinhui Tang
Jin-shan Pan
SupR
33
79
0
27 Feb 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
32
3
0
26 Feb 2023
A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data
Hayat Rajani
N. Gracias
Rafael García
ViT
24
12
0
24 Feb 2023
LightCTS: A Lightweight Framework for Correlated Time Series Forecasting
Zhichen Lai
Dalin Zhang
Huan Li
Christian S. Jensen
Hua Lu
Yan Zhao
AI4TS
27
29
0
23 Feb 2023
A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness
Huaqi Tao
Bing Liu
Jinqiang Cui
Hong Zhang
ViT
15
23
0
23 Feb 2023
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines
M. Cen
Xingyu Li
Bangwei Guo
J. Jonnagaddala
Hong Zhang
Xuesong Xu
MedIm
LM&MA
11
0
0
21 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
89
8
0
21 Feb 2023
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification
Omid Nejati Manzari
Hamid Ahmadabadi
Hossein Kashiani
S. B. Shokouhi
Ahmad Ayatollahi
ViT
MedIm
28
177
0
19 Feb 2023
Self-supervised pseudo-colorizing of masked cells
Royden Wagner
Carlos Fernandez Lopez
Christoph Stiller
17
0
0
12 Feb 2023
Short-Term Memory Convolutions
Grzegorz Stefański
Krzysztof Arendt
P. Daniluk
Bartlomiej Jasik
Artur Szumaczuk
14
4
0
08 Feb 2023
CECT: Controllable Ensemble CNN and Transformer for COVID-19 Image Classification
Zhao Liu
Leizhao Shen
ViT
29
7
0
05 Feb 2023
Joint Training of Deep Ensembles Fails Due to Learner Collusion
Alan Jeffares
Tennison Liu
Jonathan Crabbé
M. Schaar
FedML
53
15
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
37
2
0
25 Jan 2023
Connecting metrics for shape-texture knowledge in computer vision
Tiago Gaspar Oliveira
Tiago Marques
Arlindo L. Oliveira
19
0
0
25 Jan 2023
Head-Free Lightweight Semantic Segmentation with Linear Transformer
B. Dong
Pichao Wang
Fan Wang
ViT
22
65
0
11 Jan 2023
Skip-Attention: Improving Vision Transformers by Paying Less Attention
Shashanka Venkataramanan
Amir Ghodrati
Yuki M. Asano
Fatih Porikli
A. Habibian
ViT
18
25
0
05 Jan 2023
Enabling Augmented Segmentation and Registration in Ultrasound-Guided Spinal Surgery via Realistic Ultrasound Synthesis from Diagnostic CT Volume
A. Li
Jiayi Han
Yongjian Zhao
Keyu Li
Li Liu
25
4
0
05 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Sucheng Ren
Fangyun Wei
Zheng-Wei Zhang
Han Hu
35
34
0
03 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
44
90
0
03 Jan 2023
RangeAugment: Efficient Online Augmentation with Range Learning
Sachin Mehta
Saeid Naderiparizi
Fartash Faghri
Maxwell Horton
Lailin Chen
Ali Farhadi
Oncel Tuzel
Mohammad Rastegari
24
6
0
20 Dec 2022
Boosting Automatic COVID-19 Detection Performance with Self-Supervised Learning and Batch Knowledge Ensembling
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
SSL
25
8
0
19 Dec 2022
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
Zhikai Li
Junrui Xiao
Lianwei Yang
Qingyi Gu
MQ
26
81
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
30
159
0
15 Dec 2022
AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization
Zhiying Xu
H. Peng
Wei Wang
GNN
26
3
0
02 Dec 2022
TAOTF: A Two-stage Approximately Orthogonal Training Framework in Deep Neural Networks
Taoyong Cui
Jianze Li
Yuhan Dong
Li Liu
17
1
0
25 Nov 2022
GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Chaoting Xu
Yunhe Wang
18
270
0
23 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
31
129
0
22 Nov 2022
Spikeformer: A Novel Architecture for Training High-Performance Low-Latency Spiking Neural Network
Yudong Li
Yunlin Lei
Xu Yang
21
26
0
19 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
35
31
0
18 Nov 2022
Fcaformer: Forward Cross Attention in Hybrid Vision Transformer
Haokui Zhang
Wenze Hu
Xiaoyu Wang
ViT
19
8
0
14 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
25
6
0
14 Nov 2022
Watermarking in Secure Federated Learning: A Verification Framework Based on Client-Side Backdooring
Wenyuan Yang
Shuo Shao
Yue Yang
Xiyao Liu
Ximeng Liu
Zhihua Xia
Gerald Schaefer
Hui Fang
FedML
14
21
0
14 Nov 2022
ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention
Jyotikrishna Dass
Shang Wu
Huihong Shi
Chaojian Li
Zhifan Ye
Zhongfeng Wang
Yingyan Lin
17
49
0
09 Nov 2022
Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer
S. S. Nijhawan
Leo Hoshikawa
Atsushi Irie
Masakazu Yoshimura
Junji Otsuka
Takeshi Ohashi
VOT
ViT
27
0
0
09 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
26
56
0
07 Nov 2022
Boosting Binary Neural Networks via Dynamic Thresholds Learning
Jiehua Zhang
Xueyang Zhang
Z. Su
Zitong Yu
Yanghe Feng
Xin Lu
M. Pietikäinen
Li Liu
MQ
30
0
0
04 Nov 2022
Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images
Yan Zhang
Xiyuan Gao
Qingyan Duan
Jiaxu Leng
Xiao Pu
Xinbo Gao
ViT
16
1
0
28 Oct 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
26
2
0
28 Oct 2022
Deep Model Reassembly
Xingyi Yang
Zhou Daquan
Songhua Liu
Jingwen Ye
Xinchao Wang
MoMe
20
120
0
24 Oct 2022
Boosting vision transformers for image retrieval
Chull Hwan Song
Jooyoung Yoon
Shunghyun Choi
Yannis Avrithis
ViT
29
31
0
21 Oct 2022
Multi-view Gait Recognition based on Siamese Vision Transformer
Yanchen Yang
Lijun Yun
Ruoyu Li
Feiyan Cheng
21
5
0
19 Oct 2022
Spatio-channel Attention Blocks for Cross-modal Crowd Counting
Youjia Zhang
Soyun Choi
Sungeun Hong
18
24
0
19 Oct 2022
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
Haoran You
Zhanyi Sun
Huihong Shi
Zhongzhi Yu
Yang Katie Zhao
Yongan Zhang
Chaojian Li
Baopu Li
Yingyan Lin
ViT
19
76
0
18 Oct 2022
Token Merging: Your ViT But Faster
Daniel Bolya
Cheng-Yang Fu
Xiaoliang Dai
Peizhao Zhang
Christoph Feichtenhofer
Judy Hoffman
MoMe
30
417
0
17 Oct 2022
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Taojiannan Yang
Haokui Zhang
Wenze Hu
C. L. P. Chen
Xiaoyu Wang
ViT
16
0
0
08 Oct 2022
Towards Light Weight Object Detection System
K. Dharma
V. Dayana
Menglan Wu
Venkateswara Rao Cherukuri
Hau Hwang
16
1
0
08 Oct 2022