2110.02178
Cited By
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
5 October 2021
Sachin Mehta
Mohammad Rastegari
ViT
Papers citing
"MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
50 / 419 papers shown
Title
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
Xuan Shen
Yaohua Wang
Ming Lin
Yi-Li Huang
Hao Tang
Xiuyu Sun
Yanzhi Wang
67
33
0
05 Mar 2023
Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications
Zikang Xiong
Daniel Lawson
Joe Eappen
A. H. Qureshi
Suresh Jagannathan
11
0
0
02 Mar 2023
Generic-to-Specific Distillation of Masked Autoencoders
Wei Huang
Zhiliang Peng
Li Dong
Furu Wei
Jianbin Jiao
QiXiang Ye
30
22
0
28 Feb 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
33
101
0
27 Feb 2023
Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution
Long Sun
Jiangxin Dong
Jinhui Tang
Jin-shan Pan
SupR
33
79
0
27 Feb 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
32
3
0
26 Feb 2023
A Convolutional Vision Transformer for Semantic Segmentation of Side-Scan Sonar Data
Hayat Rajani
N. Gracias
Rafael García
ViT
24
12
0
24 Feb 2023
LightCTS: A Lightweight Framework for Correlated Time Series Forecasting
Zhichen Lai
Dalin Zhang
Huan Li
Christian S. Jensen
Hua Lu
Yan Zhao
AI4TS
27
29
0
23 Feb 2023
A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness
Huaqi Tao
Bing Liu
Jinqiang Cui
Hong Zhang
ViT
15
23
0
23 Feb 2023
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines
M. Cen
Xingyu Li
Bangwei Guo
J. Jonnagaddala
Hong Zhang
Xuesong Xu
MedIm
LM&MA
11
0
0
21 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
89
8
0
21 Feb 2023
MedViT: A Robust Vision Transformer for Generalized Medical Image Classification
Omid Nejati Manzari
Hamid Ahmadabadi
Hossein Kashiani
S. B. Shokouhi
Ahmad Ayatollahi
ViT
MedIm
28
177
0
19 Feb 2023
Self-supervised pseudo-colorizing of masked cells
Royden Wagner
Carlos Fernandez Lopez
Christoph Stiller
17
0
0
12 Feb 2023
Short-Term Memory Convolutions
Grzegorz Stefański
Krzysztof Arendt
P. Daniluk
Bartlomiej Jasik
Artur Szumaczuk
14
4
0
08 Feb 2023
CECT: Controllable Ensemble CNN and Transformer for COVID-19 Image Classification
Zhao Liu
Leizhao Shen
ViT
29
7
0
05 Feb 2023
Joint Training of Deep Ensembles Fails Due to Learner Collusion
Alan Jeffares
Tennison Liu
Jonathan Crabbé
M. Schaar
FedML
53
15
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
37
2
0
25 Jan 2023
Connecting metrics for shape-texture knowledge in computer vision
Tiago Gaspar Oliveira
Tiago Marques
Arlindo L. Oliveira
19
0
0
25 Jan 2023
Head-Free Lightweight Semantic Segmentation with Linear Transformer
B. Dong
Pichao Wang
Fan Wang
ViT
22
65
0
11 Jan 2023
Skip-Attention: Improving Vision Transformers by Paying Less Attention
Shashanka Venkataramanan
Amir Ghodrati
Yuki M. Asano
Fatih Porikli
A. Habibian
ViT
18
25
0
05 Jan 2023
Enabling Augmented Segmentation and Registration in Ultrasound-Guided Spinal Surgery via Realistic Ultrasound Synthesis from Diagnostic CT Volume
A. Li
Jiayi Han
Yongjian Zhao
Keyu Li
Li Liu
25
4
0
05 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Sucheng Ren
Fangyun Wei
Zheng-Wei Zhang
Han Hu
35
34
0
03 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
44
90
0
03 Jan 2023
RangeAugment: Efficient Online Augmentation with Range Learning
Sachin Mehta
Saeid Naderiparizi
Fartash Faghri
Maxwell Horton
Lailin Chen
Ali Farhadi
Oncel Tuzel
Mohammad Rastegari
24
6
0
20 Dec 2022
Boosting Automatic COVID-19 Detection Performance with Self-Supervised Learning and Batch Knowledge Ensembling
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
SSL
25
8
0
19 Dec 2022
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
Zhikai Li
Junrui Xiao
Lianwei Yang
Qingyi Gu
MQ
26
81
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
30
159
0
15 Dec 2022
AGO: Boosting Mobile AI Inference Performance by Removing Constraints on Graph Optimization
Zhiying Xu
H. Peng
Wei Wang
GNN
26
3
0
02 Dec 2022
TAOTF: A Two-stage Approximately Orthogonal Training Framework in Deep Neural Networks
Taoyong Cui
Jianze Li
Yuhan Dong
Li Liu
17
1
0
25 Nov 2022
GhostNetV2: Enhance Cheap Operation with Long-Range Attention
Yehui Tang
Kai Han
Jianyuan Guo
Chang Xu
Chaoting Xu
Yunhe Wang
18
270
0
23 Nov 2022
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
31
129
0
22 Nov 2022
Spikeformer: A Novel Architecture for Training High-Performance Low-Latency Spiking Neural Network
Yudong Li
Yunlin Lei
Xu Yang
21
26
0
19 Nov 2022
Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference
Haoran You
Yunyang Xiong
Xiaoliang Dai
Bichen Wu
Peizhao Zhang
Haoqi Fan
Peter Vajda
Yingyan Lin
35
31
0
18 Nov 2022
Fcaformer: Forward Cross Attention in Hybrid Vision Transformer
Haokui Zhang
Wenze Hu
Xiaoyu Wang
ViT
19
8
0
14 Nov 2022
ParCNetV2: Oversized Kernel with Enhanced Attention
Ruihan Xu
Haokui Zhang
Wenze Hu
Shiliang Zhang
Xiaoyu Wang
ViT
25
6
0
14 Nov 2022
Watermarking in Secure Federated Learning: A Verification Framework Based on Client-Side Backdooring
Wenyuan Yang
Shuo Shao
Yue Yang
Xiyao Liu
Ximeng Liu
Zhihua Xia
Gerald Schaefer
Hui Fang
FedML
14
21
0
14 Nov 2022
ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention
Jyotikrishna Dass
Shang Wu
Huihong Shi
Chaojian Li
Zhifan Ye
Zhongfeng Wang
Yingyan Lin
17
49
0
09 Nov 2022
Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer
S. S. Nijhawan
Leo Hoshikawa
Atsushi Irie
Masakazu Yoshimura
Junji Otsuka
Takeshi Ohashi
VOT
ViT
27
0
0
09 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
26
56
0
07 Nov 2022
Boosting Binary Neural Networks via Dynamic Thresholds Learning
Jiehua Zhang
Xueyang Zhang
Z. Su
Zitong Yu
Yanghe Feng
Xin Lu
M. Pietikäinen
Li Liu
MQ
30
0
0
04 Nov 2022
Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images
Yan Zhang
Xiyuan Gao
Qingyan Duan
Jiaxu Leng
Xiao Pu
Xinbo Gao
ViT
16
1
0
28 Oct 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
26
2
0
28 Oct 2022
Deep Model Reassembly
Xingyi Yang
Zhou Daquan
Songhua Liu
Jingwen Ye
Xinchao Wang
MoMe
20
120
0
24 Oct 2022
Boosting vision transformers for image retrieval
Chull Hwan Song
Jooyoung Yoon
Shunghyun Choi
Yannis Avrithis
ViT
29
31
0
21 Oct 2022
Multi-view Gait Recognition based on Siamese Vision Transformer
Yanchen Yang
Lijun Yun
Ruoyu Li
Feiyan Cheng
21
5
0
19 Oct 2022
Spatio-channel Attention Blocks for Cross-modal Crowd Counting
Youjia Zhang
Soyun Choi
Sungeun Hong
18
24
0
19 Oct 2022
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
Haoran You
Zhanyi Sun
Huihong Shi
Zhongzhi Yu
Yang Katie Zhao
Yongan Zhang
Chaojian Li
Baopu Li
Yingyan Lin
ViT
19
76
0
18 Oct 2022
Token Merging: Your ViT But Faster
Daniel Bolya
Cheng-Yang Fu
Xiaoliang Dai
Peizhao Zhang
Christoph Feichtenhofer
Judy Hoffman
MoMe
30
417
0
17 Oct 2022
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Taojiannan Yang
Haokui Zhang
Wenze Hu
C. L. P. Chen
Xiaoyu Wang
ViT
16
0
0
08 Oct 2022
Towards Light Weight Object Detection System
K. Dharma
V. Dayana
Menglan Wu
Venkateswara Rao Cherukuri
Hau Hwang
16
1
0
08 Oct 2022