ResearchTrend.AI
Incorporating Convolution Designs into Visual Transformers


22 March 2021
Kun Yuan
Shaopeng Guo
Ziwei Liu
Aojun Zhou
F. Yu
Wei Wu
    ViT

Papers citing "Incorporating Convolution Designs into Visual Transformers"

50 / 218 papers shown
Interpret Vision Transformers as ConvNets with Dynamic Convolutions
Chong Zhou
Chen Change Loy
Bo Dai
ViT
32
1
0
19 Sep 2023
Padding Aware Neurons
Dario Garcia-Gasulla
Víctor Giménez-Ábalos
Pablo A. Martin-Torres
VLM
FAtt
KELM
27
0
0
14 Sep 2023
HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen
Xintao Wang
Wenlong Zhang
Xiangtao Kong
Yu Qiao
Jiantao Zhou
Chao Dong
26
44
0
11 Sep 2023
Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction
Gehui Li
Jinyuan Liu
Long Ma
Zhiying Jiang
Xin-Yue Fan
Risheng Liu
20
6
0
02 Sep 2023
PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Zhixin Ling
Zhen Xing
Xiangdong Zhou
Manliang Cao
G. Zhou
ViT
26
17
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
31
20
0
27 Aug 2023
SG-Former: Self-guided Transformer with Evolving Token Reallocation
Sucheng Ren
Xingyi Yang
Songhua Liu
Xinchao Wang
ViT
27
41
0
23 Aug 2023
Patch Is Not All You Need
Chang-bo Li
Jie M. Zhang
Yang Wei
Zhilong Ji
Jinfeng Bai
Shiguang Shan
ViT
49
1
0
21 Aug 2023
TransFace: Calibrating Transformer Training for Face Recognition from a Data-Centric Perspective
Jun Dan
Yang Liu
Haoyu Xie
Jiankang Deng
H. Xie
Xuansong Xie
Baigui Sun
ViT
25
21
0
20 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
24
2
0
10 Aug 2023
SDLFormer: A Sparse and Dense Locality-enhanced Transformer for Accelerated MR Image Reconstruction
Rahul G.S.
Sriprabha Ramnarayanan
Mohammad Al Fahim
Keerthi Ram
Preejith S.P
M. Sivaprakasam
ViT
MedIm
16
2
0
08 Aug 2023
Benchmarking Ultra-High-Definition Image Reflection Removal
Zhenyuan Zhang
Zhenbo Song
Kaihao Zhang
Wenhan Luo
Zhaoxin Fan
Jianfeng Lu
28
1
0
01 Aug 2023
Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment
Kun Yuan
Zishang Kong
Chuanchuan Zheng
Ming-Ting Sun
Xingsen Wen
ViT
27
14
0
31 Jul 2023
Sparse then Prune: Toward Efficient Vision Transformers
Yogi Prasetyo
N. Yudistira
A. Widodo
VLM
ViT
16
1
0
22 Jul 2023
DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy
Yiwen Zhang
Chuanpu Li
Liming Zhong
Ze-xiang Chen
Wei Yang
Xuetao Wang
DiffM
MedIm
13
10
0
28 Jun 2023
Revisiting Token Pruning for Object Detection and Instance Segmentation
Yifei Liu
Mathias Gehrig
Nico Messikommer
Marco Cannici
Davide Scaramuzza
ViT
VLM
37
24
0
12 Jun 2023
Energy-Based Models for Cross-Modal Localization using Convolutional Transformers
Alan Wu
Michael S. Ryoo
33
3
0
06 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
23
0
0
02 Jun 2023
DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
Rui Shao
Tianxing Wu
Liqiang Nie
Ziwei Liu
16
11
0
01 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
42
28
0
01 Jun 2023
Dual Path Transformer with Partition Attention
Zhengkai Jiang
Liang Liu
Jiangning Zhang
Yabiao Wang
Mingang Chen
Chengjie Wang
ViT
34
2
0
24 May 2023
PanoContext-Former: Panoramic Total Scene Understanding with a Transformer
Yuan Dong
C. Fang
Liefeng Bo
Zilong Dong
Ping Tan
MDE
ViT
15
10
0
21 May 2023
Mimetic Initialization of Self-Attention Layers
Asher Trockman
J. Zico Kolter
30
30
0
16 May 2023
Visual Tuning
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
46
38
0
10 May 2023
Low-Light Image Enhancement via Structure Modeling and Guidance
Xiaogang Xu
Ruixing Wang
Jiangbo Lu
3DV
DiffM
25
66
0
10 May 2023
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu
Yinan He
Wenhai Wang
Weiyun Wang
Yi Wang
...
Yali Wang
Limin Wang
Ping Luo
Jifeng Dai
Yu Qiao
LRM
MLLM
22
79
0
09 May 2023
Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Brian Kenji Iwana
Akihiro Kusuda
ViT
45
2
0
27 Apr 2023
PVP: Pre-trained Visual Parameter-Efficient Tuning
Zhao-quan Song
Ke Yang
Naiyang Guan
Junjie Zhu
Peng Qiao
Qingyong Hu
VPVLM
VLM
27
3
0
26 Apr 2023
Vision Transformers, a new approach for high-resolution and large-scale mapping of canopy heights
Ibrahim Fayad
P. Ciais
Martin Schwartz
J. Wigneron
N. Baghdadi
...
Alexandre d’Aspremont
F. Frappart
Sassan Saatchi
Agnès Pellissier-Tanon
Hassan Bazzi
31
32
0
22 Apr 2023
A Review of Deep Learning for Video Captioning
Moloud Abdar
Meenakshi Kollati
Swaraja Kuraparthi
Farhad Pourpanah
Daniel J. McDuff
...
Shuicheng Yan
Abduallah A. Mohamed
Abbas Khosravi
Erik Cambria
Fatih Porikli
3DV
27
20
0
22 Apr 2023
Two Birds, One Stone: A Unified Framework for Joint Learning of Image and Video Style Transfers
Bohai Gu
Heng Fan
Libo Zhang
16
9
0
22 Apr 2023
Canvas: End-to-End Kernel Architecture Search in Neural Networks
Chenggang Zhao
Genghan Zhang
Mingyu Gao
15
1
0
16 Apr 2023
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment
Kai Zhao
Kun Yuan
Ming-Ting Sun
Xingsen Wen
16
20
0
13 Apr 2023
DUFormer: Solving Power Line Detection Task in Aerial Images using Semantic Segmentation
Deyu An
Qian Zhang
Jianshu Chao
Tingli Li
Feng Qiao
Yong Deng
Zhen-Peng Bian
ViT
22
6
0
12 Apr 2023
ElegansNet: a brief scientific report and initial experiments
Francesco Bardozzo
Andrea Terlizzi
Pietro Lio'
R. Tagliaferri
26
1
0
06 Apr 2023
Astroformer: More Data Might not be all you need for Classification
Rishit Dagli
28
7
0
03 Apr 2023
Rethinking Local Perception in Lightweight Vision Transformer
Qi Fan
Huaibo Huang
Jiyang Guan
Ran He
ViT
25
30
0
31 Mar 2023
MoViT: Memorizing Vision Transformers for Medical Image Analysis
Yiqing Shen
Pengfei Guo
Jinpu Wu
Qi Huang
Nhat Le
Jinyuan Zhou
Shanshan Jiang
Mathias Unberath
ViT
MedIm
26
10
0
27 Mar 2023
FER-former: Multi-modal Transformer for Facial Expression Recognition
Yande Li
Mingjie Wang
Minglun Gong
Y. Lu
Li Liu
30
8
0
23 Mar 2023
Learning A Sparse Transformer Network for Effective Image Deraining
Xiang Chen
Hao‐Ran Li
Mingqiang Li
Jin-shan Pan
ViT
35
214
0
21 Mar 2023
Robustifying Token Attention for Vision Transformers
Yong Guo
David Stutz
Bernt Schiele
ViT
21
24
0
20 Mar 2023
Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
Guozhen Zhang
Yuhan Zhu
Hongya Wang
Youxin Chen
Gangshan Wu
Limin Wang
71
84
0
01 Mar 2023
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information
S. Kumar
Abhishek Kumar
Dong-Gyu Lee
14
3
0
25 Feb 2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Zhichao Lu
Chuntao Ding
Felix Juefei Xu
Vishnu Naresh Boddeti
Shangguang Wang
Yun Yang
21
13
0
15 Feb 2023
DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition
Jiayu Jiao
Yuyao Tang
Kun-Li Channing Lin
Yipeng Gao
Jinhua Ma
Yaowei Wang
Wei-Shi Zheng
MedIm
ViT
24
136
0
03 Feb 2023
Robust Transformer with Locality Inductive Bias and Feature Normalization
Omid Nejati Manzari
Hossein Kashiani
Hojat Asgarian Dehkordi
S. B. Shokouhi
ViT
24
14
0
27 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
44
90
0
03 Jan 2023
OAMixer: Object-aware Mixing Layer for Vision Transformers
H. Kang
Sangwoo Mo
Jinwoo Shin
VLM
39
4
0
13 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Andrew Zhang
Guoli Song
Jie Chen
31
1
0
10 Dec 2022
From CNNs to Shift-Invariant Twin Models Based on Complex Wavelets
Hubert Leterme
K. Polisano
V. Perrier
Alahari Karteek
39
1
0
01 Dec 2022