2211.11943
Cited By
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
22 November 2022
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
Papers citing
"Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition"
50 / 72 papers shown
Title
FreCT: Frequency-augmented Convolutional Transformer for Robust Time Series Anomaly Detection
Wenxin Zhang
Ding Xu
Guangzhen Yao
Xiaojian Lin
Renxiang Guan
Chengze Du
Renda Han
Xi Xuan
Cuicui Luo
AI4TS
54
0
0
02 May 2025
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation
Guoyi Zhang
Siyang Chen
Guangsheng Xu
Han Wang
Xiaohu Zhang
34
0
0
20 Apr 2025
DefMamba: Deformable Visual State Space Model
Leiye Liu
Miao Zhang
Jihao Yin
Tingwei Liu
Wei Ji
Yongri Piao
Huchuan Lu
Mamba
55
0
0
08 Apr 2025
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
Bo Yin
Jiao-Long Cao
Ming-Ming Cheng
Qibin Hou
3DPC
MDE
48
0
0
07 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
39
0
0
31 Mar 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
39
0
0
29 Mar 2025
Offline Meteorology-Pollution Coupling Global Air Pollution Forecasting Model with Bilinear Pooling
Xu Fan
Yuetan Lin
Bing Gong
H. Li
OffRL
AI4CE
39
0
0
24 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
166
0
0
05 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
112
1
0
27 Feb 2025
A Collaborative Jade Recognition System for Mobile Devices Based on Lightweight and Large Models
Zhenyu Wang
Wenjia Li
Pengyu Zhu
51
0
0
21 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Ming-Ming Cheng
ObjD
150
51
0
21 Feb 2025
All-in-One Image Compression and Restoration
Huimin Zeng
Jiacheng Li
Ziqiang Zheng
Zhiwei Xiong
83
1
0
05 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
70
0
0
26 Jan 2025
Learning Dynamic Local Context Representations for Infrared Small Target Detection
Guoyi Zhang
Guangsheng Xu
Han Wang
Siyang Chen
Yunxiao Shan
Xiaohu Zhang
29
1
0
23 Dec 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
J. T. Wang
93
1
0
25 Nov 2024
Multi-Token Enhancing for Vision Representation Learning
Zhong-Yu Li
Yu-Song Hu
Bo Yin
Ming-Ming Cheng
66
1
0
24 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
40
0
0
12 Nov 2024
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes
Muhammad Ali
Mamoona Javaid
Mubashir Noman
M. Fiaz
Salman Khan
29
0
0
31 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
31
2
0
11 Oct 2024
SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
53
5
0
15 Sep 2024
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Yonghao Yu
Dongcheng Zhao
Guobin Shen
Yiting Dong
Yi Zeng
45
0
0
11 Sep 2024
DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
Xiaoya Tang
Bodong Zhang
Beatrice S. Knudsen
Tolga Tasdizen
ViT
MedIm
42
1
0
18 Jul 2024
AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs
Yunling Zheng
Zeyi Xu
Fanghui Xue
Biao Yang
Jiancheng Lyu
Shuai Zhang
Y. Qi
Jack Xin
48
0
0
16 Jul 2024
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Pierre-David Létourneau
Manish Kumar Singh
Hsin-Pai Cheng
Shizhong Han
Yunxiao Shi
Dalton Jones
M. H. Langston
Hong Cai
Fatih Porikli
34
0
0
16 Jul 2024
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao
Xinggang Wang
Lianghui Zhu
Qian Zhang
Chang Huang
51
4
0
28 May 2024
Efficient Degradation-aware Any Image Restoration
Eduard Zamfir
Zongwei Wu
Nancy Mehta
Danda Pani Paudel
Yulun Zhang
Radu Timofte
27
5
0
24 May 2024
Infinite-Dimensional Feature Interaction
Chenhui Xu
Fuxun Yu
Maoliang Li
Zihao Zheng
Zirui Xu
Jinjun Xiong
Xiang Chen
34
1
0
22 May 2024
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
39
3
0
22 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
38
5
0
22 May 2024
MambaOut: Do We Really Need Mamba for Vision?
Weihao Yu
Xinchao Wang
Mamba
45
48
0
13 May 2024
Rewrite the Stars
Xu Ma
Xiyang Dai
Yue Bai
Yizhou Wang
Yun Fu
28
94
0
29 Mar 2024
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
40
17
0
29 Mar 2024
ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding
Novendra Setyawan
Ghufron Wahyu Kurniawan
Chi-Chia Sun
Jun-Wei Hsieh
Hui-Kai Su
W. Kuo
ViT
MoE
29
0
0
22 Mar 2024
LSKNet: A Foundation Lightweight Backbone for Remote Sensing
Yuxuan Li
Xiang Li
Yimian Dai
Qibin Hou
Li Liu
Yongxiang Liu
Ming-Ming Cheng
Jian Yang
34
31
0
18 Mar 2024
Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution
Jinchen Zhu
Mingjian Zhang
Ling Zheng
Shizhuang Weng
34
0
0
11 Mar 2024
Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction
Yonghao Dong
Le Wang
Sanping Zhou
Gang Hua
Changyin Sun
37
5
0
09 Mar 2024
See More Details: Efficient Image Super-Resolution by Experts Mining
Eduard Zamfir
Zongwei Wu
Nancy Mehta
Yulun Zhang
Radu Timofte
SupR
46
9
0
05 Feb 2024
Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
Gang Wu
Junjun Jiang
Junpeng Jiang
Xianming Liu
SupR
41
7
0
11 Jan 2024
Recaptured Raw Screen Image and Video Demoiréing via Channel and Spatial Modulations
Huanjing Yue
Yijia Cheng
Xin Liu
Jingyu Yang
43
6
0
31 Oct 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
ViT
41
36
0
30 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
29
4
0
10 Oct 2023
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
34
75
0
20 Sep 2023
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation
Bo Yin
Xuying Zhang
Zhongyu Li
Li Liu
Ming-Ming Cheng
Qibin Hou
24
43
0
18 Sep 2023
ARFA: An Asymmetric Receptive Field Autoencoder Model for Spatiotemporal Prediction
Wenxuan Zhang
Xuechao Zou
Li Wu
Xiaoying Wang
Jianqiang Huang
Junliang Xing
19
0
0
01 Sep 2023
ADNet: Lane Shape Prediction via Anchor Decomposition
Lingyu Xiao
Xiang Li
Sen Yang
Wankou Yang
28
19
0
21 Aug 2023
Efficient Monaural Speech Enhancement using Spectrum Attention Fusion
Jinyu Long
Jetic Gū
Binhao Bai
Zhibo Yang
Pingsun Wei
Junli Li
17
0
0
04 Aug 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
32
175
0
18 Jul 2023
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
20
66
0
17 Jul 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
37
28
0
01 Jun 2023
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging
Junyu Wang
Shijie Wang
Ruijie Zhang
Zengqiang Zheng
Wenyu Liu
Xinggang Wang
19
1
0
16 May 2023