2307.08579
Cited By
Scale-Aware Modulation Meet Transformer
17 July 2023
Weifeng Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
Papers citing
"Scale-Aware Modulation Meet Transformer"
31 / 31 papers shown
Title
HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective
Yu Zhang
Fengyuan Liu
Juan Lyu
Yi Wei
Changdong Yu
26
0
0
13 May 2025
Unconstrained Salient and Camouflaged Object Detection
Zhangjun Zhou
Yiping Li
Chunlin Zhong
Jianuo Huang
Jialun Pei
He Tang
84
0
0
14 Dec 2024
Symmetric Perception and Ordinal Regression for Detecting Scoliosis Natural Image
Xiaojia Zhu
Rui Chen
Xiaoqi Guo
Z. Shao
Yuhu Dai
Ming Zhang
Chuandong Lang
82
0
0
24 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
47
1
0
12 Nov 2024
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes
Muhammad Ali
Mamoona Javaid
Mubashir Noman
M. Fiaz
Salman Khan
36
0
0
31 Oct 2024
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
Ning Wang
Yun Xiao
Xiaopeng Peng
Xiaojun Chang
Xuanhong Wang
Dingyi Fang
28
2
0
20 Oct 2024
SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance
Shun Zou
Mingya Zhang
Bingjian Fan
Zhengyi Zhou
Xiuguo Zou
Mamba
32
3
0
17 Sep 2024
LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation
Trung Dang
Huy Hoang Nguyen
A. Tiulpin
Mamba
34
3
0
26 Aug 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
33
3
0
26 Jul 2024
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation
Hyunwoo Yu
Yubin Cho
Beoungwoo Kang
Seunghun Moon
Kyeongbo Kong
Suk-Ju Kang
30
3
0
24 Jul 2024
DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention
Xiaoya Tang
Bodong Zhang
Beatrice S. Knudsen
Tolga Tasdizen
ViT
MedIm
50
1
0
18 Jul 2024
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Yuheng Shi
Minjing Dong
Chang Xu
Mamba
48
32
0
23 May 2024
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
51
3
0
22 May 2024
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
HSViT: Horizontally Scalable Vision Transformer
Chenhao Xu
Chang-Tsun Li
Chee Peng Lim
Douglas Creighton
ViT
34
2
0
08 Apr 2024
Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining
Xiang Chen
Jinshan Pan
Jiangxin Dong
AI4CE
38
23
0
02 Apr 2024
Rewrite the Stars
Xu Ma
Xiyang Dai
Yue Bai
Yizhou Wang
Yun Fu
44
95
0
29 Mar 2024
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
Donghyun Kim
Byeongho Heo
Dongyoon Han
50
14
0
28 Mar 2024
IA2U: A Transfer Plugin with Multi-Prior for In-Air Model to Underwater
Jingchun Zhou
Qilin Gai
Kin-Man Lam
Xianping Fu
24
0
0
12 Dec 2023
TransNeXt: Robust Foveal Visual Perception for Vision Transformers
Dai Shi
ViT
23
76
0
28 Nov 2023
Hierarchical Side-Tuning for Vision Transformers
Weifeng Lin
Ziheng Wu
Wentao Yang
Mingxin Huang
Jun Huang
Lianwen Jin
26
8
0
09 Oct 2023
RMT: Retentive Networks Meet Vision Transformers
Qihang Fan
Huaibo Huang
Mingrui Chen
Hongmin Liu
Ran He
ViT
43
75
0
20 Sep 2023
Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition
Qibin Hou
Cheng Lu
Ming-Ming Cheng
Jiashi Feng
ViT
34
129
0
22 Nov 2022
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
Sachin Mehta
Mohammad Rastegari
ViT
218
1,213
0
05 Oct 2021
Mobile-Former: Bridging MobileNet and Transformer
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
ViT
183
476
0
12 Aug 2021
CMT: Convolutional Neural Networks Meet Vision Transformers
Jianyuan Guo
Kai Han
Han Wu
Yehui Tang
Chunjing Xu
Yunhe Wang
Chang Xu
ViT
351
500
0
13 Jul 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
289
3,623
0
24 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Nayeon Lee
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
290
979
0
27 Jan 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,567
0
17 Apr 2017
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,220
0
16 Nov 2016
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,329
0
05 Nov 2016