Swin Transformer V2: Scaling Up Capacity and Resolution


18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
    ViT

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 824 papers shown
RevColV2: Exploring Disentangled Representations in Masked Image Modeling
Qi Han
Yuxuan Cai
Xiangyu Zhang
41
7
0
02 Sep 2023
ARFA: An Asymmetric Receptive Field Autoencoder Model for Spatiotemporal Prediction
Wenxuan Zhang
Xuechao Zou
Li Wu
Xiaoying Wang
Jianqiang Huang
Junliang Xing
24
0
0
01 Sep 2023
SFUSNet: A Spatial-Frequency domain-based Multi-branch Network for diagnosis of Cervical Lymph Node Lesions in Ultrasound Images
Yubiao Yue
Jun Xue
Haihua Liang
Bingchun Luo
Zhenzhang Li
23
0
0
31 Aug 2023
Prompt-enhanced Hierarchical Transformer Elevating Cardiopulmonary Resuscitation Instruction via Temporal Action Segmentation
Yang Liu
Xiao-Yu Zhong
Shiyao Zhai
Zhicheng Du
Zhenyuan Gao
...
V. Pandey
Sanyang Han
Runming Wang
Yuxing Han
Peiwu Qin
MedIm
33
4
0
31 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
38
20
0
27 Aug 2023
Transfer Learning for Microstructure Segmentation with CS-UNet: A Hybrid Algorithm with Transformer and CNN Encoders
Khaled Alrfou
Tian Zhao
Amir Kordijazi
ViT
29
3
0
26 Aug 2023
HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks
Zichao Dong
Weikun Zhang
Xufeng Huang
Hang Ji
Xin Zhan
Junbo Chen
VLM
21
4
0
24 Aug 2023
HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using Harvest Piles and Remote Sensing
Jonathan Xu
Amna Elmustafa
L. Weldegebriel
Emnet Negash
Richard Lee
Chenlin Meng
Stefano Ermon
David B. Lobell
12
3
0
23 Aug 2023
Towards Privacy-Supporting Fall Detection via Deep Unsupervised RGB2Depth Adaptation
Hejun Xiao
Kunyu Peng
Xiangsheng Huang
Alina Roitberg
Hao Li
Zhao Wang
Rainer Stiefelhagen
26
3
0
23 Aug 2023
TurboViT: Generating Fast Vision Transformers via Generative Architecture Search
Alexander Wong
Saad Abbasi
Saeejith Nair
ViT
35
1
0
22 Aug 2023
SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection
Dalong Zheng
Zebin Wu
Jia-Wei Liu
Zhihui Wei
ViT
23
0
0
22 Aug 2023
Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models
Zixing Zhang
Liyizhe Peng
Tao Pang
Jing Han
Huan Zhao
Bjorn W. Schuller
40
13
0
21 Aug 2023
Large Transformers are Better EEG Learners
Bingxin Wang
Xiao-Ying Fu
Yuan Lan
Luchan Zhang
Wei Zheng
Yang Xiang
25
5
0
20 Aug 2023
Efficient Representation Learning for Healthcare with Cross-Architectural Self-Supervision
P. Singh
Jacopo Cirrone
OOD
SSL
27
2
0
19 Aug 2023
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control
Zi-Yuan Hu
Yanyang Li
M. Lyu
Liwei Wang
VLM
35
15
0
18 Aug 2023
The Impact of Background Removal on Performance of Neural Networks for Fashion Image Classification and Segmentation
Junhui Liang
Yong-Jin Liu
Vladimir Vlassov
28
4
0
18 Aug 2023
Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey
Xin Li
Yulin Ren
Xin Jin
Cuiling Lan
Xingyu Wang
Wenjun Zeng
Xinchao Wang
Zhibo Chen
43
86
0
18 Aug 2023
Diverse Cotraining Makes Strong Semi-Supervised Segmentor
Yijiang Li
Xinjiang Wang
Lihe Yang
Xue Jiang
Wayne Zhang
Ying Gao
30
15
0
18 Aug 2023
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
Tobias Christian Nauen
Sebastián M. Palacio
Federico Raue
Andreas Dengel
42
3
0
18 Aug 2023
ICAR: Image-based Complementary Auto Reasoning
Xijun Wang
An-Chun Liang
Junbang Liang
Ming-Shun Lin
Yukuan Lou
Shan Yang
22
1
0
17 Aug 2023
Dynamic Kernel-Based Adaptive Spatial Aggregation for Learned Image Compression
Huairui Wang
Nianxiang Fu
Zhenzhong Chen
Shanghui Liu
33
2
0
17 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
30
13
0
11 Aug 2023
Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Guangyao Li
Wenxuan Hou
Di Hu
37
26
0
10 Aug 2023
FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light Vision
K. Hashmi
Goutham Kallempudi
D. Stricker
Muhammad Zeshan Afzal
38
19
0
07 Aug 2023
Revealing the Underlying Patterns: Investigating Dataset Similarity, Performance, and Generalization
Akshit Achara
R. Pandey
SSL
47
0
0
07 Aug 2023
A Hybrid CNN-Transformer Architecture with Frequency Domain Contrastive Learning for Image Deraining
Cheng-i Wang
Wei Li
42
0
0
07 Aug 2023
Multi-scale Alternated Attention Transformer for Generalized Stereo Matching
Wei Miao
Hong Zhao
Tom Tongjia Chen
Wei Huang
Changyan Xiao
ViT
26
0
0
06 Aug 2023
M2Former: Multi-Scale Patch Selection for Fine-Grained Visual Recognition
Ji-Hee Moon
Junseok K. Lee
Yu-Ling Lee
Seongsik Park
37
4
0
04 Aug 2023
DETR Doesn't Need Multi-Scale or Locality Design
Yutong Lin
Yuhui Yuan
Zheng-Wei Zhang
Chen Li
Nanning Zheng
Han Hu
37
5
0
03 Aug 2023
Data Augmentation for Human Behavior Analysis in Multi-Person Conversations
Kun Li
Dan Guo
Guoliang Chen
Feiyang Liu
Meng Wang
ViT
30
8
0
03 Aug 2023
PPI-NET: End-to-End Parametric Primitive Inference
Liang Wang
Xiaogang Wang
35
1
0
03 Aug 2023
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
35
4
0
02 Aug 2023
Continual Domain Adaptation on Aerial Images under Gradually Degrading Weather
C. S. Jahan
Andreas E. Savakis
11
1
0
02 Aug 2023
PVG: Progressive Vision Graph for Vision Recognition
Jiafu Wu
Jian Li
Jiangning Zhang
Boshen Zhang
M. Chi
Yabiao Wang
Chengjie Wang
ViT
35
13
0
01 Aug 2023
StylePrompter: All Styles Need Is Attention
Chenyi Zhuang
Pan Gao
A. Smolic
38
1
0
30 Jul 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
36
13
0
27 Jul 2023
MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation
R. Birkl
Diana Wofk
Matthias Muller
MDE
29
134
0
26 Jul 2023
Controllable Guide-Space for Generalizable Face Forgery Detection
Yingjie Guo
Cheng Zhen
Pengfei Yan
CVBM
AAML
40
21
0
26 Jul 2023
E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning
Cheng Han
Qifan Wang
Yiming Cui
Zhiwen Cao
Wenguan Wang
Siyuan Qi
Dongfang Liu
VPVLM
VLM
27
48
0
25 Jul 2023
Is attention all you need in medical image analysis? A review
G. Papanastasiou
Nikolaos Dikaios
Jiahao Huang
Chengjia Wang
Guang Yang
ViT
MedIm
25
23
0
24 Jul 2023
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation
Jinjing Zhu
Yuan Luo
Xueye Zheng
Hao Wang
Lin Wang
25
33
0
24 Jul 2023
Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision
Haonan Guo
Bo Du
Chen Wu
Xin Su
Lefei Zhang
21
0
0
23 Jul 2023
An Intelligent Remote Sensing Image Quality Inspection System
Yi Yu
Tao Wang
Kang Ran
Changjiang Li
Hao Wu
27
1
0
22 Jul 2023
MatSpectNet: Material Segmentation Network with Domain-Aware and Physically-Constrained Hyperspectral Reconstruction
Yuwen Heng
Yihong Wu
Jiawen Chen
S. Dasmahapatra
Hansung Kim
24
1
0
21 Jul 2023
Strip-MLP: Efficient Token Interaction for Vision MLP
Guiping Cao
Shengda Luo
Wen-Fong Huang
X. Lan
D. Jiang
Yaowei Wang
Jianguo Zhang
40
10
0
21 Jul 2023
Meta-Transformer: A Unified Framework for Multimodal Learning
Yiyuan Zhang
Kaixiong Gong
Kaipeng Zhang
Hongsheng Li
Yu Qiao
Wanli Ouyang
Xiangyu Yue
33
137
0
20 Jul 2023
Findings of Factify 2: Multimodal Fake News Detection
S. Suryavardan
Shreyash Mishra
Megha Chakraborty
Parth Patwa
Anku Rani
...
Amitava Das
Amit P. Sheth
Manoj Kumar Chinnakotla
Asif Ekbal
Srijan Kumar
30
14
0
19 Jul 2023
Watch out Venomous Snake Species: A Solution to SnakeCLEF2023
Feiran Hu
Peng Wang
Yangyang Li
Chenlong Duan
Zijian Zhu
Fei Wang
Faen Zhang
Yong Li
Xiu-Shen Wei
30
6
0
19 Jul 2023
NTIRE 2023 Quality Assessment of Video Enhancement Challenge
Xiaohong Liu
Xiongkuo Min
Wei Sun
Yulun Zhang
Peng Sun
...
Te Shi
Azadeh Mansouri
Hossein Motamednia
Amirhossein Bakhtiari
Ahmad Mahmoudi-Aznaveh
36
18
0
19 Jul 2023
RepViT: Revisiting Mobile CNN From ViT Perspective
Ao Wang
Hui Chen
Zijia Lin
Hengjun Pu
Guiguang Ding
34
180
0
18 Jul 2023