Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
    ViT
ArXiv (abs) | PDF | HTML | GitHub (14834★)

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 840 papers shown
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
78
4
0
18 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
82
8
0
16 Feb 2023
CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection
C. Nwoye
Tong Yu
Saurav Sharma
Aditya Murali
Deepak Alapatt
...
Pietro Mascagni
B. Seeliger
Cristians Gonzalez
Didier Mutter
N. Padoy
102
20
0
13 Feb 2023
Team Triple-Check at Factify 2: Parameter-Efficient Large Foundation Models with Feature Representations for Multi-Modal Fact Verification
Wei-Wei Du
Hongfa Wu
Wei-Yao Wang
Chao-Han Huck Yang
72
7
0
12 Feb 2023
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels
Qi Chen
Chao Li
Jia Ning
Stephen Lin
Kun He
AAML
77
2
0
09 Feb 2023
Knowledge Distillation in Vision Transformers: A Critical Review
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
89
16
0
04 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAML VLM
99
4
0
02 Feb 2023
FCB-SwinV2 Transformer for Polyp Segmentation
Kerr Fitzgerald
B. Matuszewski
ViT MedIm
72
13
0
02 Feb 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Haiyang Xu
Qinghao Ye
Mingshi Yan
Yaya Shi
Jiabo Ye
...
Guohai Xu
Ji Zhang
Songfang Huang
Feiran Huang
Jingren Zhou
MLLM VLM MoE
116
171
0
01 Feb 2023
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
P. Singh
Jacopo Cirrone
SSL
117
0
0
27 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
106
3
0
25 Jan 2023
Connecting metrics for shape-texture knowledge in computer vision
Tiago Gaspar Oliveira
Tiago Marques
Arlindo L. Oliveira
23
0
0
25 Jan 2023
ClimaX: A foundation model for weather and climate
Tung Nguyen
Johannes Brandstetter
Ashish Kapoor
Jayesh K. Gupta
Aditya Grover
AI4Cl AI4CE
113
271
0
24 Jan 2023
Zorro: the masked multimodal transformer
Adrià Recasens
Jason Lin
João Carreira
Drew Jaegle
Luyu Wang
...
Pauline Luc
Antoine Miech
Lucas Smaira
Ross Hemsley
Andrew Zisserman
92
21
0
23 Jan 2023
Autonomous Rendezvous with Non-cooperative Target Objects with Swarm Chasers and Observers
Trupti Mahendrakar
Steven Holmberg
A. Ekblad
Emma Conti
Ryan T. White
M. Wilde
Isaac Silver
27
7
0
22 Jan 2023
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Zhiqi Lin
Youshan Miao
Guodong Liu
Xiaoxiang Shi
Quanlu Zhang
...
Xu Cao
Cheng-Wu Li
Mao Yang
Lintao Zhang
Lidong Zhou
58
6
0
21 Jan 2023
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
Zhijian Liu
Xinyu Yang
Haotian Tang
Shang Yang
Song Han
99
69
0
20 Jan 2023
CSwin2SR: Circular Swin2SR for Compressed Image Super-Resolution
Honggui Li
M. Trocan
Mohamad Sawan
Dimitri Galayko
21
4
0
20 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
95
11
0
17 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
228
158
0
13 Jan 2023
1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track
Yilu Guo
Xing-Jian Shi
Weijie Chen
Shicai Yang
Di Xie
Shiliang Pu
Yueting Zhuang
3DGS
34
1
0
12 Jan 2023
Vision Transformers Are Good Mask Auto-Labelers
Shiyi Lan
Xitong Yang
Zhiding Yu
Zuxuan Wu
J. Álvarez
Anima Anandkumar
ISeg ViT MedIm
95
19
0
10 Jan 2023
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token
Jia Ning
Chen Li
Zheng Zhang
Zigang Geng
Qi Dai
Kun He
Han Hu
128
46
0
05 Jan 2023
Towards Long-Term Time-Series Forecasting: Feature, Pattern, and Distribution
Yan Li
Xin Lu
Haoyi Xiong
Jian Tang
Jian Su
Bo Jin
Dejing Dou
AI4TS
70
27
0
05 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Sucheng Ren
Fangyun Wei
Zheng Zhang
Han Hu
139
42
0
03 Jan 2023
Rethinking Mobile Block for Efficient Attention-based Models
Jiangning Zhang
Xiangtai Li
Jian Li
Liang Liu
Zhucun Xue
Boshen Zhang
Zhe Jiang
Tianxin Huang
Yabiao Wang
Chengjie Wang
MQ
150
102
0
03 Jan 2023
Edge Enhanced Image Style Transfer via Transformers
Chi Zhang
Jun Yang
Zaiyan Dai
Peng-Xia Cao
55
10
0
02 Jan 2023
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
67
9
0
29 Dec 2022
Local Learning on Transformers via Feature Reconstruction
P. Pathak
Jingwei Zhang
Dimitris Samaras
ViT
123
5
0
29 Dec 2022
Boosting Out-of-Distribution Detection with Multiple Pre-trained Models
Feng Xue
Zi He
Chuanlong Xie
Falong Tan
Zhenguo Li
OODD
123
7
0
24 Dec 2022
Reversible Column Networks
Yuxuan Cai
Yi Zhou
Qi Han
Jianjian Sun
Xiangwen Kong
Jun Yu Li
Xiangyu Zhang
VLM
92
58
0
22 Dec 2022
SLGTformer: An Attention-Based Approach to Sign Language Recognition
Neil Song
Yu Xiang
SLR
58
0
0
21 Dec 2022
Universal Object Detection with Large Vision Model
Feng-Huei Lin
Wenze Hu
Yaowei Wang
Yonghong Tian
Guangming Lu
Fanglin Chen
Yong-mei Xu
Xiaoyu Wang
VLM ObjD
98
8
0
19 Dec 2022
Analysis and application of multispectral data for water segmentation using machine learning
Shubham Gupta
D. Uma
R. Hebbar
97
0
0
16 Dec 2022
Attentive Mask CLIP
Yifan Yang
Weiquan Huang
Yixuan Wei
Houwen Peng
Xinyang Jiang
...
Fangyun Wei
Yin Wang
Han Hu
Lili Qiu
Yuqing Yang
CLIP VLM
83
27
0
16 Dec 2022
Rethinking Vision Transformers for MobileNet Size and Speed
Yanyu Li
Ju Hu
Yang Wen
Georgios Evangelidis
Kamyar Salahi
Yanzhi Wang
Sergey Tulyakov
Jian Ren
ViT
112
169
0
15 Dec 2022
FlexiViT: One Model for All Patch Sizes
Lucas Beyer
Pavel Izmailov
Alexander Kolesnikov
Mathilde Caron
Simon Kornblith
Xiaohua Zhai
Matthias Minderer
Michael Tschannen
Ibrahim Alabdulmohsin
Filip Pavetić
VLM
150
94
0
15 Dec 2022
Vision Transformers are Parameter-Efficient Audio-Visual Learners
Yan-Bo Lin
Yi-Lin Sung
Jie Lei
Joey Tianyi Zhou
Gedas Bertasius
105
78
0
15 Dec 2022
What do Vision Transformers Learn? A Visual Exploration
Amin Ghiasi
Hamid Kazemi
Eitan Borgnia
Steven Reich
Manli Shu
Micah Goldblum
A. Wilson
Tom Goldstein
ViT
91
64
0
13 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Zhang
Guoli Song
Jie Chen
98
1
0
10 Dec 2022
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
Jishnu Mukhoti
Tsung-Yu Lin
Omid Poursaeed
Rui Wang
Ashish Shah
Philip Torr
Ser-Nam Lim
VLM
127
83
0
09 Dec 2022
Spurious Features Everywhere -- Large-Scale Detection of Harmful Spurious Features in ImageNet
Yannic Neuhaus
Maximilian Augustin
Valentyn Boreiko
Matthias Hein
AAML
127
32
0
09 Dec 2022
Mitigation of Spatial Nonstationarity with Vision Transformers
Lei Liu
Javier E. Santos
Maša Prodanović
Michael J. Pyrcz
50
4
0
09 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM DiffM
106
42
0
07 Dec 2022
ResFormer: Scaling ViTs with Multi-Resolution Training
Rui Tian
Zuxuan Wu
Qiuju Dai
Hang-Rui Hu
Yu Qiao
Yu-Gang Jiang
ViT
95
35
0
01 Dec 2022
Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing
Nataniel Ruiz
Sarah Adel Bargal
Cihang Xie
Kate Saenko
Stan Sclaroff
ViT
69
5
0
29 Nov 2022
Transferability Estimation Based On Principal Gradient Expectation
Huiyan Qi
Lechao Cheng
Jingjing Chen
Yue Yu
Xue Song
Zunlei Feng
Yueping Jiang
82
2
0
29 Nov 2022
Metal-conscious Embedding for CBCT Projection Inpainting
F. Fan
Yangkong Wang
L. Ritschl
R. Biniazan
M. Beister
Björn Kreher
Yixing Huang
Steffen Kappler
Andreas Maier
MedIm
23
0
0
29 Nov 2022
Class Adaptive Network Calibration
Bingyuan Liu
Jérôme Rony
Adrian Galdran
Jose Dolz
Ismail Ben Ayed
94
10
0
28 Nov 2022
UperFormer: A Multi-scale Transformer-based Decoder for Semantic Segmentation
Jing Xu
W. Shi
Pan Gao
Zhengwei Wang
Qizhu Li
ViT
27
2
0
25 Nov 2022