ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09883
  4. Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution

Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 823 papers shown
Title
The effectiveness of MAE pre-pretraining for billion-scale pretraining
The effectiveness of MAE pre-pretraining for billion-scale pretraining
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
126
63
0
23 Mar 2023
ViC-MAE: Self-Supervised Representation Learning from Images and Video
  with Contrastive Masked Autoencoders
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
J. Hernandez
Ruben Villegas
Vicente Ordonez
SSL
33
2
0
21 Mar 2023
Human Pose as Compositional Tokens
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
33
47
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the
  Future
Large AI Models in Health Informatics: Applications, Challenges, and the Future
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
42
127
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
EVA-02: A Visual Representation for Neon Genesis
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
40
259
0
20 Mar 2023
Robustifying Token Attention for Vision Transformers
Robustifying Token Attention for Vision Transformers
Yong Guo
David Stutz
Bernt Schiele
ViT
21
24
0
20 Mar 2023
Internal Structure Attention Network for Fingerprint Presentation Attack
  Detection from Optical Coherence Tomography
Internal Structure Attention Network for Fingerprint Presentation Attack Detection from Optical Coherence Tomography
Hao Sun
Yilong Zhang
Peng Chen
Haixia Wang
Ronghua Liang
40
4
0
20 Mar 2023
LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer
LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer
Rui Li
Xiaowei Zhao
23
3
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image
  Segmentation
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
35
138
0
17 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
21
37
0
17 Mar 2023
High Accurate and Explainable Multi-Pill Detection Framework with Graph
  Neural Network-Assisted Multimodal Data Fusion
High Accurate and Explainable Multi-Pill Detection Framework with Graph Neural Network-Assisted Multimodal Data Fusion
Anh Duy Nguyen
H. Pham
Huynh Thanh Trung
Quoc Viet Hung Nguyen
Thao Nguyen Truong
Phi Le Nguyen
MedIm
24
6
0
17 Mar 2023
ELFIS: Expert Learning for Fine-grained Image Recognition Using Subsets
ELFIS: Expert Learning for Fine-grained Image Recognition Using Subsets
Pablo J. Villacorta
Jesús M. Rodríguez-de-Vera
Marc Bolaños
Ignacio Sarasúa
Bhalaji Nagarajan
Petia Radeva
30
1
0
16 Mar 2023
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness
  with Dataset Reinforcement
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement
Fartash Faghri
Hadi Pouransari
Sachin Mehta
Mehrdad Farajtabar
Ali Farhadi
Mohammad Rastegari
Oncel Tuzel
43
9
0
15 Mar 2023
Fully neuromorphic vision and control for autonomous drone flight
Fully neuromorphic vision and control for autonomous drone flight
Federico Paredes-Valles
J. Hagenaars
Julien Dupeyroux
S. Stroobants
Ying Xu
Guido de Croon
33
36
0
15 Mar 2023
Deep Learning for Iris Recognition: A Review
Deep Learning for Iris Recognition: A Review
Yi Yin
Si-Liang He
Renye Zhang
Hongli Chang
Xu Han
Jinghua Zhang
PILM
29
10
0
15 Mar 2023
Exploring Resiliency to Natural Image Corruptions in Deep Learning using
  Design Diversity
Exploring Resiliency to Natural Image Corruptions in Deep Learning using Design Diversity
Rafael Rosales
Pablo Munoz
Michael Paulitsch
30
2
0
15 Mar 2023
ODIN: On-demand Data Formulation to Mitigate Dataset Lock-in
ODIN: On-demand Data Formulation to Mitigate Dataset Lock-in
SP Choi
Jihun Lee
HyeongSeok Ahn
Sanghee Jung
Bumsoo Kang
VLM
18
0
0
13 Mar 2023
Multi-metrics adaptively identifies backdoors in Federated learning
Multi-metrics adaptively identifies backdoors in Federated learning
Siquan Huang
Yijiang Li
Chong Chen
Leyu Shi
Ying Gao
AAML
43
19
0
12 Mar 2023
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Jierun Chen
Shiu-hong Kao
Hao He
Weipeng Zhuo
Song Wen
Chul-Ho Lee
Shueng-Han Gary Chan
OOD
35
782
0
07 Mar 2023
Fine-Grained ImageNet Classification in the Wild
Fine-Grained ImageNet Classification in the Wild
Maria Lymperaiou
Konstantinos Thomas
Giorgos Stamou
VLM
33
1
0
04 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
163
217
0
03 Mar 2023
Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Sora Takashima
Ryo Hayamizu
Nakamasa Inoue
Hirokatsu Kataoka
Rio Yokota
68
18
0
02 Mar 2023
Time Series as Images: Vision Transformer for Irregularly Sampled Time
  Series
Time Series as Images: Vision Transformer for Irregularly Sampled Time Series
Zekun Li
Shiyang Li
Xifeng Yan
AI4TS
29
47
0
01 Mar 2023
Efficient and Explicit Modelling of Image Hierarchies for Image
  Restoration
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
Yawei Li
Yuchen Fan
Xiaoyu Xiang
D. Demandolx
Rakesh Ranjan
Radu Timofte
Luc Van Gool
35
173
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
  Collaborative AutoML System
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wei Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
43
2
0
01 Mar 2023
Single-Cell Multimodal Prediction via Transformers
Single-Cell Multimodal Prediction via Transformers
Wenzhuo Tang
Haifang Wen
Renming Liu
Jiayuan Ding
Wei Jin
Yuying Xie
Hui Liu
Jiliang Tang
AI4CE
24
11
0
01 Mar 2023
Learning to Generalize towards Unseen Domains via a Content-Aware Style
  Invariant Model for Disease Detection from Chest X-rays
Learning to Generalize towards Unseen Domains via a Content-Aware Style Invariant Model for Disease Detection from Chest X-rays
Mohammad Zunaed
M. Haque
Taufiq Hasan
OOD
20
5
0
27 Feb 2023
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
74
485
0
23 Feb 2023
Human MotionFormer: Transferring Human Motions with Vision Transformers
Human MotionFormer: Transferring Human Motions with Vision Transformers
Hongyu Liu
Xintong Han
Chengbin Jin
Lihui Qian
Huawei Wei
...
Faqiang Wang
Haoye Dong
Yibing Song
Jia Xu
Qifeng Chen
16
11
0
22 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
Hyneter: Hybrid Network Transformer for Object Detection
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
31
3
0
18 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep
  Learning Model Training
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
33
6
0
16 Feb 2023
CholecTriplet2022: Show me a tool and tell me the triplet -- an
  endoscopic vision challenge for surgical action triplet detection
CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection
C. Nwoye
Tong Yu
Saurav Sharma
Aditya Murali
Deepak Alapatt
...
Pietro Mascagni
B. Seeliger
Cristians Gonzalez
Didier Mutter
N. Padoy
32
17
0
13 Feb 2023
Team Triple-Check at Factify 2: Parameter-Efficient Large Foundation
  Models with Feature Representations for Multi-Modal Fact Verification
Team Triple-Check at Factify 2: Parameter-Efficient Large Foundation Models with Feature Representations for Multi-Modal Fact Verification
Wei-Wei Du
Hongfa Wu
Wei-Yao Wang
Wen-Chih Peng
24
5
0
12 Feb 2023
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels
Qi Chen
Chao Li
Jia Ning
Stephen Lin
Kun He
AAML
21
2
0
09 Feb 2023
Knowledge Distillation in Vision Transformers: A Critical Review
Knowledge Distillation in Vision Transformers: A Critical Review
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
29
15
0
04 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial
  Defense
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAML
VLM
52
4
0
02 Feb 2023
FCB-SwinV2 Transformer for Polyp Segmentation
FCB-SwinV2 Transformer for Polyp Segmentation
Kerr Fitzgerald
B. Matuszewski
ViT
MedIm
21
12
0
02 Feb 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image
  and Video
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Haiyang Xu
Qinghao Ye
Mingshi Yan
Yaya Shi
Jiabo Ye
...
Guohai Xu
Ji Zhang
Songfang Huang
Feiran Huang
Jingren Zhou
MLLM
VLM
MoE
43
160
0
01 Feb 2023
Cross-Architectural Positive Pairs improve the effectiveness of
  Self-Supervised Learning
Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
P. Singh
Jacopo Cirrone
SSL
45
0
0
27 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
40
2
0
25 Jan 2023
Connecting metrics for shape-texture knowledge in computer vision
Connecting metrics for shape-texture knowledge in computer vision
Tiago Gaspar Oliveira
Tiago Marques
Arlindo L. Oliveira
19
0
0
25 Jan 2023
ClimaX: A foundation model for weather and climate
ClimaX: A foundation model for weather and climate
Tung Nguyen
Johannes Brandstetter
Ashish Kapoor
Jayesh K. Gupta
Aditya Grover
AI4Cl
AI4CE
11
245
0
24 Jan 2023
Zorro: the masked multimodal transformer
Zorro: the masked multimodal transformer
Adrià Recasens
Jason Lin
João Carreira
Drew Jaegle
Luyu Wang
...
Pauline Luc
Antoine Miech
Lucas Smaira
Ross Hemsley
Andrew Zisserman
39
20
0
23 Jan 2023
Autonomous Rendezvous with Non-cooperative Target Objects with Swarm
  Chasers and Observers
Autonomous Rendezvous with Non-cooperative Target Objects with Swarm Chasers and Observers
Trupti Mahendrakar
Steven Holmberg
A. Ekblad
Emma Conti
Ryan T. White
M. Wilde
Isaac Silver
15
7
0
22 Jan 2023
SuperScaler: Supporting Flexible DNN Parallelization via a Unified
  Abstraction
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Zhiqi Lin
Youshan Miao
Guodong Liu
Xiaoxiang Shi
Quanlu Zhang
...
Xu Cao
Cheng-Wu Li
Mao Yang
Lintao Zhang
Lidong Zhou
24
6
0
21 Jan 2023
FlatFormer: Flattened Window Attention for Efficient Point Cloud
  Transformer
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
Zhijian Liu
Xinyu Yang
Haotian Tang
Shang Yang
Song Han
35
64
0
20 Jan 2023
CSwin2SR: Circular Swin2SR for Compressed Image Super-Resolution
CSwin2SR: Circular Swin2SR for Compressed Image Super-Resolution
Honggui Li
M. Trocan
Mohamad Sawan
Dimitri Galayko
4
4
0
20 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
43
11
0
17 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and
  Future Trends
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
31
126
0
13 Jan 2023
1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification
  Track
1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track
Yilu Guo
Xing-Jian Shi
Weijie Chen
Shicai Yang
Di Xie
Shiliang Pu
Yueting Zhuang
3DGS
14
1
0
12 Jan 2023
Previous
123...121314151617
Next