Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.09883
Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 823 papers shown
Title
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
Rohit Menon
Nils Dengler
Sicong Pan
Gokul Krishna Chenchani
Maren Bennewitz
EDL
91
0
0
06 Mar 2025
Computational Analysis of Degradation Modeling in Blind Panoramic Image Quality Assessment
Jiebin Yan
Ziwen Tan
Jiale Rao
Lei Wu
Yifan Zuo
Yuming Fang
52
0
0
05 Mar 2025
Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer
Yury Belousov
Vitaliy Kinakh
Teddy Furon
S. Voloshynovskiy
AAML
77
0
0
05 Mar 2025
Adaptive Camera Sensor for Vision Models
Eunsu Baek
Sunghwan Han
Taesik Gong
Hyung-Sin Kim
VLM
Presented at
ResearchTrend Connect | VLM
on
28 Mar 2025
164
0
0
04 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
69
0
0
03 Mar 2025
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting
Ali Caglayan
Nevrez Imamoglu
T. Kouyama
67
0
0
03 Mar 2025
Investigating the contribution of terrain-following coordinates and conservation schemes in AI-driven precipitation forecasts
Yingkai Sha
John S. Schreck
William E. Chapman
David John Gagne II
35
1
0
01 Mar 2025
FLStore: Efficient Federated Learning Storage for non-training workloads
Ahmad Faraz Khan
Samuel Fountain
Ahmed M. Abdelmoniem
A. R. Butt
A. Anwar
FedML
48
0
0
01 Mar 2025
Robust and Efficient Writer-Independent IMU-Based Handwriting Recognization
Jindong Li
Tim Hamann
Jens Barth
Peter Kaempf
Dario Zanca
Bjoern M. Eskofier
41
0
0
28 Feb 2025
Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions
Palawat Busaranuvong
Emmanuel O. Agu
Reza Saadati Fard
Deepak Kumar
Shefalika Gautam
B. Tulu
Diane Strong
MedIm
60
0
0
27 Feb 2025
GONet: A Generalizable Deep Learning Model for Glaucoma Detection
Or Abramovich
Hadas Pizem
Jonathan Fhima
Eran Berkowitz
Ben Gofrit
...
Meital Baskin
Jan Van Eijgen
Ingeborg Stalmans
E. Blumenthal
Joachim A. Behar
64
1
0
26 Feb 2025
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
Paul Koch
Marian Schluter
Jörg Krüger
76
0
0
24 Feb 2025
MaxGlaViT: A novel lightweight vision transformer-based approach for early diagnosis of glaucoma stages from fundus images
Mustafa Yurdakul
Kubra Uyar
Şakir Tasdemir
58
1
0
24 Feb 2025
MEX: Memory-efficient Approach to Referring Multi-Object Tracking
Huu-Thien Tran
Phuoc-Sang Pham
Thai-Son Tran
Khoa Luu
VOT
81
1
0
20 Feb 2025
Without Paired Labeled Data: An End-to-End Self-Supervised Paradigm for UAV-View Geo-Localization
Zhongwei Chen
Zhao-Xu Yang
Hai-Jun Rong
SSL
56
0
0
17 Feb 2025
Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization
Yuanze Xu
Ming Dai
Wenxiao Cai
Wankou Yang
72
0
0
17 Feb 2025
Learning Musical Representations for Music Performance Question Answering
Xingjian Diao
Chunhui Zhang
Tingxuan Wu
Ming Cheng
Z. Ouyang
Weiyi Wu
Jiang Gui
73
7
0
10 Feb 2025
Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection
Dongsu Song
Daehwa Ko
Jay Hoon Jung
AAML
64
0
0
10 Feb 2025
Integrating Sequence and Image Modeling in Irregular Medical Time Series Through Self-Supervised Learning
Liuqing Chen
Shuhong Xiao
Shixian Ding
Shanhai Hu
Lingyun Sun
71
0
0
10 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
61
0
0
07 Feb 2025
Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge
Anh-Kiet Duong
Petra Gomez-Krämer
35
2
0
27 Jan 2025
A margin-based replacement for cross-entropy loss
Michael W. Spratling
Heiko H. Schütt
68
0
0
21 Jan 2025
A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Kaiyuan Tian
Linbo Qiao
Baihui Liu
Gongqingjian Jiang
Dongsheng Li
36
0
0
21 Jan 2025
DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains
Junyu Xia
Jiesong Bai
Yihang Dong
ViT
74
0
0
21 Jan 2025
Keypoint Aware Masked Image Modelling
Madhava Krishna
Convin.AI
73
0
0
03 Jan 2025
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
152
612
0
31 Dec 2024
Adaptive Dataset Quantization
Muquan Li
Dongyang Zhang
Qiang Dong
Xiurui Xie
Ke Qin
DD
MQ
88
0
0
22 Dec 2024
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Jiazhou Zhou
Lin Wang
Xuming Hu
74
4
0
22 Dec 2024
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
Tien-Yu Chi
Hung-Yueh Chiang
Chi-Chih Chang
N. Huang
Kai-Chiang Wu
90
0
0
21 Dec 2024
Safety Monitoring of Machine Learning Perception Functions: a Survey
Raul Sena Ferreira
Joris Guérin
Kevin Delmas
Jérémie Guiochet
H. Waeselynck
72
0
0
09 Dec 2024
Gesture Classification in Artworks Using Contextual Image Features
Azhar Hussian
Mathias Zinnen
Thi My Hang Tran
Andreas Maier
Vincent Christlein
82
0
0
04 Dec 2024
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam
M. Zaheer
Arif Mahmood
Karthik Nandakumar
Naveed Akhtar
DiffM
85
2
0
03 Dec 2024
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
Jiamian Hu
Yuanyuan Hong
Yihua Chen
He Wang
Moriaki Yasuhara
68
0
0
03 Dec 2024
MeasureNet: Measurement Based Celiac Disease Identification
Aayush Kumar Tyagi
Vaibhav Mishra
Ashok Tiwari
Lalita Mehra
Prasenjit Das
G. Makharia
Prathosh AP
Mausam
85
0
0
02 Dec 2024
STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation
Sunghun Yang
Minhyeok Lee
Suhwan Cho
Jungho Lee
Sangyoun Lee
MDE
85
0
0
02 Dec 2024
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Alice Heiman
Xiaoman Zhang
E. Chen
Sung Eun Kim
Pranav Rajpurkar
HILM
MedIm
82
0
0
27 Nov 2024
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning
Hoàng-Ân Lê
P. Berg
Minh Pham
69
0
0
26 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
73
0
0
25 Nov 2024
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing
Hao Liu
Mamba
AI4CE
77
1
0
22 Nov 2024
ReXrank: A Public Leaderboard for AI-Powered Radiology Report Generation
Xiaoman Zhang
Hong-Yu Zhou
Xiaoli Yang
Oishi Banerjee
J. N. Acosta
Josh Miller
Ouwen Huang
Pranav Rajpurkar
LM&MA
72
3
0
22 Nov 2024
Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach
Vaishnavi Khindkar
V. Balasubramanian
Chetan Arora
A. Subramanian
C. V. Jawahar
74
0
0
20 Nov 2024
Emotional Images: Assessing Emotions in Images and Potential Biases in Generative Models
Maneet Mehta
Cody Buntain
EGVM
32
2
0
08 Nov 2024
Confidence Calibration of Classifiers with Many Classes
Adrien LeCoz
Stéphane Herbin
Faouzi Adjed
UQCV
37
1
0
05 Nov 2024
AM Flow: Adapters for Temporal Processing in Action Recognition
Tanay Agrawal
Abid Ali
A. Dantcheva
François Brémond
39
0
0
04 Nov 2024
MamT
4
^4
4
: Multi-view Attention Networks for Mammography Cancer Classification
Alisher Ibragimov
Sofya Senotrusova
Arsenii Litvinov
E. Ushakov
E. Karpulevich
Yury Markin
44
0
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
26
0
0
31 Oct 2024
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
Jia Fu
Xiao Zhang
Sepideh Pashami
Fatemeh Rahimian
Anders Holst
DiffM
AAML
32
0
0
31 Oct 2024
Context-Aware Token Selection and Packing for Enhanced Vision Transformer
Tianyi Zhang
B. Li
Jae-sun Seo
Yu Cao
35
0
0
31 Oct 2024
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
Adrian Iordache
B. Alexe
Radu Tudor Ionescu
31
1
0
29 Oct 2024
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection
Jia Wei
Yun Li
Xiaomao Fan
Wenjun Ma
Meiyu Qiu
Hongyu Chen
Wenbin Lei
16
0
0
29 Oct 2024
Previous
1
2
3
4
5
...
15
16
17
Next