ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.13797
  4. Cited By
PVT v2: Improved Baselines with Pyramid Vision Transformer

PVT v2: Improved Baselines with Pyramid Vision Transformer

25 June 2021
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
    ViT
    AI4TS
ArXivPDFHTML

Papers citing "PVT v2: Improved Baselines with Pyramid Vision Transformer"

50 / 551 papers shown
Title
Toward a Deeper Understanding: RetNet Viewed through Convolution
Toward a Deeper Understanding: RetNet Viewed through Convolution
Chenghao Li
Chaoning Zhang
ViT
35
7
0
11 Sep 2023
Gall Bladder Cancer Detection from US Images with Only Image Level
  Labels
Gall Bladder Cancer Detection from US Images with Only Image Level Labels
Soumen Basu
Ashish Papanai
Mayank Gupta
Pankaj Gupta
Chetan Arora
17
6
0
11 Sep 2023
A survey on efficient vision transformers: algorithms, techniques, and
  performance benchmarking
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
Lorenzo Papa
Paolo Russo
Irene Amerini
Luping Zhou
30
42
0
05 Sep 2023
Mask-Attention-Free Transformer for 3D Instance Segmentation
Mask-Attention-Free Transformer for 3D Instance Segmentation
Xin Lai
Yuhui Yuan
Ruihang Chu
Yukang Chen
Han Hu
Jiaya Jia
MedIm
ISeg
3DPC
40
30
0
04 Sep 2023
Large Separable Kernel Attention: Rethinking the Large Kernel Attention
  Design in CNN
Large Separable Kernel Attention: Rethinking the Large Kernel Attention Design in CNN
Kin Wai Lau
L. Po
Yasar Abbas Ur Rehman
VLM
24
200
0
04 Sep 2023
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
Zhuofan Xia
Xuran Pan
Shiji Song
Li Erran Li
Gao Huang
ViT
27
24
0
04 Sep 2023
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor
  Formula for Image Dehazing
MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu
Kaihao Zhang
Chenxi Wang
Wenhan Luo
Hongdong Li
Zhi Jin
ViT
34
84
0
27 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
34
20
0
27 Aug 2023
Unlocking Fine-Grained Details with Wavelet-based High-Frequency
  Enhancement in Transformers
Unlocking Fine-Grained Details with Wavelet-based High-Frequency Enhancement in Transformers
Reza Azad
A. Kazerouni
Alaa Sulaiman
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
Abin Jose
Dorit Merhof
ViT
MedIm
23
9
0
25 Aug 2023
Learning Heavily-Degraded Prior for Underwater Object Detection
Learning Heavily-Degraded Prior for Underwater Object Detection
C. Fu
Xin-Yue Fan
Jiewen Xiao
Wanqi Yuan
Risheng Liu
Zhongxuan Luo
24
22
0
24 Aug 2023
Vision Transformer Adapters for Generalizable Multitask Learning
Vision Transformer Adapters for Generalizable Multitask Learning
Deblina Bhattacharjee
Sabine Süsstrunk
Mathieu Salzmann
ViT
21
8
0
23 Aug 2023
SG-Former: Self-guided Transformer with Evolving Token Reallocation
SG-Former: Self-guided Transformer with Evolving Token Reallocation
Sucheng Ren
Xingyi Yang
Songhua Liu
Xinchao Wang
ViT
27
41
0
23 Aug 2023
SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows
  from Noisy Labels
SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels
Han Yang
Tianyu Wang
Xiao Hu
Chi-Wing Fu
NoLa
51
13
0
23 Aug 2023
A Benchmark Study on Calibration
A Benchmark Study on Calibration
Linwei Tao
Younan Zhu
Haolan Guo
Minjing Dong
Chang Xu
21
9
0
23 Aug 2023
Transformer-based Detection of Microorganisms on High-Resolution Petri
  Dish Images
Transformer-based Detection of Microorganisms on High-Resolution Petri Dish Images
Nikolas Ebert
D. Stricker
Oliver Wasenmüller
MedIm
ViT
31
4
0
18 Aug 2023
Multi-scale Target-Aware Framework for Constrained Image Splicing
  Detection and Localization
Multi-scale Target-Aware Framework for Constrained Image Splicing Detection and Localization
Yuxuan Tan
Yuanman Li
Li Zeng
J. Ye
Wei Wang
Xia Li
33
6
0
18 Aug 2023
Diverse Cotraining Makes Strong Semi-Supervised Segmentor
Diverse Cotraining Makes Strong Semi-Supervised Segmentor
Yijiang Li
Xinjiang Wang
Lihe Yang
Xue Jiang
Wayne Zhang
Ying Gao
24
15
0
18 Aug 2023
Frequency Perception Network for Camouflaged Object Detection
Frequency Perception Network for Camouflaged Object Detection
Runmin Cong
Mengyao Sun
Sanyi Zhang
Xiaofei Zhou
Wei Zhang
Yao-Min Zhao
ObjD
31
54
0
17 Aug 2023
Improving Audio-Visual Segmentation with Bidirectional Generation
Improving Audio-Visual Segmentation with Bidirectional Generation
Dawei Hao
Yuxin Mao
Bowen He
Xiaodong Han
Yuchao Dai
Yiran Zhong
VOS
VGen
33
30
0
16 Aug 2023
Large-kernel Attention for Efficient and Robust Brain Lesion
  Segmentation
Large-kernel Attention for Efficient and Robust Brain Lesion Segmentation
Liam Chalcroft
Ruben Lourencco Pereira
Mikael Brudfors
Andrew S. Kayser
M. D’Esposito
Cathy J. Price
Ioannis Pappas
John Ashburner
ViT
3DV
MedIm
29
8
0
14 Aug 2023
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
Yichen Yuan
Yifan Wang
Lijun Wang
Xiaoqi Zhao
Huchuan Lu
Yu Wang
Wei Su
Lei Zhang
VOS
24
7
0
13 Aug 2023
M&M: Tackling False Positives in Mammography with a Multi-view and
  Multi-instance Learning Sparse Detector
M&M: Tackling False Positives in Mammography with a Multi-view and Multi-instance Learning Sparse Detector
Yen Nhi Truong Vu
Dan Guo
Ahmed Taha
Jason Su
Thomas P. Matthews
22
3
0
11 Aug 2023
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention
Liang Shang
Yanli Liu
Zhengyang Lou
Shuxue Quan
N. Adluru
Bochen Guan
W. Sethares
24
2
0
10 Aug 2023
All-pairs Consistency Learning for Weakly Supervised Semantic
  Segmentation
All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation
Weixuan Sun
Yanhao Zhang
Zhen Qin
Zheyuan Liu
Lin Cheng
Fanyi Wang
Yiran Zhong
Nick Barnes
ViT
41
4
0
08 Aug 2023
Communication-Efficient Framework for Distributed Image Semantic
  Wireless Transmission
Communication-Efficient Framework for Distributed Image Semantic Wireless Transmission
Bingyan Xie
Yongpeng Wu
Yuxuan Shi
Derrick Wing Kwan Ng
Wenjun Zhang
26
12
0
07 Aug 2023
DiT: Efficient Vision Transformers with Dynamic Token Routing
DiT: Efficient Vision Transformers with Dynamic Token Routing
Yuchen Ma
Zhengcong Fei
Junshi Huang
ViT
26
2
0
07 Aug 2023
Strategic Preys Make Acute Predators: Enhancing Camouflaged Object
  Detectors by Generating Camouflaged Objects
Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects
Chunming He
Kai Li
Yachao Zhang
Yulun Zhang
Z. Guo
Xiu Li
Martin Danelljan
F. I. F. Richard Yu
AAML
35
44
0
06 Aug 2023
FLatten Transformer: Vision Transformer using Focused Linear Attention
FLatten Transformer: Vision Transformer using Focused Linear Attention
Dongchen Han
Xuran Pan
Yizeng Han
Shiji Song
Gao Huang
23
155
0
01 Aug 2023
Diffusion Model for Camouflaged Object Detection
Diffusion Model for Camouflaged Object Detection
Zhe Chen
Rongrong Gao
Tian-Zhu Xiang
Fanzhao Lin
DiffM
29
19
0
01 Aug 2023
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yun-Qiu Lv
Yiran Zhong
Yuchao Dai
DiffM
43
28
0
31 Jul 2023
Validating polyp and instrument segmentation methods in colonoscopy
  through Medico 2020 and MedAI 2021 Challenges
Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges
Debesh Jha
Vanshali Sharma
Debapriya Banik
Debayan Bhattacharya
K. Roy
...
Sharib Ali
Michael A. Riegler
P. Halvorsen
Thomas de Lange
Ulas Bagci
30
1
0
30 Jul 2023
Pre-training Vision Transformers with Very Limited Synthesized Images
Pre-training Vision Transformers with Very Limited Synthesized Images
Ryo Nakamura1
Hirokatsu Kataoka
Sora Takashima
Edgar Josafat Martinez-Noriega
Rio Yokota
Nakamasa Inoue
34
7
0
27 Jul 2023
MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical
  Image Segmentation
MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical Image Segmentation
Liang Xu
Mingxi Chen
Yiyu Cheng
Pengfei Shao
Shuwei Shen
Peng Yao
Ronald X. Xu
ViT
32
0
0
27 Jul 2023
Adaptive Frequency Filters As Efficient Global Token Mixers
Adaptive Frequency Filters As Efficient Global Token Mixers
Zhipeng Huang
Zhizheng Zhang
Cuiling Lan
Zhengjun Zha
Yan Lu
B. Guo
30
37
0
26 Jul 2023
When Multi-Task Learning Meets Partial Supervision: A Computer Vision
  Review
When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review
Maxime Fontana
Michael W. Spratling
Miaojing Shi
47
6
0
25 Jul 2023
Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation
Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation
Jinxian Liu
Chen Ju
Chaofan Ma
Yanfeng Wang
Yu Wang
Ya-Qin Zhang
VOS
27
23
0
25 Jul 2023
COCO-O: A Benchmark for Object Detectors under Natural Distribution
  Shifts
COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts
Xiaofeng Mao
YueFeng Chen
Yao Zhu
Da Chen
Hang Su
Rong Zhang
H. Xue
ObjD
OOD
38
18
0
24 Jul 2023
A Good Student is Cooperative and Reliable: CNN-Transformer
  Collaborative Learning for Semantic Segmentation
A Good Student is Cooperative and Reliable: CNN-Transformer Collaborative Learning for Semantic Segmentation
Jinjing Zhu
Yuan Luo
Xueye Zheng
Hao Wang
Lin Wang
22
33
0
24 Jul 2023
SimCol3D -- 3D Reconstruction during Colonoscopy Challenge
SimCol3D -- 3D Reconstruction during Colonoscopy Challenge
A. Rau
Sophia Bano
Yueming Jin
P. Azagra
Javier Morlana
...
Mobarakol Islam
Hongliang Ren
Laurence B. Lovat
José M. M. Montiel
Danail Stoyanov
27
10
0
20 Jul 2023
WeakPolyp: You Only Look Bounding Box for Polyp Segmentation
WeakPolyp: You Only Look Bounding Box for Polyp Segmentation
JunChao Wei
Yiwen Hu
Shuguang Cui
S.Kevin Zhou
Zhen Li
34
18
0
20 Jul 2023
Meta-Transformer: A Unified Framework for Multimodal Learning
Meta-Transformer: A Unified Framework for Multimodal Learning
Yiyuan Zhang
Kaixiong Gong
Kaipeng Zhang
Hongsheng Li
Yu Qiao
Wanli Ouyang
Xiangyu Yue
24
137
0
20 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global
  Self-Attention
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
16
2
0
18 Jul 2023
Ord2Seq: Regarding Ordinal Regression as Label Sequence Prediction
Ord2Seq: Regarding Ordinal Regression as Label Sequence Prediction
Jinhong Wang
Yi Cheng
Jintai Chen
Tingting Chen
Danny Chen
Jian Wu
22
9
0
18 Jul 2023
Scale-Aware Modulation Meet Transformer
Scale-Aware Modulation Meet Transformer
Wei-Shiang Lin
Ziheng Wu
Jiayu Chen
Jun Huang
Lianwen Jin
MoE
ViT
27
66
0
17 Jul 2023
Study of Vision Transformers for Covid-19 Detection from Chest X-rays
Study of Vision Transformers for Covid-19 Detection from Chest X-rays
S. Angara
S. Thirunagaru
ViT
MedIm
24
1
0
17 Jul 2023
CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance
  Segmentation
CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation
Jialun Pei
Tao Jiang
He Tang
Nian Liu
Yueming Jin
Deng-Ping Fan
Pheng-Ann Heng
ISeg
30
8
0
16 Jul 2023
Efficient Convolution and Transformer-Based Network for Video Frame
  Interpolation
Efficient Convolution and Transformer-Based Network for Video Frame Interpolation
Issa Khalifeh
L. Murn
M. Mrak
E. Izquierdo
ViT
28
2
0
12 Jul 2023
HoughLaneNet: Lane Detection with Deep Hough Transform and Dynamic
  Convolution
HoughLaneNet: Lane Detection with Deep Hough Transform and Dynamic Convolution
Jia-Qi Zhang
Haoqi Duan
Jun-Long Chen
Ariel Shamir
Miao Wang
35
16
0
07 Jul 2023
AVSegFormer: Audio-Visual Segmentation with Transformer
AVSegFormer: Audio-Visual Segmentation with Transformer
Sheng Gao
Zhe Chen
Guo Chen
Wenhai Wang
Tong Lu
VOS
34
46
0
03 Jul 2023
HODINet: High-Order Discrepant Interaction Network for RGB-D Salient
  Object Detection
HODINet: High-Order Discrepant Interaction Network for RGB-D Salient Object Detection
Kang Yi
Jing Xu
Xiao Jin
Fu-Bin Guo
Yanfeng Wu
26
0
0
03 Jul 2023
Previous
123...567...101112
Next