ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.12058
  4. Cited By
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
v1v2 (latest)

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

28 January 2023
Liya Wang
A. Tien
ArXiv (abs)PDFHTML

Papers citing "Aerial Image Object Detection With Vision Transformer Detector (ViTDet)"

44 / 144 papers shown
Title
ContrastMask: Contrastive Learning to Segment Every Thing
ContrastMask: Contrastive Learning to Segment Every Thing
Xuehui Wang
Kai Zhao
Ruixin Zhang
Shouhong Ding
Yan Wang
Wei Shen
ISeg
91
37
0
18 Mar 2022
Masked Autoencoders for Point Cloud Self-supervised Learning
Masked Autoencoders for Point Cloud Self-supervised Learning
Yatian Pang
Wenxiao Wang
Francis E. H. Tay
Wen Liu
Yonghong Tian
Liuliang Yuan
3DPCViT
111
477
0
13 Mar 2022
MVP: Multimodality-guided Visual Pre-training
MVP: Multimodality-guided Visual Pre-training
Longhui Wei
Lingxi Xie
Wen-gang Zhou
Houqiang Li
Qi Tian
58
107
0
10 Mar 2022
Graph Masked Autoencoders with Transformers
Graph Masked Autoencoders with Transformers
Sixiao Zhang
Hongxu Chen
Haoran Yang
Xiangguo Sun
Philip S. Yu
Guandong Xu
53
18
0
17 Feb 2022
A Unified Framework for Masked and Mask-Free Face Recognition via
  Feature Rectification
A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification
Shaozhe Hao
Chaofeng Chen
Zhenfang Chen
Kwan-Yee K. Wong
CVBM
28
6
0
15 Feb 2022
MaskGIT: Masked Generative Image Transformer
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
153
695
0
08 Feb 2022
How to Understand Masked Autoencoders
How to Understand Masked Autoencoders
Shuhao Cao
Peng Xu
David Clifton
78
42
0
08 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech,
  Vision and Language
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSLVLMViT
105
859
0
07 Feb 2022
Context Autoencoder for Self-Supervised Representation Learning
Context Autoencoder for Self-Supervised Representation Learning
Xiaokang Chen
Mingyu Ding
Xiaodi Wang
Ying Xin
Shentong Mo
Yunhao Wang
Shumin Han
Ping Luo
Gang Zeng
Jingdong Wang
SSL
127
396
0
07 Feb 2022
Adversarial Masking for Self-Supervised Learning
Adversarial Masking for Self-Supervised Learning
Yuge Shi
N. Siddharth
Philip Torr
Adam R. Kosiorek
SSL
134
86
0
31 Jan 2022
Mask-based Latent Reconstruction for Reinforcement Learning
Mask-based Latent Reconstruction for Reinforcement Learning
Tao Yu
Zhizheng Zhang
Cuiling Lan
Yan Lu
Zhibo Chen
80
45
0
28 Jan 2022
Time Series Generation with Masked Autoencoder
Time Series Generation with Masked Autoencoder
Meng-yue Zha
SiuTim Wong
Mengqi Liu
Tong Zhang
Kani Chen
SyDaAI4TS
50
17
0
14 Jan 2022
MGAE: Masked Autoencoders for Self-Supervised Learning on Graphs
MGAE: Masked Autoencoders for Self-Supervised Learning on Graphs
Qiaoyu Tan
Ninghao Liu
Xiao Shi Huang
Rui Chen
Soo-Hyun Choi
Helen Zhou
SSL
70
41
0
07 Jan 2022
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Masked Feature Prediction for Self-Supervised Visual Pre-Training
Chen Wei
Haoqi Fan
Saining Xie
Chaoxia Wu
Alan Yuille
Christoph Feichtenhofer
ViT
152
670
0
16 Dec 2021
BEVT: BERT Pretraining of Video Transformers
BEVT: BERT Pretraining of Video Transformers
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Yu-Gang Jiang
Luowei Zhou
Lu Yuan
ViT
89
209
0
02 Dec 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point
  Modeling
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
142
685
0
29 Nov 2021
Mask Transfiner for High-Quality Instance Segmentation
Mask Transfiner for High-Quality Instance Segmentation
Lei Ke
Martin Danelljan
Xia Li
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
ISeg
64
117
0
26 Nov 2021
SimMIM: A Simple Framework for Masked Image Modeling
SimMIM: A Simple Framework for Masked Image Modeling
Zhenda Xie
Zheng Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
215
1,363
0
18 Nov 2021
iBOT: Image BERT Pre-Training with Online Tokenizer
iBOT: Image BERT Pre-Training with Online Tokenizer
Jinghao Zhou
Chen Wei
Huiyu Wang
Wei Shen
Cihang Xie
Alan Yuille
Tao Kong
88
742
0
15 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViTTPM
477
7,827
0
11 Nov 2021
MLIM: Vision-and-Language Model Pre-training with Masked Language and
  Image Modeling
MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling
Tarik Arici
M. S. Seyfioglu
T. Neiman
Yi Tian Xu
Son N. Tran
Trishul Chilimbi
Belinda Zeng
Ismail B. Tutar
VLM
49
15
0
24 Sep 2021
Oriented R-CNN for Object Detection
Oriented R-CNN for Object Detection
Xingxing Xie
Gong Cheng
Jiabao Wang
Xiwen Yao
Junwei Han
ObjD
176
701
0
12 Aug 2021
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive
  Learning
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning
Hao Tan
Jie Lei
Thomas Wolf
Joey Tianyi Zhou
102
66
0
21 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
292
2,845
0
15 Jun 2021
MST: Masked Self-Supervised Transformer for Visual Representation
MST: Masked Self-Supervised Transformer for Visual Representation
Zhaowen Li
Zhiyang Chen
Fan Yang
Wei Li
Yousong Zhu
...
Rui Deng
Liwei Wu
Rui Zhao
Ming Tang
Jinqiao Wang
ViT
89
167
0
10 Jun 2021
Learning High-Precision Bounding Box for Rotated Object Detection via
  Kullback-Leibler Divergence
Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
Xue Yang
Xiaojiang Yang
Jirui Yang
Qi Ming
Wentao Wang
Qi Tian
Junchi Yan
107
389
0
03 Jun 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
467
21,603
0
25 Mar 2021
ReDet: A Rotation-equivariant Detector for Aerial Object Detection
ReDet: A Rotation-equivariant Detector for Aerial Object Detection
Jiaming Han
Jian Ding
Nan Xue
Guisong Xia
91
539
0
13 Mar 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
679
41,483
0
22 Oct 2020
RarePlanes: Synthetic Data Takes Flight
RarePlanes: Synthetic Data Takes Flight
Jacob Shermeyer
T. Hossler
A. V. Etten
Daniel Hogan
Ryan Lewis
Daeil Kim
65
106
0
04 Jun 2020
Learning RoI Transformer for Detecting Oriented Objects in Aerial Images
Learning RoI Transformer for Detecting Oriented Objects in Aerial Images
Jian Ding
Nan Xue
Yang Long
Gui-Song Xia
Qikai Lu
77
171
0
01 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,229
0
11 Oct 2018
Deep Learning for Generic Object Detection: A Survey
Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
M. Pietikäinen
ObjDVLMOOD
177
2,459
0
06 Sep 2018
Cascade R-CNN: Delving into High Quality Object Detection
Cascade R-CNN: Delving into High Quality Object Detection
Zhaowei Cai
Nuno Vasconcelos
ObjD
149
4,943
0
03 Dec 2017
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
Gui-Song Xia
X. Bai
Jian Ding
Zhen Zhu
Serge J. Belongie
Jiebo Luo
Mihai Datcu
Marcello Pelillo
Liangpei Zhang
ObjD
127
2,186
0
28 Nov 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
803
132,454
0
12 Jun 2017
Mask R-CNN
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
369
27,253
0
20 Mar 2017
Feature Pyramid Networks for Object Detection
Feature Pyramid Networks for Object Detection
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
488
22,158
0
09 Dec 2016
Fully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOSSSeg
750
37,895
0
20 May 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,510
0
10 Dec 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMatObjD
533
62,409
0
04 Jun 2015
Fast R-CNN
Fast R-CNN
Ross B. Girshick
ObjD
312
25,087
0
30 Apr 2015
MADE: Masked Autoencoder for Distribution Estimation
MADE: Masked Autoencoder for Distribution Estimation
M. Germain
Karol Gregor
Iain Murray
Hugo Larochelle
OODSyDaUQCV
187
874
0
12 Feb 2015
Rich feature hierarchies for accurate object detection and semantic
  segmentation
Rich feature hierarchies for accurate object detection and semantic segmentation
Ross B. Girshick
Jeff Donahue
Trevor Darrell
Jitendra Malik
ObjD
291
26,223
0
11 Nov 2013
Previous
123