ResearchTrend.AI
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

28 January 2023
Liya Wang
A. Tien

Papers citing "Aerial Image Object Detection With Vision Transformer Detector (ViTDet)"

50 / 144 papers shown
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Sanghyun Woo
Shoubhik Debnath
Ronghang Hu
Xinlei Chen
Zhuang Liu
In So Kweon
Saining Xie
SyDa
154
811
0
02 Jan 2023
What's Behind the Mask: Estimating Uncertainty in Image-to-Image Problems
Gilad Kutiel
Regev Cohen
Michael Elad
Daniel Freedman
UQCV
93
5
0
28 Nov 2022
MAEDAY: MAE for few and zero shot AnomalY-Detection
Eli Schwartz
Assaf Arbelle
Leonid Karlinsky
Sivan Harary
Florian Scheidegger
Sivan Doveh
Raja Giryes
ViT, UQCV
63
36
0
25 Nov 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
83
48
0
23 Nov 2022
Contrastive Masked Autoencoders for Self-Supervised Video Hashing
Yuting Wang
Jinpeng Wang
Bin Chen
Ziyun Zeng
Shutao Xia
49
22
0
21 Nov 2022
AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
W. G. C. Bandara
Naman Patel
A. Gholami
Mehdi Nikkhah
M. Agrawal
Vishal M. Patel
53
43
0
16 Nov 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
78
169
0
16 Nov 2022
Stare at What You See: Masked Image Modeling without Reconstruction
Hongwei Xue
Peng Gao
Hongyang Li
Yu Qiao
Hao Sun
Houqiang Li
Jiebo Luo
62
32
0
16 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM, CLIP
193
725
0
14 Nov 2022
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual Anomaly Detector
Haiming Yao
Xue Wang
Wenyong Yu
81
9
0
01 Nov 2022
MAEEG: Masked Auto-encoder for EEG Representation Learning
H. Chien
Hanlin Goh
Christopher M. Sandino
Joseph Y. Cheng
54
49
0
27 Oct 2022
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
78
33
0
26 Oct 2022
i-MAE: Are Latent Representations in Masked Autoencoders Linearly Separable?
Kevin Zhang
Zhiqiang Shen
52
8
0
20 Oct 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
145
511
0
20 Oct 2022
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
115
38
0
19 Oct 2022
How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders
Qi Zhang
Yifei Wang
Yisen Wang
65
76
0
15 Oct 2022
MOVE: Unsupervised Movable Object Segmentation and Detection
Adam Bielski
Paolo Favaro
OCL
52
21
0
14 Oct 2022
Exploring Long-Sequence Masked Autoencoders
Ronghang Hu
Shoubhik Debnath
Saining Xie
Xinlei Chen
47
18
0
13 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
132
9
0
11 Oct 2022
Ensemble Learning using Transformers and Convolutional Networks for Masked Face Recognition
Mohammed R. Al-Sinan
Aseel F. Haneef
H. Luqman
74
2
0
10 Oct 2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
Huanjin Yao
Yi Jiang
Xiatian Zhu
Zehuan Yuan
45
20
0
09 Oct 2022
Real-World Robot Learning with Masked Visual Pre-training
Ilija Radosavovic
Tete Xiao
Stephen James
Pieter Abbeel
Jitendra Malik
Trevor Darrell
SSL
235
254
0
06 Oct 2022
Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders
Youngwan Lee
Jeffrey Willette
Jonghee Kim
Juho Lee
Sung Ju Hwang
93
16
0
05 Oct 2022
Contrastive Audio-Visual Masked Autoencoder
Yuan Gong
Andrew Rouditchenko
Alexander H. Liu
David Harwath
Leonid Karlinsky
Hilde Kuehne
James R. Glass
91
128
0
02 Oct 2022
MaskTune: Mitigating Spurious Correlations by Forcing to Explore
Saeid Asgari Taghanaki
Aliasghar Khani
Fereshte Khani
A. Gholami
Linh-Tam Tran
Ali Mahdavi-Amiri
Ghassan Hamarneh
AAML
86
48
0
30 Sep 2022
Self-Distillation for Further Pre-training of Transformers
Seanie Lee
Minki Kang
Juho Lee
Sung Ju Hwang
Kenji Kawaguchi
91
8
0
30 Sep 2022
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection
Neelu Madan
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Kamal Nasrollahi
Fahad Shahbaz Khan
T. Moeslund
M. Shah
ViT, MedIm
325
70
0
25 Sep 2022
NamedMask: Distilling Segmenters from Complementary Foundation Models
Gyungin Shin
Weidi Xie
Samuel Albanie
ISeg, VLM
101
23
0
22 Sep 2022
MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning
Jiangmeng Li
Jingyao Wang
Yanan Zhang
Wenyi Mo
Changwen Zheng
Fuchun Sun
Hui Xiong
SSL
99
14
0
16 Sep 2022
Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations
Yilun Hao
Ruinan Wang
Zhangjie Cao
Zihan Wang
Yuchen Cui
Dorsa Sadigh
77
2
0
16 Sep 2022
Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training
Zhihong Chen
Yu Du
Jinpeng Hu
Yang Liu
Guanbin Li
Xiang Wan
Tsung-Hui Chang
143
118
0
15 Sep 2022
Exploring Target Representations for Masked Autoencoders
Xingbin Liu
Jinghao Zhou
Tao Kong
Xianming Lin
Rongrong Ji
172
51
0
08 Sep 2022
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
William Yang Wang
Lijuan Wang
Zicheng Liu
VLM
83
65
0
04 Sep 2022
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment
Mustafa Shukor
Guillaume Couairon
Matthieu Cord
VLM, CLIP
65
27
0
29 Aug 2022
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
Xiaoyi Dong
Jianmin Bao
Yinglin Zheng
Ting Zhang
Dongdong Chen
...
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
CLIP, VLM
88
167
0
25 Aug 2022
Masked Autoencoders Enable Efficient Knowledge Distillers
Yutong Bai
Zeyu Wang
Junfei Xiao
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
CLL
86
43
0
25 Aug 2022
Heterogeneous Graph Masked Autoencoders
Yijun Tian
Kaiwen Dong
Chunhui Zhang
Chuxu Zhang
Nitesh Chawla
116
78
0
21 Aug 2022
VLMAE: Vision-Language Masked Autoencoder
Su He
Taian Guo
Tao Dai
Ruizhi Qiao
Chen Wu
Xiujun Shu
Bohan Ren
VLM
74
11
0
19 Aug 2022
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
69
320
0
12 Aug 2022
MILAN: Masked Image Pretraining on Language Assisted Representation
Zejiang Hou
Fei Sun
Yen-kuang Chen
Yuan Xie
S. Kung
ViT
85
68
0
11 Aug 2022
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature
Xiangwen Kong
Xiangyu Zhang
SSL
63
54
0
08 Aug 2022
Masked Vision and Language Modeling for Multi-modal Representation Learning
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
75
68
0
03 Aug 2022
SdAE: Self-distillated Masked Autoencoder
Yabo Chen
Yuchen Liu
Dongsheng Jiang
Xiaopeng Zhang
Wenrui Dai
H. Xiong
Qi Tian
ViT
79
73
0
31 Jul 2022
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling
Yiran Wang
Zhiyu Pan
Xingyi Li
Zhiguo Cao
Ke Xian
Jianming Zhang
66
29
0
31 Jul 2022
A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond
Chaoning Zhang
Chenshuang Zhang
Junha Song
John Seon Keun Yi
Kang Zhang
In So Kweon
SSL
84
77
0
30 Jul 2022
Contrastive Masked Autoencoders are Stronger Vision Learners
Zhicheng Huang
Xiaojie Jin
Cheng Lu
Qibin Hou
Ming-Ming Cheng
Dongmei Fu
Xiaohui Shen
Jiashi Feng
121
153
0
27 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
93
45
0
24 Jul 2022
MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis
Yaqian Liang
Shanshan Zhao
Baosheng Yu
Jing Zhang
Fazhi He
ViT
66
39
0
20 Jul 2022
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery
Yezhen Cong
Samarth Khanna
Chenlin Meng
Patrick Liu
Erik Rozi
Yutong He
Marshall Burke
David B. Lobell
Stefano Ermon
ViT
80
275
0
17 Jul 2022
A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion
Junkun Jiang
Jie Chen
Yike Guo
3DH
49
10
0
15 Jul 2022