ResearchTrend.AI
End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,279 papers shown
Title
Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery
Neelanjan Bhowmik
Jack W. Barker
Yona Falinie A. Gaus
T. Breckon
39
14
0
16 May 2022
Transformers in 3D Point Clouds: A Survey
Dening Lu
Qian Xie
Mingqiang Wei
Kyle Gao
Linlin Xu
Jonathan Li
3DPC
ViT
37
49
0
16 May 2022
Video Frame Interpolation with Transformer
Liying Lu
Ruizheng Wu
Huaijia Lin
Jiangbo Lu
Jiaya Jia
ViT
45
4
0
15 May 2022
Dense residual Transformer for image denoising
Chao Yao
Shuo Jin
Meiqin Liu
Xiaojuan Ban
ViT
51
29
0
14 May 2022
Simple Open-Vocabulary Object Detection with Vision Transformers
Matthias Minderer
A. Gritsenko
Austin Stone
Maxim Neumann
Dirk Weissenborn
...
Zhuoran Shen
Tianlin Li
Xiaohua Zhai
Thomas Kipf
N. Houlsby
ObjD
CLIP
VLM
ViT
OCL
36
307
0
12 May 2022
Group R-CNN for Weakly Semi-supervised Object Detection with Points
Shilong Zhang
Zhuoran Yu
Liyang Liu
Xinjiang Wang
Aojun Zhou
Kaibing Chen
20
45
0
12 May 2022
MEWS: Real-time Social Media Manipulation Detection and Analysis
Trenton W. Ford
William Theisen
Michael Yankoski
Tom Henry
Farah Khashman
Katherine R. Dearstyne
Tim Weninger
22
0
0
11 May 2022
An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers
Gokul Karthik Kumar
Sahal Shaji Mullappilly
Abhishek Singh Gehlot
ViT
31
1
0
11 May 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
Qiankun Liu
Zhentao Tan
Dongdong Chen
Qi Chu
Xiyang Dai
Yinpeng Chen
Mengchen Liu
Lu Yuan
Nenghai Yu
ViT
31
70
0
10 May 2022
Activating More Pixels in Image Super-Resolution Transformer
Xiangyu Chen
Xintao Wang
Jiantao Zhou
Yu Qiao
Chao Dong
ViT
86
603
0
09 May 2022
Siamese Object Tracking for Unmanned Aerial Vehicle: A Review and Comprehensive Analysis
Changhong Fu
Kunhan Lu
Guang-Zheng Zheng
Junjie Ye
Ziang Cao
Bowen Li
Geng Lu
32
55
0
09 May 2022
Beyond Bounding Box: Multimodal Knowledge Learning for Object Detection
Wei Feng
Xingyuan Bu
Chenchen Zhang
Xubin Li
VLM
12
4
0
09 May 2022
Incremental-DETR: Incremental Few-Shot Object Detection via Self-Supervised Learning
Na Dong
Yongqiang Zhang
Mingli Ding
G. Lee
CLL
45
29
0
09 May 2022
ConvMAE: Masked Convolution Meets Masked Autoencoders
Peng Gao
Teli Ma
Hongsheng Li
Ziyi Lin
Jifeng Dai
Yu Qiao
ViT
19
122
0
08 May 2022
Transformer Tracking with Cyclic Shifting Window Attention
Zikai Song
Junqing Yu
Yi-Ping Phoebe Chen
Wei Yang
ViT
27
123
0
08 May 2022
SparseTT: Visual Tracking with Sparse Transformers
Z. Fu
Zehua Fu
Qingjie Liu
Wenrui Cai
Yunhong Wang
ViT
27
121
0
08 May 2022
YOLOPose: Transformer-based Multi-Object 6D Pose Estimation using Keypoint Regression
Arash A. Amini
Arul Selvam Periyasamy
Sven Behnke
ViT
23
32
0
05 May 2022
P3IV: Probabilistic Procedure Planning from Instructional Videos with Weak Supervision
Henghui Zhao
Isma Hadji
Nikita Dvornik
Konstantinos G. Derpanis
Richard P. Wildes
Allan D. Jepson
36
45
0
04 May 2022
Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification
Haowei Zhu
Wenjing Ke
Dong Li
Ji Liu
Lu Tian
Yi Shan
31
134
0
04 May 2022
An Analysis of Generative Methods for Multiple Image Inpainting
C. Ballester
Aurélie Bugeau
Samuel Hurault
S. Parisotto
Patricia Vitoria
16
3
0
04 May 2022
Dynamic Sparse R-CNN
Qinghang Hong
Fengming Liu
Dong Li
Ji Liu
Lu Tian
Yi Shan
ObjD
31
28
0
04 May 2022
Application of belief functions to medical image segmentation: A review
Ling Huang
S. Ruan
Thierry Denoeux
EDL
MedIm
32
30
0
03 May 2022
Cross-modal Representation Learning for Zero-shot Action Recognition
Chung-Ching Lin
Kevin Qinghong Lin
Linjie Li
Lijuan Wang
Zicheng Liu
ViT
16
29
0
03 May 2022
MTTrans: Cross-Domain Object Detection with Mean-Teacher Transformer
Jinze Yu
Jiaming Liu
Xi Wei
Haoyi Zhou
Yohei Nakata
Denis A. Gudovskiy
Tomoyuki Okuno
Jianxin Li
Kurt Keutzer
Shanghang Zhang
ViT
19
48
0
03 May 2022
Multimodal Detection of Unknown Objects on Roads for Autonomous Driving
Daniel Bogdoll
Enrico Eisen
Maximilian Nitsche
Christin Scheib
J. Marius Zöllner
20
12
0
03 May 2022
Cross Domain Object Detection by Target-Perceived Dual Branch Distillation
Meng He
Yali Wang
Jiaxi Wu
Yiru Wang
Hanqing Li
Bo-wen Li
Weihao Gan
Wei Wu
Yu Qiao
39
69
0
03 May 2022
Detection Recovery in Online Multi-Object Tracking with Sparse Graph Tracker
Jeongseok Hyun
Myunggu Kang
Dongyoon Wee
Dit-Yan Yeung
VOT
28
37
0
02 May 2022
Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering
A. Piergiovanni
Wei Li
Weicheng Kuo
M. Saffar
Fred Bertsch
A. Angelova
17
16
0
02 May 2022
MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries
Tianyuan Zhang
Xuanyao Chen
Yue Wang
Yilun Wang
Hang Zhao
32
82
0
02 May 2022
COUCH: Towards Controllable Human-Chair Interactions
Xiaohan Zhang
Bharat Lal Bhatnagar
V. Guzov
Sebastian Starke
Gerard Pons-Moll
59
97
0
01 May 2022
Continual Learning with Foundation Models: An Empirical Study of Latent Replay
O. Ostapenko
Timothée Lesort
P. Rodríguez
Md Rifat Arefin
Arthur Douillard
Irina Rish
Laurent Charlin
39
52
0
30 Apr 2022
Composition-aware Graphic Layout GAN for Visual-textual Presentation Designs
Min Zhou
Chenchen Xu
Ye Ma
T. Ge
Yuning Jiang
Weiwei Xu
19
51
0
30 Apr 2022
Dynamic Curriculum Learning for Great Ape Detection in the Wild
Xinyu Yang
T. Burghardt
Majid Mirmehdi
35
14
0
30 Apr 2022
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning
Li Yang
Yan Xu
Chunfen Yuan
Wei Liu
Bing Li
Weiming Hu
ObjD
52
113
0
30 Apr 2022
Coarse-to-Fine Video Denoising with Dual-Stage Spatial-Channel Transformer
Wu Yun
Mengshi Qi
Chuanming Wang
Huiyuan Fu
Huadong Ma
ViT
15
6
0
30 Apr 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
51
3,369
0
29 Apr 2022
Improving Transferability for Domain Adaptive Detection Transformers
Kaixiong Gong
Shuang Li
Shugang Li
Rui Zhang
Chi Harold Liu
Qiang Chen
62
34
0
29 Apr 2022
SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation
Chang Shu
Zi-Chun Chen
Lei Chen
Kuan Ma
Minghui Wang
Haibing Ren
ViT
32
14
0
29 Apr 2022
Where in the World is this Image? Transformer-based Geo-localization in the Wild
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
21
30
0
29 Apr 2022
One Model to Synthesize Them All: Multi-contrast Multi-scale Transformer for Missing Data Imputation
Jiang Liu
Srivathsa Pasumarthi
B. Duffy
Enhao Gong
Keshav Datta
Greg Zaharchuk
ViT
MedIm
24
56
0
28 Apr 2022
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy
Zhiyuan Fang
Yezhou Yang
35
2
0
28 Apr 2022
Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation
Jianrong Zhang
Tianyi Wu
Chuan-Yong Ding
Hongwei Zhao
Guodong Guo
ISeg
38
15
0
28 Apr 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
49
150
0
27 Apr 2022
CapOnImage: Context-driven Dense-Captioning on Image
Yiqi Gao
Xinglin Hou
Yuanmeng Zhang
T. Ge
Yuning Jiang
Peifeng Wang
33
10
0
27 Apr 2022
CATrans: Context and Affinity Transformer for Few-Shot Segmentation
Shan Zhang
Tianyi Wu
Sitong Wu
Guodong Guo
ViT
40
19
0
27 Apr 2022
The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild
Spyridon Baxevanakis
Giorgos Kordopatis-Zilos
Panagiotis Galopoulos
Lazaros Apostolidis
Killian Levacher
Ipek B. Schlicht
Denis Teyssou
I. Kompatsiaris
Symeon Papadopoulos
47
8
0
27 Apr 2022
A Multi-Head Convolutional Neural Network With Multi-path Attention improves Image Denoising
Jiahong Zhang
Meijun Qu
Ye Wang
Lihong Cao
16
6
0
27 Apr 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
30
515
0
26 Apr 2022
Understanding The Robustness in Vision Transformers
Daquan Zhou
Zhiding Yu
Enze Xie
Chaowei Xiao
Anima Anandkumar
Jiashi Feng
J. Álvarez
ViT
22
185
0
26 Apr 2022
A survey on attention mechanisms for medical applications: are we moving towards better algorithms?
Tiago Gonçalves
Isabel Rio-Torto
Luís F. Teixeira
J. S. Cardoso
OOD
MedIm
37
36
0
26 Apr 2022