ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,259 papers shown
Title
Unitail: Detecting, Reading, and Matching in Retail Scene
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
42
9
0
01 Apr 2022
Semi-Weakly Supervised Object Detection by Sampling Pseudo Ground-Truth
  Boxes
Semi-Weakly Supervised Object Detection by Sampling Pseudo Ground-Truth Boxes
Akhil Meethal
M. Pedersoli
Zhongwen Zhu
Francisco Perdigon Romero
Eric Granger
11
4
0
01 Apr 2022
Universal Lymph Node Detection in T2 MRI using Neural Networks
Universal Lymph Node Detection in T2 MRI using Neural Networks
T. Mathai
Sungwon Lee
Thomas C. Shen
Zhi-hua Lu
Ronald M. Summers
MedIm
20
8
0
31 Mar 2022
FindIt: Generalized Localization with Natural Language Queries
FindIt: Generalized Localization with Natural Language Queries
Weicheng Kuo
Fred Bertsch
Wei Li
A. Piergiovanni
M. Saffar
A. Angelova
ObjD
19
17
0
31 Mar 2022
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera
  Images via Spatiotemporal Transformers
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
75
1,254
0
31 Mar 2022
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable
  Facial Editing
TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing
Yanbo Xu
Yueqin Yin
Liming Jiang
Qianyi Wu
Chengyao Zheng
Chen Change Loy
Bo Dai
Wayne Wu
38
53
0
31 Mar 2022
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection
Junjie Huang
Guan Huang
27
336
0
31 Mar 2022
Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks
Few-Shot Class-Incremental Learning by Sampling Multi-Phase Tasks
Da-Wei Zhou
Han-Jia Ye
Liang Ma
Di Xie
Shiliang Pu
De-Chuan Zhan
CLL
19
119
0
31 Mar 2022
Human Instance Segmentation and Tracking via Data Association and
  Single-stage Detector
Human Instance Segmentation and Tracking via Data Association and Single-stage Detector
Lu Cheng
Mingde Zhao
28
0
0
31 Mar 2022
CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
Xiuchao Sui
Shaohua Li
Xue Geng
Yan Wu
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
Erik Cambria
ViT
37
95
0
31 Mar 2022
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
ReSTR: Convolution-free Referring Image Segmentation Using Transformers
N. Kim
Dongwon Kim
Cuiling Lan
Wenjun Zeng
Suha Kwak
30
137
0
31 Mar 2022
MeMOT: Multi-Object Tracking with Memory
MeMOT: Multi-Object Tracking with Memory
Jiarui Cai
Mingze Xu
Wei Li
Yuanjun Xiong
Wei Xia
Zhuowen Tu
Stefano Soatto
VOT
36
148
0
31 Mar 2022
TR-MOT: Multi-Object Tracking by Reference
TR-MOT: Multi-Object Tracking by Reference
Mingfei Chen
Yue Liao
Si Liu
Fei Wang
Lei Li
VOT
59
9
0
30 Mar 2022
Exploring Plain Vision Transformer Backbones for Object Detection
Exploring Plain Vision Transformer Backbones for Object Detection
Yanghao Li
Hanzi Mao
Ross B. Girshick
Kaiming He
ViT
36
779
0
30 Mar 2022
Collaborative Transformers for Grounded Situation Recognition
Collaborative Transformers for Grounded Situation Recognition
Junhyeong Cho
Youngseok Yoon
Suha Kwak
ViT
27
25
0
30 Mar 2022
AdaMixer: A Fast-Converging Query-Based Object Detector
AdaMixer: A Fast-Converging Query-Based Object Detector
Ziteng Gao
Limin Wang
Bing Han
Sheng Guo
ObjD
41
106
0
30 Mar 2022
Forensic Analysis and Localization of Multiply Compressed MP3 Audio
  Using Transformers
Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers
Ziyue Xiang
Paolo Bestagini
Stefano Tubaro
Edward J. Delp
36
10
0
30 Mar 2022
TubeDETR: Spatio-Temporal Video Grounding with Transformers
TubeDETR: Spatio-Temporal Video Grounding with Transformers
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
30
94
0
30 Mar 2022
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
InstaFormer: Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim
Jongbeom Baek
Jihye Park
Gyeongnyeon Kim
Seung Wook Kim
ViT
39
47
0
30 Mar 2022
FlowFormer: A Transformer Architecture for Optical Flow
FlowFormer: A Transformer Architecture for Optical Flow
Zhaoyang Huang
Xiaoyu Shi
Chao Zhang
Qiang Wang
Ka Chun Cheung
Hongwei Qin
Jifeng Dai
Hongsheng Li
ViT
35
270
0
30 Mar 2022
Global Tracking via Ensemble of Local Trackers
Global Tracking via Ensemble of Local Trackers
Zikun Zhou
Jianqiu Chen
Wenjie Pei
Kaige Mao
Hongpeng Wang
Zhenyu He
15
31
0
30 Mar 2022
Omni-DETR: Omni-Supervised Object Detection with Transformers
Omni-DETR: Omni-Supervised Object Detection with Transformers
Pei Wang
Zhaowei Cai
Hao Yang
Gurumurthy Swaminathan
Nuno Vasconcelos
Bernt Schiele
Stefano Soatto
38
40
0
30 Mar 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting
  Vision-Language Transformers
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
Estelle Aflalo
Meng Du
Shao-Yen Tseng
Yongfei Liu
Chenfei Wu
Nan Duan
Vasudev Lal
36
45
0
30 Mar 2022
VPTR: Efficient Transformers for Video Prediction
VPTR: Efficient Transformers for Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
32
18
0
29 Mar 2022
MAP-Gen: An Automated 3D-Box Annotation Flow with Multimodal Attention
  Point Generator
MAP-Gen: An Automated 3D-Box Annotation Flow with Multimodal Attention Point Generator
Chang Liu
Xiaoyan Qian
Xiaojuan Qi
E. Lam
Siew-Chong Tan
Ngai Wong
3DPC
31
11
0
29 Mar 2022
MatteFormer: Transformer-Based Image Matting via Prior-Tokens
MatteFormer: Transformer-Based Image Matting via Prior-Tokens
Gyutae Park
S. Son
Jaeyoung Yoo
Seho Kim
Nojun Kwak
ViT
30
65
0
29 Mar 2022
SepViT: Separable Vision Transformer
SepViT: Separable Vision Transformer
Wei Li
Xing Wang
Xin Xia
Jie Wu
Jiashi Li
Xuefeng Xiao
Min Zheng
Shiping Wen
ViT
26
40
0
29 Mar 2022
Domain Invariant Siamese Attention Mask for Small Object Change
  Detection via Everyday Indoor Robot Navigation
Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation
Koji Takeda
Kanji Tanaka
Yoshimasa Nakamura
3DPC
25
3
0
29 Mar 2022
SIOD: Single Instance Annotated Per Category Per Image for Object
  Detection
SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Hanjun Li
Xingjia Pan
Ke Yan
Fan Tang
Weihao Zheng
25
18
0
29 Mar 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional
  Filters
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
Paul Gavrikov
J. Keuper
AAML
24
31
0
29 Mar 2022
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
In-N-Out Generative Learning for Dense Unsupervised Video Segmentation
Xiaomiao Pan
Peike Li
Zongxin Yang
Huiling Zhou
Chang Zhou
Hongxia Yang
Jingren Zhou
Yi Yang
VOS
40
11
0
29 Mar 2022
Hybrid Routing Transformer for Zero-Shot Learning
Hybrid Routing Transformer for Zero-Shot Learning
Dezhong Cheng
Gerong Wang
Boshen Wang
Qiang Zhang
Jungong Han
Dingwen Zhang
29
69
0
29 Mar 2022
Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene
  Segmentation
Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Yueming Jin
Yang Yu
Cheng Chen
Zixu Zhao
Pheng-Ann Heng
Danail Stoyanov
37
39
0
29 Mar 2022
Parameter-efficient Model Adaptation for Vision Transformers
Parameter-efficient Model Adaptation for Vision Transformers
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Junfeng Fang
30
84
0
29 Mar 2022
Few Could Be Better Than All: Feature Sampling and Grouping for Scene
  Text Detection
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection
J. Tang
Wenqing Zhang
Hong-yi Liu
Mingkun Yang
Bo Jiang
Guan-Nan Hu
Xiang Bai
ViT
19
66
0
29 Mar 2022
Unified Transformer Tracker for Object Tracking
Unified Transformer Tracker for Object Tracking
Fan Ma
Mike Zheng Shou
Linchao Zhu
Haoqi Fan
Yilei Xu
Yi Yang
Zhicheng Yan
VOT
33
79
0
29 Mar 2022
CAT-Net: A Cross-Slice Attention Transformer Model for Prostate Zonal
  Segmentation in MRI
CAT-Net: A Cross-Slice Attention Transformer Model for Prostate Zonal Segmentation in MRI
A. Hung
Haoxin Zheng
Qi Miao
S. Raman
D. Terzopoulos
Kyunghyun Sung
ViT
MedIm
38
44
0
29 Mar 2022
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Shangbang Long
Siyang Qin
Dmitry Panteleev
Alessandro Bissacco
Yasuhisa Fujii
Michalis Raptis
41
89
0
28 Mar 2022
Rethinking Semantic Segmentation: A Prototype View
Rethinking Semantic Segmentation: A Prototype View
Tianfei Zhou
Wenguan Wang
E. Konukoglu
Luc Van Gool
SSeg
34
260
0
28 Mar 2022
Frame-wise Action Representations for Long Videos via Sequence
  Contrastive Learning
Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning
Minghao Chen
Fangyun Wei
Chong Li
Deng Cai
AI4TS
35
33
0
28 Mar 2022
Expanding Low-Density Latent Regions for Open-Set Object Detection
Expanding Low-Density Latent Regions for Open-Set Object Detection
Jiaming Han
Yuqiang Ren
Jian Ding
Xingjia Pan
Ke Yan
Guisong Xia
ObjD
40
59
0
28 Mar 2022
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction
  Detection
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Bumsoo Kim
Jonghwan Mun
Kyoung-Woon On
Minchul Shin
Junhyun Lee
Eun-Sol Kim
41
51
0
28 Mar 2022
ObjectFormer for Image Manipulation Detection and Localization
ObjectFormer for Image Manipulation Detection and Localization
Junke Wang
Zuxuan Wu
Jingjing Chen
Xintong Han
Abhinav Shrivastava
Ser-Nam Lim
Yu-Gang Jiang
39
108
0
28 Mar 2022
HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network
HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network
J. Park
Y. Oh
Gyeongsik Moon
Hongsuk Choi
Kyoung Mu Lee
3DH
29
102
0
28 Mar 2022
REGTR: End-to-end Point Cloud Correspondences with Transformers
REGTR: End-to-end Point Cloud Correspondences with Transformers
Zi Jian Yew
Gim Hee Lee
3DPC
ViT
40
172
0
28 Mar 2022
Stratified Transformer for 3D Point Cloud Segmentation
Stratified Transformer for 3D Point Cloud Segmentation
Xin Lai
Jianhui Liu
Li Jiang
Liwei Wang
Hengshuang Zhao
Shu Liu
Xiaojuan Qi
Jiaya Jia
3DPC
ViT
35
263
0
28 Mar 2022
NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External
  Knowledge
NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge
D. Vo
Hong Chen
Akihiro Sugimoto
Hideki Nakayama
11
13
0
28 Mar 2022
Optimal Correction Cost for Object Detection Evaluation
Optimal Correction Cost for Object Detection Evaluation
Mayu Otani
Riku Togashi
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
38
14
0
28 Mar 2022
DepthFormer: Exploiting Long-Range Correlation and Local Information for
  Accurate Monocular Depth Estimation
DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation
Zhenyu Li
Zehui Chen
Xianming Liu
Junjun Jiang
ViT
MDE
38
183
1
27 Mar 2022
Towards Discriminative Representation: Multi-view Trajectory Contrastive
  Learning for Online Multi-object Tracking
Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking
En Yu
Zhuoling Li
Shoudong Han
46
44
0
27 Mar 2022
Previous
123...878889...104105106
Next