ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,198 papers shown
Title
Interactron: Embodied Adaptive Object Detection
Interactron: Embodied Adaptive Object Detection
Klemen Kotar
Roozbeh Mottaghi
39
25
0
01 Feb 2022
Detecting Human-Object Interactions with Object-Guided Cross-Modal
  Calibrated Semantics
Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics
Hangjie Yuan
Mang Wang
Dong Ni
Liangpeng Xu
19
36
0
01 Feb 2022
Query Efficient Decision Based Sparse Attacks Against Black-Box Deep
  Learning Models
Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models
Viet Vo
Ehsan Abbasnejad
D. Ranasinghe
AAML
30
14
0
31 Jan 2022
Understanding AdamW through Proximal Methods and Scale-Freeness
Understanding AdamW through Proximal Methods and Scale-Freeness
Zhenxun Zhuang
Mingrui Liu
Ashok Cutkosky
Francesco Orabona
39
63
0
31 Jan 2022
Learning Super-Features for Image Retrieval
Learning Super-Features for Image Retrieval
Philippe Weinzaepfel
Thomas Lucas
Diane Larlus
Yannis Kalantidis
SupR
VLM
33
45
0
31 Jan 2022
Deep Learning Approaches on Image Captioning: A Review
Deep Learning Approaches on Image Captioning: A Review
Taraneh Ghandi
H. Pourreza
H. Mahyar
VLM
16
89
0
31 Jan 2022
TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of
  Medical Images
TransBTSV2: Towards Better and More Efficient Volumetric Segmentation of Medical Images
Jiangyun Li
Wenxuan Wang
Chen Chen
Tianxiang Zhang
Sen Zha
Jing Wang
Hong Yu
ViT
MedIm
26
24
0
30 Jan 2022
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training
  via Multi-Stage Learning
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning
Zejun Li
Zhihao Fan
Huaixiao Tou
Jingjing Chen
Zhongyu Wei
Xuanjing Huang
25
16
0
29 Jan 2022
Flashlight: Enabling Innovation in Tools for Machine Learning
Flashlight: Enabling Innovation in Tools for Machine Learning
Jacob Kahn
Vineel Pratap
Tatiana Likhomanenko
Qiantong Xu
Awni Y. Hannun
...
Gilad Avidov
Benoit Steiner
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
26
28
0
29 Jan 2022
Mobile Robot Manipulation using Pure Object Detection
Mobile Robot Manipulation using Pure Object Detection
Brent A. Griffin
39
7
0
28 Jan 2022
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR
Shilong Liu
Feng Li
Hao Zhang
X. Yang
Xianbiao Qi
Hang Su
Jun Zhu
Lei Zhang
ViT
161
728
0
28 Jan 2022
VRT: A Video Restoration Transformer
VRT: A Video Restoration Transformer
Christos Sakaridis
Jingyun Liang
Yuchen Fan
Peng Sun
Rakesh Ranjan
Yawei Li
Radu Timofte
Luc Van Gool
ViT
34
251
0
28 Jan 2022
Learning Proximal Operators to Discover Multiple Optima
Learning Proximal Operators to Discover Multiple Optima
Lingxiao Li
Noam Aigerman
Vladimir G. Kim
Jiajin Li
Kristjan Greenewald
Mikhail Yurochkin
Justin Solomon
47
1
0
28 Jan 2022
RelTR: Relation Transformer for Scene Graph Generation
RelTR: Relation Transformer for Scene Graph Generation
Yuren Cong
M. Yang
Bodo Rosenhahn
ViT
97
133
0
27 Jan 2022
DocSegTr: An Instance-Level End-to-End Document Image Segmentation
  Transformer
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer
Sanket Biswas
Ayan Banerjee
Josep Lladós
Umapada Pal
ViT
19
23
0
27 Jan 2022
Training Vision Transformers with Only 2040 Images
Training Vision Transformers with Only 2040 Images
Yunhao Cao
Hao Yu
Jianxin Wu
ViT
110
42
0
26 Jan 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for
  Visual Question Answering
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering
Peixi Xiong
Quanzeng You
Pei Yu
Zicheng Liu
Ying Wu
24
5
0
25 Jan 2022
Simultaneous Human-robot Matching and Routing for Multi-robot Tour
  Guiding under Time Uncertainty
Simultaneous Human-robot Matching and Routing for Multi-robot Tour Guiding under Time Uncertainty
Bo Fu
Tribhi Kathuria
Denise M. Rizzo
Matthew Castanier
X. J. Yang
Maani Ghaffari
Kira Barton
24
7
0
25 Jan 2022
DocEnTr: An End-to-End Document Image Enhancement Transformer
DocEnTr: An End-to-End Document Image Enhancement Transformer
Mohamed Ali Souibgui
Sanket Biswas
Sana Khamekhem Jemni
Yousri Kessentini
Alicia Fornés
Josep Lladós
Umapada Pal
ViT
58
45
0
25 Jan 2022
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With
  Transformer for Sentence Grounding in Videos
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos
Sangmin Woo
Jinyoung Park
Inyong Koo
Sumin Lee
Minki Jeong
Changick Kim
44
3
0
25 Jan 2022
Transformers in Medical Imaging: A Survey
Transformers in Medical Imaging: A Survey
Fahad Shamshad
Salman Khan
Syed Waqas Zamir
Muhammad Haris Khan
Munawar Hayat
F. Khan
Huazhu Fu
ViT
LM&MA
MedIm
111
663
0
24 Jan 2022
Describe me if you can! Characterized Instance-level Human Parsing
Describe me if you can! Characterized Instance-level Human Parsing
Angélique Loesch
Romaric Audigier
25
7
0
24 Jan 2022
UniFormer: Unifying Convolution and Self-attention for Visual
  Recognition
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
Kunchang Li
Yali Wang
Junhao Zhang
Peng Gao
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
162
360
0
24 Jan 2022
Dynamic Label Assignment for Object Detection by Combining Predicted
  IoUs and Anchor IoUs
Dynamic Label Assignment for Object Detection by Combining Predicted IoUs and Anchor IoUs
Tianxiao Zhang
Bo Luo
A. Sharda
Guanghui Wang
39
18
0
23 Jan 2022
ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer
ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer
Pengfei Guo
Yiqun Mei
Jinyuan Zhou
Shanshan Jiang
Vishal M. Patel
ViT
MedIm
91
65
0
23 Jan 2022
A Transformer-Based Feature Segmentation and Region Alignment Method For
  UAV-View Geo-Localization
A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization
Ming Dai
Jian Hu
Jiedong Zhuang
E. Zheng
ViT
45
111
0
23 Jan 2022
Learning to Minimize the Remainder in Supervised Learning
Learning to Minimize the Remainder in Supervised Learning
Yan Luo
Yongkang Wong
Mohan S. Kankanhalli
Qi Zhao
46
1
0
23 Jan 2022
Dual-Flattening Transformers through Decomposed Row and Column Queries
  for Semantic Segmentation
Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic Segmentation
Ying Wang
C. Ho
Wenju Xu
Ziwei Xuan
Xudong Liu
Guo-Jun Qi
ViT
25
5
0
22 Jan 2022
Representing Long-Range Context for Graph Neural Networks with Global
  Attention
Representing Long-Range Context for Graph Neural Networks with Global Attention
Zhanghao Wu
Paras Jain
Matthew A. Wright
Azalia Mirhoseini
Joseph E. Gonzalez
Ion Stoica
GNN
46
258
0
21 Jan 2022
Omnivore: A Single Model for Many Visual Modalities
Omnivore: A Single Model for Many Visual Modalities
Rohit Girdhar
Mannat Singh
Nikhil Ravi
L. V. D. van der Maaten
Armand Joulin
Ishan Misra
226
225
0
20 Jan 2022
Temporal Sentence Grounding in Videos: A Survey and Future Directions
Temporal Sentence Grounding in Videos: A Survey and Future Directions
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
3DGS
36
38
0
20 Jan 2022
TerViT: An Efficient Ternary Vision Transformer
TerViT: An Efficient Ternary Vision Transformer
Sheng Xu
Yanjing Li
Teli Ma
Bo-Wen Zeng
Baochang Zhang
Peng Gao
Jinhu Lv
ViT
23
11
0
20 Jan 2022
Q-ViT: Fully Differentiable Quantization for Vision Transformer
Q-ViT: Fully Differentiable Quantization for Vision Transformer
Zhexin Li
Tong Yang
Peisong Wang
Jian Cheng
ViT
MQ
33
41
0
19 Jan 2022
TransFuse: A Unified Transformer-based Image Fusion Framework using
  Self-supervised Learning
TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised Learning
Linhao Qu
Shaolei Liu
Manning Wang
Shiman Li
Siqi Yin
Qin Qiao
Zhijian Song
ViT
SSL
11
22
0
19 Jan 2022
Poseur: Direct Human Pose Regression with Transformers
Poseur: Direct Human Pose Regression with Transformers
Wei Mao
Yongtao Ge
Chunhua Shen
Zhi Tian
Xinlong Wang
Zhibin Wang
Anton Van Den Hengel
ViT
29
81
0
19 Jan 2022
ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via
  Exploiting CLIP Cues
ProposalCLIP: Unsupervised Open-Category Object Proposal Generation via Exploiting CLIP Cues
Hengcan Shi
Munawar Hayat
Yicheng Wu
Jianfei Cai
VLM
30
60
0
18 Jan 2022
VAQF: Fully Automatic Software-Hardware Co-Design Framework for Low-Bit
  Vision Transformer
VAQF: Fully Automatic Software-Hardware Co-Design Framework for Low-Bit Vision Transformer
Mengshu Sun
Haoyu Ma
Guoliang Kang
Yi Ding
Tianlong Chen
Xiaolong Ma
Zhangyang Wang
Yanzhi Wang
ViT
33
45
0
17 Jan 2022
RestoreFormer: High-Quality Blind Face Restoration from Undegraded
  Key-Value Pairs
RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs
Zhouxia Wang
Jiawei Zhang
Runjian Chen
Wenping Wang
Ping Luo
CVBM
20
109
0
17 Jan 2022
Continual Transformers: Redundancy-Free Attention for Online Inference
Continual Transformers: Redundancy-Free Attention for Online Inference
Lukas Hedegaard
Arian Bakhtiarnia
Alexandros Iosifidis
CLL
27
11
0
17 Jan 2022
Video Transformers: A Survey
Video Transformers: A Survey
Javier Selva
A. S. Johansen
Sergio Escalera
Kamal Nasrollahi
T. Moeslund
Albert Clapés
ViT
22
103
0
16 Jan 2022
Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic
  Segmentation
Sparse Cross-scale Attention Network for Efficient LiDAR Panoptic Segmentation
Shuangjie Xu
Rui Wan
Maosheng Ye
Xiaoyi Zou
Tongyi Cao
3DPC
18
32
0
16 Jan 2022
Domain Adaptation via Bidirectional Cross-Attention Transformer
Domain Adaptation via Bidirectional Cross-Attention Transformer
Xiyu Wang
Pengxin Guo
Yu Zhang
ViT
30
19
0
15 Jan 2022
TransVOD: End-to-End Video Object Detection with Spatial-Temporal
  Transformers
TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers
Qianyu Zhou
Hefei Ling
Lu He
Li Niu
Guangliang Cheng
Yunhai Tong
Lizhuang Ma
Liqing Zhang
ViT
38
134
0
13 Jan 2022
CFNet: Learning Correlation Functions for One-Stage Panoptic
  Segmentation
CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation
Yifeng Chen
Wenqing Chu
Fangfang Wang
Ying Tai
Ran Yi
Zhenye Gan
Lili Yao
Chengjie Wang
Xi Li
ISeg
27
2
0
13 Jan 2022
A Survey on Masked Facial Detection Methods and Datasets for Fighting
  Against COVID-19
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19
Bingshu Wang
Jiangbin Zheng
C. L. P. Chen
21
29
0
13 Jan 2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised
  Few-Shot Learning
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
A. Zhmoginov
Mark Sandler
Max Vladymyrov
ViT
33
68
0
11 Jan 2022
Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at
  Scale
Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale
Gang Li
Gilles Baechler
Manuel Tragut
Yang Li
24
49
0
11 Jan 2022
Pyramid Fusion Transformer for Semantic Segmentation
Pyramid Fusion Transformer for Semantic Segmentation
Zipeng Qin
Jianbo Liu
Xiaoling Zhang
Maoqing Tian
Aojun Zhou
Shuai Yi
Hongsheng Li
ViT
31
15
0
11 Jan 2022
Swin Transformer for Fast MRI
Swin Transformer for Fast MRI
Jiahao Huang
Yingying Fang
Yinzhe Wu
Huanjun Wu
Zhifan Gao
Yang Li
Javier Del Ser
Jun Xia
Guang Yang
ViT
OOD
32
142
0
10 Jan 2022
ImageSubject: A Large-scale Dataset for Subject Detection
ImageSubject: A Large-scale Dataset for Subject Detection
Xin Miao
Jiayi Liu
Huayan Wang
J. Fu
16
0
0
09 Jan 2022
Previous
123...929394...102103104
Next