ResearchTrend.AI
End-to-End Object Detection with Transformers


26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,204 papers shown
OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion
Yu-yang Li
Yuliang Guo
Zhixin Yan
Xinyu Huang
Ye Duan
Liu Ren
MDE
26
66
0
02 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
43
14
0
01 Mar 2022
Enhancing Local Feature Learning for 3D Point Cloud Processing using Unary-Pairwise Attention
H. Xiu
Xin Liu
Weimin Wang
Kyoung-Sook Kim
T. Shinohara
Qiong Chang
M. Matsuoka
3DPC
30
5
0
01 Mar 2022
Fuse Local and Global Semantics in Representation Learning
Yuchi Zhao
Yuhao Zhou
FedML
18
1
0
28 Feb 2022
PartAfford: Part-level Affordance Discovery from 3D Objects
Chao Xu
Yixin Chen
He Wang
Song-Chun Zhu
Yixin Zhu
Siyuan Huang
31
25
0
28 Feb 2022
TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
R. Liu
Kailun Yang
Alina Roitberg
Jiaming Zhang
Kunyu Peng
Huayao Liu
Yaonan Wang
Rainer Stiefelhagen
ViT
47
36
0
27 Feb 2022
Analysis of Visual Reasoning on One-Stage Object Detection
Tolga Aksoy
U. Halici
ObjD
24
5
0
26 Feb 2022
An End-to-End Transformer Model for Crowd Localization
Dingkang Liang
Wei Xu
Xiang Bai
18
114
0
26 Feb 2022
Instantaneous Physiological Estimation using Video Transformers
Ambareesh Revanur
Ananyananda Dasari
Conrad S. Tucker
László A. Jeni
26
33
0
24 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
52
13
0
24 Feb 2022
Transformers in Medical Image Analysis: A Review
Kelei He
Chen Gan
Zhuoyuan Li
I. Rekik
Zihao Yin
Wen Ji
Yang Gao
Qian Wang
Junfeng Zhang
D. Shen
ViT
MedIm
28
255
0
24 Feb 2022
Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction
Yubo Ma
Zehao Wang
Yixin Cao
Mukai Li
Meiqi Chen
Kunze Wang
Jing Shao
19
131
0
24 Feb 2022
Auto-scaling Vision Transformers without Training
Wuyang Chen
Wei Huang
Xianzhi Du
Xiaodan Song
Zhangyang Wang
Denny Zhou
ViT
32
23
0
24 Feb 2022
ISDA: Position-Aware Instance Segmentation with Deformable Attention
Kaining Ying
Zhenhua Wang
Cong Bai
Pengfei Zhou
ISeg
22
5
0
23 Feb 2022
Preformer: Predictive Transformer with Multi-Scale Segment-wise Correlations for Long-Term Time Series Forecasting
Dazhao Du
Bing-Huang Su
Zhewei Wei
AI4TS
15
43
0
23 Feb 2022
Better Modelling Out-of-Distribution Regression on Distributed Acoustic Sensor Data Using Anchored Hidden State Mixup
Hasan Asy’ari Arief
P. J. Thomas
T. Wiktorski
OOD
26
4
0
23 Feb 2022
Arbitrary Shape Text Detection using Transformers
Z. Raisi
Georges Younes
John S. Zelek
ViT
36
13
0
22 Feb 2022
Tracking perovskite crystallization via deep learning-based feature detection on 2D X-ray scattering data
V. Starostin
Valentin Munteanu
Alessandro Greco
Ekaterina Kneschaurek
Alina Pleli
F. Bertram
A. Gerlach
A. Hinderhofer
F. Schreiber
19
14
0
22 Feb 2022
Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Jinjiang Guo
Qi Liu
Han Guo
Xi Lu
AI4CE
24
3
0
21 Feb 2022
Visual Attention Network
Meng-Hao Guo
Cheng-Ze Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shi-Min Hu
ViT
VLM
24
637
0
20 Feb 2022
Guide Local Feature Matching by Overlap Estimation
Ying Chen
Dihe Huang
Shang Xu
Jianlin Liu
Yong-Jin Liu
ViT
22
27
0
18 Feb 2022
Task Specific Attention is one more thing you need for object detection
Sang Yon Lee
ViT
26
4
0
18 Feb 2022
Joint Learning of Frequency and Spatial Domains for Dense Predictions
Shaocheng Jia
Wei-Ting Yao
25
0
0
18 Feb 2022
cosFormer: Rethinking Softmax in Attention
Zhen Qin
Weixuan Sun
Huicai Deng
Dongxu Li
Yunshen Wei
Baohong Lv
Junjie Yan
Lingpeng Kong
Yiran Zhong
26
212
0
17 Feb 2022
Revisiting Over-smoothing in BERT from the Perspective of Graph
Han Shi
Jiahui Gao
Hang Xu
Xiaodan Liang
Zhenguo Li
Lingpeng Kong
Stephen M. S. Lee
James T. Kwok
22
71
0
17 Feb 2022
TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Zixu Zhao
Yueming Jin
Pheng-Ann Heng
MedIm
21
45
0
17 Feb 2022
Graph Masked Autoencoders with Transformers
Sixiao Zhang
Hongxu Chen
Haoran Yang
Xiangguo Sun
Philip S. Yu
Guandong Xu
21
18
0
17 Feb 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chenlin Zhang
Jianxin Wu
Yin Li
ViT
31
329
0
16 Feb 2022
Can Deep Learning be Applied to Model-Based Multi-Object Tracking?
Juliano Pinto
Georg Hess
William Ljungbergh
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
23
10
0
16 Feb 2022
RNGDet: Road Network Graph Detection by Transformer in Aerial Images
Zhenhua Xu
Yuxuan Liu
Lu Gan
Yuxiang Sun
Xinyu Wu
Meilin Liu
Lujia Wang
ViT
31
46
0
16 Feb 2022
Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Youwei Liang
Chongjian Ge
Zhan Tong
Yibing Song
Jue Wang
P. Xie
ViT
25
236
0
16 Feb 2022
Few-shot semantic segmentation via mask aggregation
Wei Ao
Shunyi Zheng
Yan Meng
ISeg
22
5
0
15 Feb 2022
Box Supervised Video Segmentation Proposal Network
Tanveer Hannan
Rajat Koner
Jonathan Kobold
Matthias Schubert
VOS
30
5
0
14 Feb 2022
Handcrafted Histological Transformer (H2T): Unsupervised Representation of Whole Slide Images
Q. Vu
K. Rajpoot
S. Raza
Nasir M. Rajpoot
ViT
MedIm
25
33
0
14 Feb 2022
An experimental study of the vision-bottleneck in VQA
Pierre Marza
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
22
1
0
14 Feb 2022
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho
Sunghwan Hong
Seungryong Kim
ViT
27
34
0
14 Feb 2022
How Do Vision Transformers Work?
Namuk Park
Songkuk Kim
ViT
47
465
0
14 Feb 2022
Opinions Vary? Diagnosis First!
Junde Wu
Huihui Fang
Dalu Yang
Binghong Wu
Wenshuo Zhou
Fangxin Shang
Yehui Yang
Yanwu Xu
30
3
0
14 Feb 2022
BViT: Broad Attention based Vision Transformer
Nannan Li
Yaran Chen
Weifan Li
Zixiang Ding
Dong Zhao
ViT
38
23
0
13 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
18
53
0
11 Feb 2022
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos
Merey Ramazanova
Victor Escorcia
Fabian Caba Heilbron
Chen Zhao
Bernard Ghanem
28
3
0
10 Feb 2022
Spherical Transformer
Sungmin Cho
Raehyuk Jung
Junseok Kwon
ViT
10
10
0
10 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
ViT
145
170
0
08 Feb 2022
LwPosr: Lightweight Efficient Fine-Grained Head Pose Estimation
Naina Dhingra
29
16
0
07 Feb 2022
Recent Trends in 2D Object Detection and Applications in Video Event Recognition
Prithwish Jana
Partha Pratim Mohanta
6
1
0
07 Feb 2022
Patch-Based Stochastic Attention for Image Editing
Nicolas Cherel
Andrés Almansa
Y. Gousseau
A. Newson
25
6
0
07 Feb 2022
Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics
Arnav Varma
Hemang Chawla
Bahram Zonooz
Elahe Arani
ViT
MDE
36
49
0
07 Feb 2022
Webly Supervised Concept Expansion for General Purpose Vision Models
Amita Kamath
Christopher Clark
Tanmay Gupta
Eric Kolve
Derek Hoiem
Aniruddha Kembhavi
VLM
32
54
0
04 Feb 2022
ETSformer: Exponential Smoothing Transformers for Time-series Forecasting
Gerald Woo
Chenghao Liu
Doyen Sahoo
Akshat Kumar
Steven Hoi
AI4TS
31
161
0
03 Feb 2022
IFOR: Iterative Flow Minimization for Robotic Object Rearrangement
Ankit Goyal
Arsalan Mousavian
Chris Paxton
Yu-Wei Chao
Brian Okorn
Jia Deng
Dieter Fox
35
55
0
01 Feb 2022