Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.12872
Cited By
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Object Detection with Transformers"
50 / 5,204 papers shown
Title
OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion
Yu-yang Li
Yuliang Guo
Zhixin Yan
Xinyu Huang
Ye Duan
Liu Ren
MDE
26
66
0
02 Mar 2022
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
Jing Tan
Yuhong Wang
Gangshan Wu
Limin Wang
43
14
0
01 Mar 2022
Enhancing Local Feature Learning for 3D Point Cloud Processing using Unary-Pairwise Attention
H. Xiu
Xin Liu
Weimin Wang
Kyoung-Sook Kim
T. Shinohara
Qiong Chang
M. Matsuoka
3DPC
30
5
0
01 Mar 2022
Fuse Local and Global Semantics in Representation Learning
Yuchi Zhao
Yuhao Zhou
FedML
18
1
0
28 Feb 2022
PartAfford: Part-level Affordance Discovery from 3D Objects
Chao Xu
Yixin Chen
He Wang
Song-Chun Zhu
Yixin Zhu
Siyuan Huang
31
25
0
28 Feb 2022
TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
R. Liu
Kailun Yang
Alina Roitberg
Jiaming Zhang
Kunyu Peng
Huayao Liu
Yaonan Wang
Rainer Stiefelhagen
ViT
47
36
0
27 Feb 2022
Analysis of Visual Reasoning on One-Stage Object Detection
Tolga Aksoy
U. Halici
ObjD
24
5
0
26 Feb 2022
An End-to-End Transformer Model for Crowd Localization
Dingkang Liang
Wei Xu
Xiang Bai
18
114
0
26 Feb 2022
Instantaneous Physiological Estimation using Video Transformers
Ambareesh Revanur
Ananyananda Dasari
Conrad S. Tucker
László A. Jeni
26
33
0
24 Feb 2022
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Dacheng Yin
Xuanchi Ren
Chong Luo
Yuwang Wang
Zhiwei Xiong
Wenjun Zeng
52
13
0
24 Feb 2022
Transformers in Medical Image Analysis: A Review
Kelei He
Chen Gan
Zhuoyuan Li
I. Rekik
Zihao Yin
Wen Ji
Yang Gao
Qian Wang
Junfeng Zhang
D. Shen
ViT
MedIm
28
255
0
24 Feb 2022
Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction
Yubo Ma
Zehao Wang
Yixin Cao
Mukai Li
Meiqi Chen
Kunze Wang
Jing Shao
19
131
0
24 Feb 2022
Auto-scaling Vision Transformers without Training
Wuyang Chen
Wei Huang
Xianzhi Du
Xiaodan Song
Zhangyang Wang
Denny Zhou
ViT
32
23
0
24 Feb 2022
ISDA: Position-Aware Instance Segmentation with Deformable Attention
Kaining Ying
Zhenhua Wang
Cong Bai
Pengfei Zhou
ISeg
22
5
0
23 Feb 2022
Preformer: Predictive Transformer with Multi-Scale Segment-wise Correlations for Long-Term Time Series Forecasting
Dazhao Du
Bing-Huang Su
Zhewei Wei
AI4TS
15
43
0
23 Feb 2022
Better Modelling Out-of-Distribution Regression on Distributed Acoustic Sensor Data Using Anchored Hidden State Mixup
Hasan Asy’ari Arief
P. J. Thomas
T. Wiktorski
OOD
26
4
0
23 Feb 2022
Arbitrary Shape Text Detection using Transformers
Z. Raisi
Georges Younes
John S. Zelek
ViT
36
13
0
22 Feb 2022
Tracking perovskite crystallization via deep learning-based feature detection on 2D X-ray scattering data
V. Starostin
Valentin Munteanu
Alessandro Greco
Ekaterina Kneschaurek
Alina Pleli
F. Bertram
A. Gerlach
A. Hinderhofer
F. Schreiber
19
14
0
22 Feb 2022
Ligandformer: A Graph Neural Network for Predicting Compound Property with Robust Interpretation
Jinjiang Guo
Qi Liu
Han Guo
Xi Lu
AI4CE
24
3
0
21 Feb 2022
Visual Attention Network
Meng-Hao Guo
Chengrou Lu
Zheng-Ning Liu
Ming-Ming Cheng
Shiyong Hu
ViT
VLM
24
637
0
20 Feb 2022
Guide Local Feature Matching by Overlap Estimation
Ying Chen
Dihe Huang
Shang Xu
Jianlin Liu
Yong-Jin Liu
ViT
22
27
0
18 Feb 2022
Task Specific Attention is one more thing you need for object detection
Sang Yon Lee
ViT
26
4
0
18 Feb 2022
Joint Learning of Frequency and Spatial Domains for Dense Predictions
Shaocheng Jia
Wei-Ting Yao
25
0
0
18 Feb 2022
cosFormer: Rethinking Softmax in Attention
Zhen Qin
Weixuan Sun
Huicai Deng
Dongxu Li
Yunshen Wei
Baohong Lv
Junjie Yan
Lingpeng Kong
Yiran Zhong
26
212
0
17 Feb 2022
Revisiting Over-smoothing in BERT from the Perspective of Graph
Han Shi
Jiahui Gao
Hang Xu
Xiaodan Liang
Zhenguo Li
Lingpeng Kong
Stephen M. S. Lee
James T. Kwok
22
71
0
17 Feb 2022
TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Zixu Zhao
Yueming Jin
Pheng-Ann Heng
MedIm
21
45
0
17 Feb 2022
Graph Masked Autoencoders with Transformers
Sixiao Zhang
Hongxu Chen
Haoran Yang
Xiangguo Sun
Philip S. Yu
Guandong Xu
21
18
0
17 Feb 2022
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
31
329
0
16 Feb 2022
Can Deep Learning be Applied to Model-Based Multi-Object Tracking?
Juliano Pinto
Georg Hess
William Ljungbergh
Yuxuan Xia
H. Wymeersch
Lennart Svensson
VOT
23
10
0
16 Feb 2022
RNGDet: Road Network Graph Detection by Transformer in Aerial Images
Zhenhua Xu
Yuxuan Liu
Lu Gan
Yuxiang Sun
Xinyu Wu
Meilin Liu
Lujia Wang
ViT
31
46
0
16 Feb 2022
Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
Youwei Liang
Chongjian Ge
Zhan Tong
Yibing Song
Jue Wang
P. Xie
ViT
25
236
0
16 Feb 2022
Few-shot semantic segmentation via mask aggregation
Wei Ao
Shunyi Zheng
Yan Meng
ISeg
22
5
0
15 Feb 2022
Box Supervised Video Segmentation Proposal Network
Tanveer Hannan
Rajat Koner
Jonathan Kobold
Matthias Schubert
VOS
30
5
0
14 Feb 2022
Handcrafted Histological Transformer (H2T): Unsupervised Representation of Whole Slide Images
Q. Vu
K. Rajpoot
S. Raza
Nasir M. Rajpoot
ViT
MedIm
25
33
0
14 Feb 2022
An experimental study of the vision-bottleneck in VQA
Pierre Marza
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
22
1
0
14 Feb 2022
CATs++: Boosting Cost Aggregation with Convolutions and Transformers
Seokju Cho
Sunghwan Hong
Seung Wook Kim
ViT
27
34
0
14 Feb 2022
How Do Vision Transformers Work?
Namuk Park
Songkuk Kim
ViT
47
465
0
14 Feb 2022
Opinions Vary? Diagnosis First!
Junde Wu
Huihui Fang
Dalu Yang
Binghong Wu
Wenshuo Zhou
Fangxin Shang
Yehui Yang
Yanwu Xu
30
3
0
14 Feb 2022
BViT: Broad Attention based Vision Transformer
Nannan Li
Yaran Chen
Weifan Li
Zixiang Ding
Dong Zhao
ViT
38
23
0
13 Feb 2022
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer
Yair Kittenplon
I. Lavi
Sharon Fogel
Yarin Bar
R. Manmatha
Pietro Perona
ViT
18
53
0
11 Feb 2022
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos
Merey Ramazanova
Victor Escorcia
Fabian Caba Heilbron
Chen Zhao
Guohao Li
28
3
0
10 Feb 2022
Spherical Transformer
Sungmin Cho
Raehyuk Jung
Junseok Kwon
ViT
10
10
0
10 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
ViT
145
170
0
08 Feb 2022
LwPosr: Lightweight Efficient Fine-Grained Head Pose Estimation
Naina Dhingra
29
16
0
07 Feb 2022
Recent Trends in 2D Object Detection and Applications in Video Event Recognition
Prithwish Jana
Partha Pratim Mohanta
6
1
0
07 Feb 2022
Patch-Based Stochastic Attention for Image Editing
Nicolas Cherel
Andrés Almansa
Y. Gousseau
A. Newson
25
6
0
07 Feb 2022
Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics
Arnav Varma
Hemang Chawla
Bahram Zonooz
Elahe Arani
ViT
MDE
36
49
0
07 Feb 2022
Webly Supervised Concept Expansion for General Purpose Vision Models
Amita Kamath
Christopher Clark
Tanmay Gupta
Eric Kolve
Derek Hoiem
Aniruddha Kembhavi
VLM
32
54
0
04 Feb 2022
ETSformer: Exponential Smoothing Transformers for Time-series Forecasting
Gerald Woo
Chenghao Liu
Doyen Sahoo
Akshat Kumar
Guosheng Lin
AI4TS
31
161
0
03 Feb 2022
IFOR: Iterative Flow Minimization for Robotic Object Rearrangement
Ankit Goyal
Arsalan Mousavian
Chris Paxton
Yu-Wei Chao
Brian Okorn
Jia Deng
Dieter Fox
35
55
0
01 Feb 2022
Previous
1
2
3
...
91
92
93
...
103
104
105
Next