Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.12872
Cited By
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Object Detection with Transformers"
50 / 5,281 papers shown
Title
Occlusion-Aware Instance Segmentation via BiLayer Network Architectures
Lei Ke
Yu-Wing Tai
Chi-Keung Tang
ISeg
32
11
0
08 Aug 2022
Semi-Supervised Cross-Modal Salient Object Detection with U-Structure Networks
Yunqing Bao
Hang Dai
Abdulmotaleb Elsaddik
24
2
0
08 Aug 2022
3D Vision with Transformers: A Survey
Jean Lahoud
Jiale Cao
Fahad Shahbaz Khan
Hisham Cholakkal
Rao Muhammad Anwer
Salman Khan
Ming-Hsuan Yang
ViT
MedIm
47
32
0
08 Aug 2022
Global Hierarchical Attention for 3D Point Cloud Analysis
Dan Jia
Alexander Hermans
Bastian Leibe
3DPC
21
0
0
07 Aug 2022
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation
Sebastian Lutz
R. Blythman
Koustav Ghosal
Matthew Moynihan
C. Simms
A. Smolic
ViT
34
15
0
07 Aug 2022
No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects
Raja Sunkara
Tie-Mei Luo
ObjD
27
323
0
07 Aug 2022
Blackbox Attacks via Surrogate Ensemble Search
Zikui Cai
Chengyu Song
S. Krishnamurthy
Amit K. Roy-Chowdhury
Ulugbek S. Kamilov
AAML
27
19
0
07 Aug 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer
Chaoqiang Zhao
Youming Zhang
Matteo Poggi
Fabio Tosi
Xianda Guo
Zheng Zhu
Guan Huang
Yang Tang
S. Mattoccia
ViT
MDE
49
176
0
06 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
33
12
0
06 Aug 2022
Transformer-based assignment decision network for multiple object tracking
Athena Psalta
Vasileios Tsironis
K. Karantzalos
VOT
79
9
0
06 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
31
13
0
05 Aug 2022
ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding
Bingning Wang
Feiya Lv
Ting Yao
Yiming Yuan
Jin Ma
Yu Luo
Haijin Liang
31
3
0
05 Aug 2022
Multimodal Brain Disease Classification with Functional Interaction Learning from Single fMRI Volume
Weiming Dai
Zi-Ke Zhang
L. Tian
Shengyuan Yu
Shuhui Wang
Zhao Dong
Hairong Zheng
MedIm
33
2
0
05 Aug 2022
Calibrate the inter-observer segmentation uncertainty via diagnosis-first principle
Junde Wu
Huihui Fang
Hoayi Xiong
Lixin Duan
Mingkui Tan
Weihua Yang
Huiying Liu
Yanwu Xu
MedIm
55
1
0
05 Aug 2022
TransMatting: Enhancing Transparent Objects Matting with Transformers
Huanqia Cai
Fanglei Xue
Lele Xu
Lili Guo
ViT
21
20
0
05 Aug 2022
An Efficient Person Clustering Algorithm for Open Checkout-free Groceries
Junde Wu
Yu Zhang
Rao Fu
Yuanpei Liu
Jing Gao
29
1
0
05 Aug 2022
Vision-Centric BEV Perception: A Survey
Yuexin Ma
Tai Wang
Xuyang Bai
Huitong Yang
Yuenan Hou
Yaming Wang
Yu Qiao
Ruigang Yang
Tianyi Zhou
Xinge Zhu
66
130
0
04 Aug 2022
TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
Zhipeng Luo
Gongjie Zhang
Changqing Zhou
Ti Liu
Shijian Lu
Liang Pan
3DPC
ViT
53
9
0
04 Aug 2022
DropKey
Bonan li
Yinhan Hu
Xuecheng Nie
Congying Han
Xiangjian Jiang
Tiande Guo
Luoqi Liu
20
11
0
04 Aug 2022
MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth
Chenjie Cao
Xinlin Ren
Yanwei Fu
36
48
0
04 Aug 2022
End-to-end deep learning for directly estimating grape yield from ground-based imagery
A. Olenskyj
B. Sams
Zhenghao Fei
Vishal Singh
P. Raja
G. Bornhorst
J. M. Earles
31
28
0
04 Aug 2022
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training
De-An Huang
Zhiding Yu
Anima Anandkumar
VLM
58
78
0
03 Aug 2022
Re-Attention Transformer for Weakly Supervised Object Localization
Hui Su
Yue Ye
Zhiwei Chen
Min-Gyoo Song
Lechao Cheng
WSOL
38
14
0
03 Aug 2022
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation
Jun Wang
M. Gao
Yuqian Hu
Ramprasaath R. Selvaraju
Chetan Ramaiah
Ran Xu
J. JáJá
Larry S. Davis
ViT
35
17
0
03 Aug 2022
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Junru Gu
Chenxu Hu
Tian-Yi Zhang
Xuanyao Chen
Yilun Wang
Yue Wang
Hang Zhao
33
98
0
02 Aug 2022
Unified Normalization for Accelerating and Stabilizing Transformers
Qiming Yang
Kai Zhang
Chaoxiang Lan
Zhi Yang
Zheyang Li
Wenming Tan
Jun Xiao
Shiliang Pu
23
8
0
02 Aug 2022
Making the Best of Both Worlds: A Domain-Oriented Transformer for Unsupervised Domain Adaptation
Wen-hui Ma
Jinming Zhang
Shuang Li
Chi Harold Liu
Yulin Wang
Wei Li
26
14
0
02 Aug 2022
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
Ye Yu
Jialing Yuan
Gaurav Mittal
Fuxin Li
Mei Chen
VOS
67
28
0
01 Aug 2022
Understanding Adversarial Robustness of Vision Transformers via Cauchy Problem
Zheng Wang
Wenjie Ruan
ViT
44
8
0
01 Aug 2022
Cross Attention Based Style Distribution for Controllable Person Image Synthesis
Xinyue Zhou
M. Yin
Xinyuan Chen
Li Sun
Changxin Gao
Qingli Li
DiffM
19
54
0
01 Aug 2022
Search for or Navigate to? Dual Adaptive Thinking for Object Navigation
Ronghao Dang
Liuyi Wang
Zongtao He
Shuai Su
Chengju Liu
Qi Chen
22
16
0
01 Aug 2022
One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement
Zihao Yin
Ping Gong
Chun-yu Wang
Yizhou Yu
Yizhou Wang
37
12
0
31 Jul 2022
Less is More: Consistent Video Depth Estimation with Masked Frames Modeling
Yiran Wang
Zhiyu Pan
Xingyi Li
Zhiguo Cao
Ke Xian
Jianming Zhang
35
27
0
31 Jul 2022
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD
LRM
25
9
0
31 Jul 2022
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding
Hao-Kai Wen
Yunze Liu
Jingwei Huang
Bokun Duan
Li Yi
ViT
3DPC
33
26
0
30 Jul 2022
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation
Gongjie Zhang
Zhipeng Luo
Kaiwen Cui
Shijian Lu
Eric P. Xing
ViT
51
93
0
30 Jul 2022
Computer Vision Methods for the Microstructural Analysis of Materials: The State-of-the-art and Future Perspectives
Khaled Alrfou
Amir Kordijazi
Tian Zhao
3DV
50
6
0
29 Jul 2022
End-to-end View Synthesis via NeRF Attention
Zelin Zhao
Jiaya Jia
39
8
0
29 Jul 2022
Prompting for Multi-Modal Tracking
Jinyu Yang
Zhe Li
Fengcai Zheng
A. Leonardis
Jingkuan Song
26
89
0
29 Jul 2022
ScaleFormer: Revisiting the Transformer-based Backbones from a Scale-wise Perspective for Medical Image Segmentation
Huimin Huang
Shiao Xie
Lanfen Lin
Yutaro Iwamoto
X. Han
Yen-Wei Chen
Ruofeng Tong
ViT
MedIm
27
45
0
29 Jul 2022
Visual Recognition by Request
Chufeng Tang
Lingxi Xie
Xiaopeng Zhang
Xiaolin Hu
Qi Tian
VLM
21
15
0
28 Jul 2022
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion
Gongjie Zhang
Zhipeng Luo
Jiaxing Huang
Shijian Lu
Eric Xing
ViT
46
20
0
28 Jul 2022
A Transformer-based Generative Adversarial Network for Brain Tumor Segmentation
Liqun Huang
Long Chen
Bai-wen Zhang
S. Chai
MedIm
ViT
26
27
0
28 Jul 2022
Towards Large-Scale Small Object Detection: Survey and Benchmarks
Gong Cheng
Xiang Yuan
Xiwen Yao
Ke Yan
Qinghua Zeng
Xingxing Xie
Junwei Han
ObjD
41
308
0
28 Jul 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Hao Shao
Letian Wang
Ruobing Chen
Hongsheng Li
Y. Liu
52
196
0
28 Jul 2022
Video Mask Transfiner for High-Quality Video Instance Segmentation
Lei Ke
Henghui Ding
Martin Danelljan
Yu-Wing Tai
Chi-Keung Tang
Feng Yu
29
29
0
28 Jul 2022
Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers
Junhyeong Cho
Youwang Kim
Tae-Hyun Oh
ViT
16
121
0
27 Jul 2022
AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing
Jiaxi Jiang
Paul Streli
Huajian Qiu
A. Fender
Larissa Laich
Patrick Snape
Christian Holz
26
106
0
27 Jul 2022
Membership Inference Attacks via Adversarial Examples
Hamid Jalalzai
Elie Kadoche
Rémi Leluc
Vincent Plassier
AAML
FedML
MIACV
55
7
0
27 Jul 2022
Iterative Scene Graph Generation
Siddhesh Khandelwal
Leonid Sigal
OCL
34
29
0
27 Jul 2022
Previous
1
2
3
...
77
78
79
...
104
105
106
Next