Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.12450
Cited By
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
25 November 2020
Pei Sun
Rufeng Zhang
Yi-Xin Jiang
Tao Kong
Chenfeng Xu
Wei Zhan
Masayoshi Tomizuka
Lei Li
Zehuan Yuan
Changhu Wang
Ping Luo
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sparse R-CNN: End-to-End Object Detection with Learnable Proposals"
15 / 165 papers shown
Title
Structured Sparse R-CNN for Direct Scene Graph Generation
Yao Teng
Limin Wang
3DPC
GNN
26
53
0
21 Jun 2021
CAT: Cross Attention in Vision Transformer
Hezheng Lin
Xingyi Cheng
Xiangyu Wu
Fan Yang
Dong Shen
Zhongyuan Wang
Qing Song
Wei Yuan
ViT
35
149
0
10 Jun 2021
Gaze Estimation using Transformer
Yihua Cheng
Feng Lu
ViT
21
87
0
30 May 2021
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
Meng-Hao Guo
Zheng-Ning Liu
Tai-Jiang Mu
Shimin Hu
28
472
0
05 May 2021
Instances as Queries
Yuxin Fang
Shusheng Yang
Xinggang Wang
Yu Li
Chen Fang
Ying Shan
Bin Feng
Wenyu Liu
ISeg
42
255
0
05 May 2021
Visformer: The Vision-friendly Transformer
Zhengsu Chen
Lingxi Xie
Jianwei Niu
Xuefeng Liu
Longhui Wei
Qi Tian
ViT
120
209
0
26 Apr 2021
Efficient DETR: Improving End-to-End Object Detector with Dense Prior
Z. Yao
Jiangbo Ai
Boxun Li
Chi Zhang
ViT
48
214
0
03 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
ViT
148
20,710
0
25 Mar 2021
Augmenting Proposals by the Detector Itself
Xiaopei Wan
Zhenhua Guo
Chao He
Yujiu Yang
Fangbo Tao
ObjD
34
2
0
28 Jan 2021
TransTrack: Multiple Object Tracking with Transformer
Pei Sun
Jinkun Cao
Yi-Xin Jiang
Rufeng Zhang
Enze Xie
Zehuan Yuan
Changhu Wang
Ping Luo
ViT
VOT
264
567
0
31 Dec 2020
MiniVLM: A Smaller and Faster Vision-Language Model
Jianfeng Wang
Xiaowei Hu
Pengchuan Zhang
Xiujun Li
Lijuan Wang
Lefei Zhang
Jianfeng Gao
Zicheng Liu
VLM
MLLM
35
59
0
13 Dec 2020
One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks
Kemal Oksuz
Baris Can Cam
Sinan Kalkan
Emre Akbas
32
32
0
21 Nov 2020
FairMOT: On the Fairness of Detection and Re-Identification in Multiple Object Tracking
Yifu Zhang
Chunyu Wang
Xinggang Wang
Wenjun Zeng
Wenyu Liu
VOT
34
1,305
0
04 Apr 2020
Conditional Convolutions for Instance Segmentation
Zhi Tian
Chunhua Shen
Hao Chen
ISeg
187
597
0
12 Mar 2020
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,225
0
16 Nov 2016
Previous
1
2
3
4