End-to-End Object Detection with Transformers

26 May 2020

Papers citing "End-to-End Object Detection with Transformers"

50 / 5,307 papers shown

tSF: Transformer-based Semantic Filter for Few-Shot Learning Jinxiang Lai Siqian Yang Wenlong Liu Yi Zeng Zhongyi Huang Wenlong Wu Jun Liu Bin-Bin Gao Chengjie Wang VLM 28 19 0 02 Nov 2022
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual Anomaly Detector Haiming Yao Xue Wang Wenyong Yu 48 9 0 01 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection Hongxiang Jiang Wenming Meng Hongmei Zhu Qiaosheng Zhang Jihao Yin 45 4 0 31 Oct 2022
LAD-RCNN: A Powerful Tool for Livestock Face Detection and Normalization Lin Sun Guiqiong Liu Xunping Jiang Jing Liu X. Wang Hang Yang Shiping Yang CVBM 30 6 0 31 Oct 2022
Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection Shan Zhang Naila Murray Lei Wang Piotr Koniusz ViT 45 16 0 30 Oct 2022
Relative Attention-based One-Class Adversarial Autoencoder for Continuous Authentication of Smartphone Users Mingming Hu Kun Zhang Ruibang You Bibo Tu AAML 35 1 0 30 Oct 2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation Ç. S. Çoban Oguzhan Keskin Jordi Pont-Tuset Fatma Guney VOS 40 0 0 30 Oct 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training M. Iranmanesh Xiaotong Chen Kuo-Chin Lien ViT 32 0 0 29 Oct 2022
ImplantFormer: Vision Transformer based Implant Position Regression Using Dental CBCT Data Xinquan Yang Xuguang Li Xuechen Li Pei-Yao Wu Linlin Shen Yongqiang Deng MedIm 52 8 0 29 Oct 2022
Grafting Vision Transformers Jong Sung Park Kumara Kahatapitiya Donghyun Kim Shivchander Sudalairaj Quanfu Fan Michael S. Ryoo ViT 29 3 0 28 Oct 2022
VLT: Vision-Language Transformer and Query Generation for Referring Segmentation Henghui Ding Chang Liu Suchen Wang Xudong Jiang 86 116 0 28 Oct 2022
Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding Mahdi Bonyani Maryam Soleymani 29 0 0 27 Oct 2022
Li3DeTr: A LiDAR based 3D Detection Transformer Gopi Krishna Erabati Helder Araújo ViT 3DPC 117 14 0 27 Oct 2022
MSF3DDETR: Multi-Sensor Fusion 3D Detection Transformer for Autonomous Driving Gopi Krishna Erabati Helder Araújo ViT 3DPC 29 3 0 27 Oct 2022
The 1st-place Solution for ECCV 2022 Multiple People Tracking in Group Dance Challenge Yuang Zhang Tiancai Wang Weiyao Lin Xiangyu Zhang 13 0 0 27 Oct 2022
Visual Semantic Parsing: From Images to Abstract Meaning Representation M. A. Abdelsalam Zhan Shi Federico Fancellu Kalliopi Basioti Dhaivat Bhatt Vladimir Pavlovic Afsaneh Fazly GNN 65 4 0 26 Oct 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design Hanxue Liang Zhiwen Fan Rishov Sarkar Ziyu Jiang Tianlong Chen Kai Zou Yu Cheng Cong Hao Zhangyang Wang MoE 44 82 0 26 Oct 2022
End-to-end Tracking with a Multi-query Transformer Bruno Korbar Andrew Zisserman VOT 32 6 0 26 Oct 2022
Discovering Design Concepts for CAD Sketches Yuezhi Yang Hao Pan 19 12 0 26 Oct 2022
Can Transformer Attention Spread Give Insights Into Uncertainty of Detected and Tracked Objects? Felicia Ruppel F. Faion Claudius Gläser Klaus C. J. Dietmayer 21 0 0 26 Oct 2022
Cross-View Image Sequence Geo-localization Xiaohan Zhang Waqas Sultani S. Wshah 29 22 0 25 Oct 2022
Refining Action Boundaries for One-stage Detection Hanyuan Wang Majid Mirmehdi Dima Damen Toby Perrett ObjD 37 1 0 25 Oct 2022
Search for Concepts: Discovering Visual Concepts Using Direct Optimization P. Reddy Paul Guerrero Niloy J. Mitra OCL 26 4 0 25 Oct 2022
Pointly-Supervised Panoptic Segmentation Junsong Fan Zhaoxiang Zhang Tieniu Tan 40 23 0 25 Oct 2022
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images Pierre Le Jeune Anissa Zergaïnoh-Mokraoui ObjD 49 3 0 25 Oct 2022
End-to-end Transformer for Compressed Video Quality Enhancement Li Yu Wenshuai Chang Shiyu Wu Moncef Gabbouj ViT 28 8 0 25 Oct 2022
Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations Amit Galor Roy Orfaig B. Bobrovsky VOT 50 6 0 24 Oct 2022
Video based Object 6D Pose Estimation using Transformers Apoorva Beedu Huda AlAmri Irfan Essa ViT 24 8 0 24 Oct 2022
MetaFormer Baselines for Vision Weihao Yu Chenyang Si Pan Zhou Mi Luo Yichen Zhou Jiashi Feng Shuicheng Yan Xinchao Wang MoE 45 162 0 24 Oct 2022
Towards Unifying Reference Expression Generation and Comprehension Duo Zheng Tao Kong Ya Jing Jiaan Wang Xiaojie Wang ObjD 35 6 0 24 Oct 2022
Iterative Patch Selection for High-Resolution Image Recognition Benjamin Bergner C. Lippert Aravindh Mahendran 36 13 0 24 Oct 2022
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning Nan Xue Tianfu Wu Song Bai Fu-Dong Wang Gui-Song Xia Lefei Zhang Philip Torr 14 23 0 24 Oct 2022
BARS: A Benchmark for Airport Runway Segmentation Wenhui Chen Zhijiang Zhang Liang Yu Yichun Tai 19 11 0 24 Oct 2022
LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers Zhuo Huang Zhiyou Zhao Banghuai Li Jungong Han 3DPC ViT 42 55 0 23 Oct 2022
Extending Phrase Grounding with Pronouns in Visual Dialogues Panzhong Lu Xin Zhang Meishan Zhang Min Zhang ObjD 37 4 0 23 Oct 2022
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data Yangfan Zhan Zhitong Xiong Yuan Yuan 78 110 0 23 Oct 2022
Spectrum-BERT: Pre-training of Deep Bidirectional Transformers for Spectral Classification of Chinese Liquors Yansong Wang Yundong Sun Yan-Jiao Fu Dongjie Zhu Zhaoshuo Tian 32 6 0 22 Oct 2022
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention Chi Zhang Lu Zhou Lei Wang Zaiyan Dai Jun Yang ViT 47 24 0 22 Oct 2022
Instance-Aware Image Completion Ji-Ho Cho Minguk Kang Vibhav Vineet Jaesik Park ISeg VLM 28 2 0 22 Oct 2022
Face Pyramid Vision Transformer Khawar Islam M. Zaheer Arif Mahmood ViT CVBM 29 4 0 21 Oct 2022
Automatic Cattle Identification using YOLOv5 and Mosaic Augmentation: A Comparative Analysis Rabindra Dulal Lihong Zheng M. A. Kabir S. McGrath J. Medway D. Swain Will Swain 28 18 0 21 Oct 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding Yuechen Wang Wen-gang Zhou Houqiang Li AI4TS 24 12 0 21 Oct 2022
Boosting vision transformers for image retrieval Chull Hwan Song Jooyoung Yoon Shunghyun Choi Yannis Avrithis ViT 47 33 0 21 Oct 2022
3D Human Pose Estimation in Multi-View Operating Room Videos Using Differentiable Camera Projections Beerend G. A. Gerats J. Wolterink I. A. Broeders 3DH 35 11 0 21 Oct 2022
AROS: Affordance Recognition with One-Shot Human Stances Abel Pacheco-Ortega W. Mayol-Cuevas 3DH 47 0 0 21 Oct 2022
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers Pedro Castro Tae-Kyun Kim 40 30 0 21 Oct 2022
Rethinking Learning Approaches for Long-Term Action Anticipation Megha Nawhal Akash Abdu Jyothi Greg Mori AI4TS 39 27 0 20 Oct 2022
Transformer-based Action recognition in hand-object interacting scenarios Hoseong Cho Seungryul Baek EgoV 42 2 0 20 Oct 2022
Transformer-based Global 3D Hand Pose Estimation in Two Hands Manipulating Objects Scenarios Hoseong Cho Donguk Kim Chanwoo Kim Seongyeong Lee Seungryul Baek 34 1 0 20 Oct 2022
Large-batch Optimization for Dense Visual Predictions Zeyue Xue Jianming Liang Guanglu Song Zhuofan Zong Liang Chen Yu Liu Ping Luo VLM 52 9 0 20 Oct 2022