Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.09630
Cited By
v1
v2 (latest)
Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"
50 / 1,104 papers shown
Title
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
458
1
0
19 Sep 2024
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
Zimeng Fang
Chao Liang
Xue Zhou
Shuyuan Zhu
Xi Li
77
2
0
14 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
161
2
0
12 Sep 2024
Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts
Assefa Seyoum Wahd
B. Felfeliyan
Yuyue Zhou
Shrimanti Ghosh
Adam McArthur
Jiechen Zhang
Jacob L. Jaremko
A. Hareendranathan
VLM
MedIm
90
1
0
10 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
105
0
0
09 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
80
0
0
06 Sep 2024
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
Md. Mahfuzur Rahman
Sunzida Siddique
Marufa Kamal
Rakib Hossain Rifat
Kishor Datta Gupta
AI4TS
86
1
0
05 Sep 2024
A Modern Take on Visual Relationship Reasoning for Grasp Planning
Paolo Rabino
Tatiana Tommasi
79
1
0
03 Sep 2024
TrackSSM: A General Motion Predictor by State-Space Model
Bin Hu
Run Luo
Zelin Liu
Cheng Wang
Wenyu Liu
145
3
0
31 Aug 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
90
0
0
30 Aug 2024
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection
Yanquan Huang
Liu Wei Zhen
Yun Hao
Mengyuan Zhang
Qingyao Wu
Zikun Deng
Xueming Liu
Hong Deng
75
0
0
30 Aug 2024
UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo
Felix Sattler
Antje Alex
Alexander Klein
Bruno Pereira Costa
Ángel Bueno Rodríguez
Jannis Stoppe
VOT
110
1
0
30 Aug 2024
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
Minghang Zheng
Jiahua Zhang
Qingchao Chen
Yuxin Peng
Yang Liu
ObjD
96
2
0
29 Aug 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
66
3
0
29 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
138
3
0
27 Aug 2024
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by Probability Distribution Learning and Interval Cluster Refinement
Fengshun Wang
Qiurui Wang
Yuting Wang
80
0
0
25 Aug 2024
MCTR: Multi Camera Tracking Transformer
Alexandru Niculescu-Mizil
Deep Patel
Iain Melvin
129
0
0
23 Aug 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Wenjing Bian
Zirui Wang
Andrea Vedaldi
96
1
0
22 Aug 2024
BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking
Hanzheng Wang
Wei Li
X. Xia
Qian Du
139
2
0
22 Aug 2024
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System
Shuo Wang
Yongcai Wang
Zhimin Xu
Yongyu Guo
Wanting Li
Zhe Huang
Xuewei Bai
Deying Li
VOT
65
2
0
17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
136
9
0
17 Aug 2024
Language-Driven Interactive Shadow Detection
Hongqiu Wang
Wei Wang
Haipeng Zhou
Huihui Xu
Shaozhi Wu
Lei Zhu
86
8
0
16 Aug 2024
RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking
Song Guo
Rujie Liu
N. Abe
VOT
78
0
0
14 Aug 2024
Unified-IoU: For High-Quality Object Detection
Xiangjie Luo
Zhihao Cai
Bo Shao
Yingxun Wang
NoLa
81
2
0
13 Aug 2024
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
Long Bai
Guankun Wang
Mobarakol Islam
Lalithkumar Seenivasan
An-Chi Wang
Hongliang Ren
104
17
0
09 Aug 2024
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context Relation Modeling
Seok Hwan Lee
Taein Son
Soo Won Seo
Jisong Kim
Jun Won Choi
96
0
0
07 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Chaolei Tan
Zihang Lin
Junfu Pu
Zhongang Qi
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
88
0
0
03 Aug 2024
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization
Vinaya Sree Katamneni
A. Rattani
76
4
0
02 Aug 2024
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
Wei Chen
Mahdieh Hatamian
Yu Wu
102
5
0
02 Aug 2024
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function
Matias Oscar Volman Stern
Dominic Hohs
Markos Diomataris
Michael J. Black
Gerhard Schneider
DiffM
69
0
0
01 Aug 2024
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
109
0
0
29 Jul 2024
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal
Seoyoung Ahn
Zhibo Yang
Niranjan Balasubramanian
Dimitris Samaras
G. Zelinsky
Minh Hoai
90
2
0
28 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
76
3
0
23 Jul 2024
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Yiyang Jiang
Wengyu Zhang
Xu-Lu Zhang
Xiaoyong Wei
Chang Wen Chen
Qing Li
77
4
0
21 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
48
4
0
20 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
62
2
0
19 Jul 2024
Improving Representation of High-frequency Components for Medical Visual Foundation Models
Yuetan Chu
Yilan Zhang
Zhongyi Han
Changchun Yang
Longxi Zhou
Gongning Luo
Chao Huang
Xin Gao
MedIm
136
1
0
19 Jul 2024
Temporally Grounding Instructional Diagrams in Unconstrained Videos
Jiahao Zhang
Frederic Z. Zhang
Cristian Rodriguez
Yizhak Ben-Shabat
A. Cherian
Stephen Gould
103
2
0
16 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
80
6
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjD
VLM
89
2
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
127
1
0
13 Jul 2024
Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets
L. Ma
Tran Thien Dat Nguyen
Changbeom Shim
Du Yong Kim
Namkoo Ha
Moongu Jeon
VOT
111
14
0
11 Jul 2024
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan
Olga Isupova
Gustavo Carneiro
Xiatian Zhu
Yunpeng Li
ObjD
84
0
0
10 Jul 2024
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher
Jiangming Chen
Li Liu
Wanxia Deng
Zhen Liu
Yu Liu
Yingmei Wei
Yongxiang Liu
94
0
0
10 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
79
2
0
10 Jul 2024
Described Spatial-Temporal Video Detection
Wei Ji
Xiangyan Liu
Yingfei Sun
Jiajun Deng
You Qin
Ammar Nuwanna
Mengyao Qiu
Lina Wei
Roger Zimmermann
108
2
0
08 Jul 2024
Towards Reflected Object Detection: A Benchmark
Yiquan Wu
Zhongtian Wang
You Wu
Ling Huang
Hui Zhou
Shuiwang Li
ObjD
66
2
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
96
7
0
07 Jul 2024
Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking
You Wu
Xucheng Wang
Dan Zeng
Hengzhou Ye
Xiaolan Xie
Qijun Zhao
Shuiwang Li
94
3
0
07 Jul 2024
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Zhipeng Qian
Yiwei Ma
Zhekai Lin
Jiayi Ji
Xiawu Zheng
Xiaoshuai Sun
Rongrong Ji
3DV
115
7
0
07 Jul 2024
Previous
1
2
3
4
5
...
21
22
23
Next