ResearchTrend.AI

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,104 papers shown
Title
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
458
1
0
19 Sep 2024
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
Zimeng Fang
Chao Liang
Xue Zhou
Shuyuan Zhu
Xi Li
77
2
0
14 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
161
2
0
12 Sep 2024
Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts
Assefa Seyoum Wahd
B. Felfeliyan
Yuyue Zhou
Shrimanti Ghosh
Adam McArthur
Jiechen Zhang
Jacob L. Jaremko
A. Hareendranathan
VLM
MedIm
90
1
0
10 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
105
0
0
09 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
80
0
0
06 Sep 2024
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
Md. Mahfuzur Rahman
Sunzida Siddique
Marufa Kamal
Rakib Hossain Rifat
Kishor Datta Gupta
AI4TS
86
1
0
05 Sep 2024
A Modern Take on Visual Relationship Reasoning for Grasp Planning
Paolo Rabino
Tatiana Tommasi
79
1
0
03 Sep 2024
TrackSSM: A General Motion Predictor by State-Space Model
Bin Hu
Run Luo
Zelin Liu
Cheng Wang
Wenyu Liu
145
3
0
31 Aug 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
90
0
0
30 Aug 2024
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection
Yanquan Huang
Liu Wei Zhen
Yun Hao
Mengyuan Zhang
Qingyao Wu
Zikun Deng
Xueming Liu
Hong Deng
75
0
0
30 Aug 2024
UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo
Felix Sattler
Antje Alex
Alexander Klein
Bruno Pereira Costa
Ángel Bueno Rodríguez
Jannis Stoppe
VOT
110
1
0
30 Aug 2024
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
Minghang Zheng
Jiahua Zhang
Qingchao Chen
Yuxin Peng
Yang Liu
ObjD
96
2
0
29 Aug 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
66
3
0
29 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
138
3
0
27 Aug 2024
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by Probability Distribution Learning and Interval Cluster Refinement
Fengshun Wang
Qiurui Wang
Yuting Wang
80
0
0
25 Aug 2024
MCTR: Multi Camera Tracking Transformer
Alexandru Niculescu-Mizil
Deep Patel
Iain Melvin
129
0
0
23 Aug 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Wenjing Bian
Zirui Wang
Andrea Vedaldi
96
1
0
22 Aug 2024
BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking
Hanzheng Wang
Wei Li
X. Xia
Qian Du
139
2
0
22 Aug 2024
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System
Shuo Wang
Yongcai Wang
Zhimin Xu
Yongyu Guo
Wanting Li
Zhe Huang
Xuewei Bai
Deying Li
VOT
65
2
0
17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
136
9
0
17 Aug 2024
Language-Driven Interactive Shadow Detection
Hongqiu Wang
Wei Wang
Haipeng Zhou
Huihui Xu
Shaozhi Wu
Lei Zhu
86
8
0
16 Aug 2024
RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking
Song Guo
Rujie Liu
N. Abe
VOT
78
0
0
14 Aug 2024
Unified-IoU: For High-Quality Object Detection
Xiangjie Luo
Zhihao Cai
Bo Shao
Yingxun Wang
NoLa
81
2
0
13 Aug 2024
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
Long Bai
Guankun Wang
Mobarakol Islam
Lalithkumar Seenivasan
An-Chi Wang
Hongliang Ren
104
17
0
09 Aug 2024
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context Relation Modeling
Seok Hwan Lee
Taein Son
Soo Won Seo
Jisong Kim
Jun Won Choi
96
0
0
07 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Chaolei Tan
Zihang Lin
Junfu Pu
Zhongang Qi
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
88
0
0
03 Aug 2024
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization
Vinaya Sree Katamneni
A. Rattani
76
4
0
02 Aug 2024
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
Wei Chen
Mahdieh Hatamian
Yu Wu
102
5
0
02 Aug 2024
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function
Matias Oscar Volman Stern
Dominic Hohs
Markos Diomataris
Michael J. Black
Gerhard Schneider
DiffM
69
0
0
01 Aug 2024
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
109
0
0
29 Jul 2024
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal
Seoyoung Ahn
Zhibo Yang
Niranjan Balasubramanian
Dimitris Samaras
G. Zelinsky
Minh Hoai
90
2
0
28 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
76
3
0
23 Jul 2024
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Yiyang Jiang
Wengyu Zhang
Xu-Lu Zhang
Xiaoyong Wei
Chang Wen Chen
Qing Li
77
4
0
21 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
48
4
0
20 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
62
2
0
19 Jul 2024
Improving Representation of High-frequency Components for Medical Visual Foundation Models
Yuetan Chu
Yilan Zhang
Zhongyi Han
Changchun Yang
Longxi Zhou
Gongning Luo
Chao Huang
Xin Gao
MedIm
136
1
0
19 Jul 2024
Temporally Grounding Instructional Diagrams in Unconstrained Videos
Jiahao Zhang
Frederic Z. Zhang
Cristian Rodriguez
Yizhak Ben-Shabat
A. Cherian
Stephen Gould
103
2
0
16 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
80
6
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjD
VLM
89
2
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
127
1
0
13 Jul 2024
Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets
L. Ma
Tran Thien Dat Nguyen
Changbeom Shim
Du Yong Kim
Namkoo Ha
Moongu Jeon
VOT
111
14
0
11 Jul 2024
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan
Olga Isupova
Gustavo Carneiro
Xiatian Zhu
Yunpeng Li
ObjD
84
0
0
10 Jul 2024
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher
Jiangming Chen
Li Liu
Wanxia Deng
Zhen Liu
Yu Liu
Yingmei Wei
Yongxiang Liu
94
0
0
10 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
79
2
0
10 Jul 2024
Described Spatial-Temporal Video Detection
Wei Ji
Xiangyan Liu
Yingfei Sun
Jiajun Deng
You Qin
Ammar Nuwanna
Mengyao Qiu
Lina Wei
Roger Zimmermann
108
2
0
08 Jul 2024
Towards Reflected Object Detection: A Benchmark
Yiquan Wu
Zhongtian Wang
You Wu
Ling Huang
Hui Zhou
Shuiwang Li
ObjD
66
2
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
96
7
0
07 Jul 2024
Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking
You Wu
Xucheng Wang
Dan Zeng
Hengzhou Ye
Xiaolan Xie
Qijun Zhao
Shuiwang Li
94
3
0
07 Jul 2024
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Zhipeng Qian
Yiwei Ma
Zhekai Lin
Jiayi Ji
Xiawu Zheng
Xiaoshuai Sun
Rongrong Ji
3DV
115
7
0
07 Jul 2024