Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Efficient Motion Prompt Learning for Robust Visual Tracking
Jie Zhao
Xin Chen
Yongsheng Yuan
Michael Felsberg
Dong Wang
Huchuan Lu
10
0
0
22 May 2025
Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking
Shiyu Xuan
Zechao Li
Jinhui Tang
13
0
0
19 May 2025
Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges
Yuan Zhang
Xinfeng Zhang
Xiaoming Qi
Xinyu Wu
Feng Chen
Guanyu Yang
Huazhu Fu
MedIm
LM&MA
AI4CE
33
0
0
16 May 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Yu Cheng
A. Goel
Hakan Bilen
LRM
36
0
0
12 May 2025
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer
Ho-Joong Kim
Y. E. Lee
Jung-Ho Hong
Seong-Whan Lee
58
0
0
09 May 2025
CGTrack: Cascade Gating Network with Hierarchical Feature Aggregation for UAV Tracking
Weihong Li
Xiaoqiong Liu
Heng Fan
L. Zhang
31
0
0
09 May 2025
RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet
Eliraz Orfaig
Inna Stainvas
Igal Bilik
29
0
0
05 May 2025
Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging
Elena Mulero Ayllón
Massimiliano Mantegna
Linlin Shen
Paolo Soda
V. Guarrasi
M. Tortora
54
0
0
02 May 2025
Efficient Vision-based Vehicle Speed Estimation
Andrej Macko
Lukás Gajdosech
Viktor Kocur
247
0
0
02 May 2025
Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration
Juhan Park
Kyungjae Lee
Hyung Jin Chang
Jungchan Cho
VLM
66
0
0
28 Apr 2025
Improving Open-World Object Localization by Discovering Background
Ashish Singh
Michael Jeffrey Jones
Kuan-Chuan Peng
A. Cherian
Moitreya Chatterjee
Erik Learned-Miller
ObjD
OCL
VLM
66
0
0
24 Apr 2025
Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes
Duy-Tho Le
Trung Pham
Jianfei Cai
H. Rezatofighi
34
0
0
23 Apr 2025
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
Jingchao Wang
Hong Wang
Wenlong Zhang
Kunhua Ji
Dingjiang Huang
Yefeng Zheng
ObjD
50
0
0
22 Apr 2025
EIoU-EMC: A Novel Loss for Domain-specific Nested Entity Recognition
Jian Zhang
Tianqing Zhang
Qi Li
Hongwei Wang
29
0
0
19 Apr 2025
FocusTrack: A Self-Adaptive Local Sampling Algorithm for Efficient Anti-UAV Tracking
Ying Wang
Tingfa Xu
Jianan Li
37
0
0
18 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
74
0
0
17 Apr 2025
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Yaqian Ning
Tianze Zhang
Yin Zhuang
He Chen
Jun Li
Xuerui Mao
43
0
0
17 Apr 2025
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes
Andreas Lau Hansen
Lukas Wanzeck
Dim P. Papadopoulos
36
0
0
17 Apr 2025
Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking
You Wu
Xucheng Wang
Xiangyang Yang
Mengyuan Liu
Dan Zeng
Hengzhou Ye
Shuiwang Li
36
0
0
12 Apr 2025
Light-YOLOv8-Flame: A Lightweight High-Performance Flame Detection Algorithm
Jiawei Lan
Zhibiao Wang
Haoyang Yu
Ye Tao
Wenhua Cui
41
0
0
11 Apr 2025
Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding
Dibyadip Chatterjee
Edoardo Remelli
Yale Song
Bugra Tekin
Abhay Mittal
...
Shreyas Hampali
Eric Sauser
Shugao Ma
Angela Yao
Fadime Sener
VLM
51
0
0
10 Apr 2025
End-to-End Facial Expression Detection in Long Videos
Yini Fang
Alec Diallo
Yiqi Shi
F. Jumelle
Bertram Shi
CVBM
29
0
0
10 Apr 2025
Few-Shot Adaptation of Grounding DINO for Agricultural Domain
Rajhans Singh
Rafael Bidese Puhl
Kshitiz Dhakal
Sudhir Sornapudi
33
0
0
09 Apr 2025
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
734
0
0
09 Apr 2025
PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario
Sriram Mandalika
Lalitha V
Athira Nambiar
41
1
0
08 Apr 2025
Mathematical Modeling of Option Pricing with an Extended Black-Scholes Framework
Nikhil Shivakumar Nayak
57
0
0
04 Apr 2025
COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking
Chunhui Zhang
Li Liu
Jialin Gao
Xin Sun
Hao Wen
Xi Zhou
Shiming Ge
Yucheng Wang
44
1
0
02 Apr 2025
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
Hang Zhou
Wei Ji
Rui Ma
Li Cheng
ViT
47
0
0
27 Mar 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
58
0
0
27 Mar 2025
RelTriple: Learning Plausible Indoor Layouts by Integrating Relationship Triples into the Diffusion Process
Kaifan Sun
Bingchen Yang
Peter Wonka
Jun Xiao
Haiyong Jiang
80
0
0
26 Mar 2025
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
Wenrui Cai
Qingjie Liu
Yansen Wang
MoE
70
0
0
24 Mar 2025
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation
Chengbo Yuan
Suraj Joshi
Shaoting Zhu
Hang Su
Hang Zhao
Yang Gao
VGen
53
4
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Yao Hu
Yongchao Xu
ObjD
57
0
0
24 Mar 2025
MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking
Haolin Qin
Tingfa Xu
Tianhao Li
Zhenxiang Chen
Tao Feng
Jia-Nan Li
37
0
0
22 Mar 2025
Temporal Action Detection Model Compression by Progressive Block Drop
Xiaoyong Chen
Yong Guo
Jiaming Liang
Sitong Zhuang
Runhao Zeng
Xiping Hu
55
0
0
21 Mar 2025
STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans
Shashikant Verma
Harish Katti
Soumyaratna Debnath
Yamuna Swamy
Shanmuganathan Raman
264
0
0
17 Mar 2025
Ship Detection in Remote Sensing Imagery for Arbitrarily Oriented Object Detection
Bibi Erum Ayesha
T. Satyanarayana Murthy
Palamakula Ramesh Babu
Ramu Kuchipudi
50
1
0
17 Mar 2025
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory
Saket Gurukar
Asim Kadav
VLM
60
0
0
17 Mar 2025
Towards General Multimodal Visual Tracking
Andong Lu
Mai Wen
Jinhu Wang
Yuanzhi Guo
Chenglong Li
Jin Tang
Bin Luo
41
0
0
14 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
245
0
0
14 Mar 2025
Large-scale Pre-training for Grounded Video Caption Generation
Evangelos Kazakos
Cordelia Schmid
Josef Sivic
64
0
0
13 Mar 2025
SDTrack: A Baseline for Event-based Tracking via Spiking Neural Networks
Yimeng Shan
Zhenbang Ren
Haodi Wu
Wenjie Wei
Rui-jie Zhu
...
Jason K. Eshraghian
Haicheng Qu
Jing Zhang
Malu Zhang
Yiran Yang
52
0
0
09 Mar 2025
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
Chaocan Xue
Bineng Zhong
Qihua Liang
Yaozong Zheng
Ning Li
Yuanliang Xue
Shuxiang Song
46
0
0
09 Mar 2025
Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection
Chandan Kumar Sah
Ankit Kumar Shaw
Xiaoli Lian
Arsalan Shahid Baig
Tuopu Wen
Kun Jiang
Mengmeng Yang
Ke Wang
41
1
0
08 Mar 2025
Detection of Customer Interested Garments in Surveillance Video using Computer Vision
Earnest Paul Ijjina
A. Joshi
Goutham Kanahasabai
34
0
0
01 Mar 2025
A Guide to Failure in Machine Learning: Reliability and Robustness from Foundations to Practice
Eric Heim
Oren Wright
David Shriver
OOD
FaML
73
0
0
01 Mar 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
Jiawen Zhu
Huayi Tang
Xin Chen
Xinying Wang
Dong Wang
Huchuan Lu
55
2
0
01 Mar 2025
MITracker: Multi-View Integration for Visual Object Tracking
Mengjie Xu
Yitao Zhu
Haotian Jiang
Jiaming Li
Zhenrong Shen
...
Haolin Huang
Xinyu Wang
Qing Yang
H. Zhang
Qian Wang
50
0
0
27 Feb 2025
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
70
3
0
27 Feb 2025
LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking
Yunfeng Li
Bo Wang
Ye Li
70
0
0
25 Feb 2025