ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Title
AutoGameUI: Constructing High-Fidelity Game UIs via Multimodal Learning
  and Interactive Web-Based Tool
AutoGameUI: Constructing High-Fidelity Game UIs via Multimodal Learning and Interactive Web-Based Tool
Zhongliang Tang
Mengchen Tan
Fei Xia
Qingrong Cheng
Hao Jiang
Yuyao Zhang
38
0
0
06 Nov 2024
SIRA: Scalable Inter-frame Relation and Association for Radar Perception
SIRA: Scalable Inter-frame Relation and Association for Radar Perception
Ryoma Yataka
Peng Wang
P. Boufounos
R. Takahashi
46
4
0
04 Nov 2024
Polar R-CNN: End-to-End Lane Detection with Fewer Anchors
Polar R-CNN: End-to-End Lane Detection with Fewer Anchors
Shengqi Wang
Junmin Liu
Xiangyong Cao
Zengjie Song
Kai Sun
48
0
0
03 Nov 2024
Is Multiple Object Tracking a Matter of Specialization?
Is Multiple Object Tracking a Matter of Specialization?
G. Mancusi
Mattia Bernardi
Aniello Panariello
Angelo Porrello
Rita Cucchiara
Simone Calderara
MoMe
44
1
0
01 Nov 2024
LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion
  Attention Mechanism YOLO
LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO
Yuchen Zheng
Yuxin Jing
Jufeng Zhao
Guangmang Cui
ObjD
42
0
0
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
50
2
0
31 Oct 2024
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive
  Position Correction for Visual Grounding
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding
Minghong Xie
Ming Wang
Huafeng Li
Yafei Zhang
Dapeng Tao
Z. Yu
ObjD
40
1
0
31 Oct 2024
Unbiased Regression Loss for DETRs
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
39
0
0
30 Oct 2024
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
Ming Kang
F. F. Ting
Raphaël C.-W. Phan
C. Ting
ViT
MedIm
62
1
0
29 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Referring Human Pose and Mask Estimation in the Wild
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Ajmal Mian
31
0
0
27 Oct 2024
AlphaChimp: Tracking and Behavior Recognition of Chimpanzees
AlphaChimp: Tracking and Behavior Recognition of Chimpanzees
Xiaoxuan Ma
Yutang Lin
Yuan Xu
Stephan P. Kaufhold
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
41
0
0
22 Oct 2024
Sketch2Code: Evaluating Vision-Language Models for Interactive Web
  Design Prototyping
Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping
Ryan Li
Yanzhe Zhang
Diyi Yang
3DV
24
4
0
21 Oct 2024
A Paradigm Shift in Mouza Map Vectorization: A Human-Machine
  Collaboration Approach
A Paradigm Shift in Mouza Map Vectorization: A Human-Machine Collaboration Approach
Mahir Shahriar Dhrubo
Samira Akter
Anwarul Bashir Shuaib
Md Toki Tahmid
Zahid Hasan
A. B. M. Alim Al Islam
25
0
0
21 Oct 2024
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
Jiayi Liu
Denys Iliash
Angel X. Chang
Manolis Savva
Ali Mahdavi-Amiri
65
8
0
21 Oct 2024
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
Ning Wang
Yun Xiao
Xiaopeng Peng
Xiaojun Chang
Xuanhong Wang
Dingyi Fang
36
2
0
20 Oct 2024
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text
  Understanding
Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Jongbhin Woo
H. Ryu
Youngjoon Jang
Jae-Won Cho
Joon Son Chung
35
1
0
17 Oct 2024
VividMed: Vision Language Model with Versatile Visual Grounding for
  Medicine
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
Lingxiao Luo
Bingda Tang
Xuanzhong Chen
Rong Han
Ting Chen
VLM
37
3
0
16 Oct 2024
Multiview Scene Graph
Multiview Scene Graph
Juexiao Zhang
Gao Zhu
Sihang Li
Xinhao Liu
Haorui Song
Xinran Tang
Chen Feng
3DV
31
1
0
15 Oct 2024
Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked
  Autoencoders
Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders
Yaohua Zha
Tao Dai
Yanzi Wang
Hang Guo
Taolin Zhang
Zhihao Ouyang
Chunlin Fan
Bin Chen
Ke Chen
Shu-Tao Xia
3DPC
35
1
0
13 Oct 2024
Token Pruning using a Lightweight Background Aware Vision Transformer
Token Pruning using a Lightweight Background Aware Vision Transformer
Sudhakar Sah
Ravish Kumar
Honnesh Rohmetra
Ehsan Saboori
ViT
31
1
0
12 Oct 2024
OneRef: Unified One-tower Expression Grounding and Segmentation with
  Mask Referring Modeling
OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Linhui Xiao
Xiaoshan Yang
Fang Peng
Yaowei Wang
Changsheng Xu
ObjD
43
5
0
10 Oct 2024
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Aleksandr Gordeev
Vladimir Dokholyan
Irina Tolstykh
Maksim Kuprashevich
36
4
0
02 Oct 2024
KPCA-CAM: Visual Explainability of Deep Computer Vision Models using
  Kernel PCA
KPCA-CAM: Visual Explainability of Deep Computer Vision Models using Kernel PCA
Sachin Karmani
Thanushon Sivakaran
Gaurav Prasad
Mehmet Ali
Wenbo Yang
Sheyang Tang
FAtt
18
3
0
30 Sep 2024
Improving Visual Object Tracking through Visual Prompting
Improving Visual Object Tracking through Visual Prompting
Shih-Fang Chen
Jun-Cheng Chen
I-Hong Jhuo
Yen-Yu Lin
VLM
38
1
0
27 Sep 2024
SimVG: A Simple Framework for Visual Grounding with Decoupled
  Multi-modal Fusion
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
Ming Dai
Lingfeng Yang
Yihao Xu
Zhenhua Feng
Wankou Yang
ObjD
37
9
0
26 Sep 2024
MorphoSeg: An Uncertainty-Aware Deep Learning Method for Biomedical
  Segmentation of Complex Cellular Morphologies
MorphoSeg: An Uncertainty-Aware Deep Learning Method for Biomedical Segmentation of Complex Cellular Morphologies
Tianhao Zhang
Heather J. McCourty
Berardo M. Sanchez-Tafolla
Anton Nikolaev
Lyudmila Mihaylova
23
0
0
25 Sep 2024
Language-based Audio Moment Retrieval
Language-based Audio Moment Retrieval
Hokuto Munakata
Taichi Nishimura
Shota Nakada
Tatsuya Komatsu
50
1
0
24 Sep 2024
OW-Rep: Open World Object Detection with Instance Representation Learning
OW-Rep: Open World Object Detection with Instance Representation Learning
Sunoh Lee
Minsik Jeon
Jihong Min
Junwon Seo
ObjD
242
0
0
24 Sep 2024
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Bo Yue
Jian Li
Guiliang Liu
36
2
0
24 Sep 2024
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous
  Driving
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving
Xiyang Wang
Shouzheng Qi
Jieyou Zhao
Hangning Zhou
Siyu Zhang
...
Kai Tu
Songlin Guo
Jianbo Zhao
Jian Li
Mu Yang
VOT
50
5
0
23 Sep 2024
Region Prompt Tuning: Fine-grained Scene Text Detection Utilizing Region
  Text Prompt
Region Prompt Tuning: Fine-grained Scene Text Detection Utilizing Region Text Prompt
Xingtao Lin
Heqian Qiu
Lanxiao Wang
RUihang Wang
Linfeng XU
Hongliang Li
VLM
28
0
0
20 Sep 2024
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
230
1
0
19 Sep 2024
Associate Everything Detected: Facilitating Tracking-by-Detection to the
  Unknown
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
Zimeng Fang
Chao Liang
Xue Zhou
Shuyuan Zhu
Xi Li
44
2
0
14 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
45
1
0
12 Sep 2024
Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts
Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts
Assefa Seyoum Wahd
B. Felfeliyan
Yuyue Zhou
Shrimanti Ghosh
Adam McArthur
Jiechen Zhang
Jacob L. Jaremko
A. Hareendranathan
VLM
MedIm
54
1
0
10 Sep 2024
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View
  Perception
Vision-Driven 2D Supervised Fine-Tuning Framework for Bird's Eye View Perception
Lei He
Qiaoyi Wang
Honglin Sun
Qing Xu
Bolin Gao
Shengbo Eben Li
Jianqiang Wang
Keqiang Li
36
0
0
09 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
33
0
0
06 Sep 2024
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in
  Segmentation, Classification, Detection, and Tracking
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking
Md. Mahfuzur Rahman
Sunzida Siddique
Marufa Kamal
Rakib Hossain Rifat
Kishor Datta Gupta
AI4TS
44
0
0
05 Sep 2024
A Modern Take on Visual Relationship Reasoning for Grasp Planning
A Modern Take on Visual Relationship Reasoning for Grasp Planning
Paolo Rabino
Tatiana Tommasi
33
1
0
03 Sep 2024
TrackSSM: A General Motion Predictor by State-Space Model
TrackSSM: A General Motion Predictor by State-Space Model
Bin Hu
Run Luo
Zelin Liu
Cheng Wang
Wenyu Liu
42
2
0
31 Aug 2024
Unintentional Security Flaws in Code: Automated Defense via Root Cause
  Analysis
Unintentional Security Flaws in Code: Automated Defense via Root Cause Analysis
Nafis Tanveer Islam
Mazal Bethany
Dylan Manuel
Murtuza Jadliwala
Peyman Najafirad
41
0
0
30 Aug 2024
Hybrid Classification-Regression Adaptive Loss for Dense Object
  Detection
Hybrid Classification-Regression Adaptive Loss for Dense Object Detection
Yanquan Huang
Liu Wei Zhen
Yun Hao
Mengyuan Zhang
Qingyao Wu
Zikun Deng
Xueming Liu
Hong Deng
35
0
0
30 Aug 2024
UTrack: Multi-Object Tracking with Uncertain Detections
UTrack: Multi-Object Tracking with Uncertain Detections
Edgardo Solano-Carrillo
Felix Sattler
Antje Alex
Alexander Klein
Bruno Pereira Costa
Ángel Bueno Rodríguez
Jannis Stoppe
VOT
44
1
0
30 Aug 2024
ResVG: Enhancing Relation and Semantic Understanding in Multiple
  Instances for Visual Grounding
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
Minghang Zheng
Jiahua Zhang
Qingchao Chen
Yuxin Peng
Yang Liu
ObjD
44
2
0
29 Aug 2024
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object
  Detection in Bird's-Eye-View
PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View
Zichen Yu
Quanli Liu
Wei Wang
Liyong Zhang
Xiaoguang Zhao
32
1
0
29 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
60
2
0
27 Aug 2024
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by
  Probability Distribution Learning and Interval Cluster Refinement
FMI-TAL: Few-shot Multiple Instances Temporal Action Localization by Probability Distribution Learning and Interval Cluster Refinement
Fengshun Wang
Qiurui Wang
Yuting Wang
33
0
0
25 Aug 2024
MCTR: Multi Camera Tracking Transformer
MCTR: Multi Camera Tracking Transformer
Alexandru Niculescu-Mizil
Deep Patel
Iain Melvin
49
0
0
23 Aug 2024
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Wenjing Bian
Zirui Wang
Andrea Vedaldi
42
1
0
22 Aug 2024
BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking
BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking
Hanzheng Wang
Wei Li
X. Xia
Qian Du
62
1
0
22 Aug 2024
Previous
123456...202122
Next