ResearchTrend.AI
arXiv:1902.09630

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Deyuan Li
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,104 papers shown
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
Jin Yang
Ping Wei
Huan Li
Ziyang Ren
92
12
0
14 Apr 2024
Tri-modal Confluence with Temporal Dynamics for Scene Graph Generation in Operating Rooms
Diandian Guo
Manxi Lin
Jialun Pei
He Tang
Yueming Jin
Pheng-Ann Heng
72
2
0
14 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLM
ObjD
96
26
0
14 Apr 2024
SFSORT: Scene Features-based Simple Online Real-Time Tracker
M. M. Morsali
Z. Sharifi
F. Fallah
S. Hashembeiki
H. Mohammadzade
S. B. Shouraki
VOT
89
3
0
11 Apr 2024
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
Lifan Jiang
Zhihui Wang
Changmiao Wang
Ming Li
Jiaxu Leng
DiffM
35
0
0
11 Apr 2024
MedRG: Medical Report Grounding with Multi-modal Large Language Model
K. Zou
Yang Bai
Zhihao Chen
Yang Zhou
Yidi Chen
Kai Ren
Meng Wang
Xuedong Yuan
Xiaojing Shen
Huazhu Fu
MedIm
99
4
0
10 Apr 2024
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
Chenguang Liu
Guangshuai Gao
Ziyue Huang
Zhenghui Hu
Qingjie Liu
Yunhong Wang
ObjD
111
20
0
09 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
121
2
0
06 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
104
19
0
04 Apr 2024
UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Tiantian Geng
Teng Wang
Yanfu Zhang
Jinming Duan
Weili Guan
Feng Zheng
84
2
0
04 Apr 2024
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression
Ho-Joong Kim
Jung-Ho Hong
Heejo Kong
Seong-Whan Lee
78
5
0
03 Apr 2024
EGTR: Extracting Graph from Transformer for Scene Graph Generation
Jinbae Im
Jeongyeon Nam
Nokyung Park
Hyungmin Lee
Seunghyun Park
ViT
152
23
0
02 Apr 2024
Red-Teaming Segment Anything Model
K. Jankowski
Bartlomiej Sobieski
Mateusz Kwiatkowski
J. Szulc
Michael F. Janik
Hubert Baniecki
P. Biecek
VLM
AAML
75
3
0
02 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
90
12
0
02 Apr 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Peng Gao
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Hongsheng Li
VLM
184
47
0
29 Mar 2024
ENet-21: An Optimized light CNN Structure for Lane Detection
Seyed Rasoul Hosseini
Mohammad Teshnehlab
105
3
0
28 Mar 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPC
ObjD
102
8
0
28 Mar 2024
Infrared Small Target Detection with Scale and Location Sensitivity
Qiankun Liu
Rui Liu
Bolun Zheng
Hongkui Wang
Ying Fu
131
40
0
28 Mar 2024
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Qingping Sun
Yanjun Wang
Ailing Zeng
Wanqi Yin
Chen Wei
...
Haiyi Mei
Chi Sing Leung
Ziwei Liu
Lei Yang
Zhongang Cai
3DH
95
20
0
26 Mar 2024
Exploring Dynamic Transformer for Efficient Object Tracking
Jiawen Zhu
Xin Chen
Haiwen Diao
Shuai Li
Jun-Yan He
Chenyang Li
Bin Luo
Dong Wang
Huchuan Lu
146
3
0
26 Mar 2024
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
192
16
0
25 Mar 2024
Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations
Ruige Zong
Tao Wang
Chunwang Li
Xinlin Zhang
Yuanbin Chen
...
Qixuan Li
Qinquan Gao
Dezhi Kang
Fuxin Lin
Tong Tong
118
0
0
23 Mar 2024
An In-Depth Analysis of Data Reduction Methods for Sustainable Deep Learning
Víctor Toscano-Durán
Javier Perera-Lago
Eduardo Paluzo-Hidalgo
Rocio Gonzalez-Diaz
Miguel A. Gutiérrez-Naranjo
Matteo Rucco
78
1
0
22 Mar 2024
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
129
17
0
22 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
114
43
0
21 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
91
30
0
21 Mar 2024
Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition
Sumin Lee
Yooseung Wang
Sangmin Woo
Changick Kim
67
0
0
21 Mar 2024
MaskSAM: Towards Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation
Bin Xie
Hao Tang
Bin Duan
Dawen Cai
Yan Yan
Gady Agam
VLM
MedIm
81
0
0
21 Mar 2024
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments
Yang Yang
Wenhai Wang
Zhe Chen
Jifeng Dai
Liang Zheng
92
3
0
20 Mar 2024
Fast-Poly: A Fast Polyhedral Framework For 3D Multi-Object Tracking
Xiaoyu Li
Dedong Liu
Lijun Zhao
Yitao Wu
Xian Wu
Jinghan Gao
3DPC
113
8
0
20 Mar 2024
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
Yuxuan Luo
Zekun Wu
Zhouhui Lian
95
0
0
20 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
110
8
0
18 Mar 2024
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
Chaolei Tan
Jian-Huang Lai
Wei-Shi Zheng
Jianfang Hu
AI4TS
128
5
0
18 Mar 2024
Cannabis Seed Variant Detection using Faster R-CNN
Toqi Tahamid Sarker
Taminul Islam
Khaled R Ahmed
67
2
0
15 Mar 2024
Autoregressive Queries for Adaptive Tracking with Spatio-Temporal Transformers
Jinxia Xie
Bineng Zhong
Zhiyi Mo
Shengping Zhang
Liangtao Shi
Shuxiang Song
Rongrong Ji
95
42
0
15 Mar 2024
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning
Lingyi Hong
Shilin Yan
Renrui Zhang
Wanyun Li
Xinyu Zhou
...
Kaixun Jiang
Yiting Chen
Jinglun Li
Zhaoyu Chen
Wenqiang Zhang
VLM
82
51
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained Detectors
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
79
0
0
14 Mar 2024
MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion
Arul Selvam Periyasamy
Sven Behnke
3DPC
68
0
0
14 Mar 2024
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
Anushri Dixit
Zhiting Mei
Meghan Booker
Mariko Storey-Matsutani
Allen Z. Ren
Ola Shorinwa
Anirudha Majumdar
225
7
0
13 Mar 2024
Learning Data Association for Multi-Object Tracking using Only Coordinates
Mehdi Miah
Guillaume-Alexandre Bilodeau
Nicolas Saunier
VOT
113
1
0
12 Mar 2024
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies
Nieves Crasto
ObjD
99
6
0
11 Mar 2024
Ada-Tracker: Soft Tissue Tracking via Inter-Frame and Adaptive-Template Matching
Jiaxin Guo
Jiangliu Wang
Zhaoshuo Li
Tongyu Jia
Qi Dou
Yun-Hui Liu
MedIm
113
2
0
11 Mar 2024
Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline
Tianlin Li
Ju Huang
Shiao Wang
Chuanming Tang
Bowei Jiang
Yonghong Tian
Jin Tang
Bin Luo
98
10
0
09 Mar 2024
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
Thang M. Pham
Peijie Chen
Tin Nguyen
Seunghyun Yoon
Trung Bui
Anh Totti Nguyen
VLM
115
9
0
08 Mar 2024
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance
Liting Lin
Heng Fan
Zhipeng Zhang
Yaowei Wang
Yong-mei Xu
Haibin Ling
129
35
0
08 Mar 2024
Multi-step Temporal Modeling for UAV Tracking
Xiaoying Yuan
Tingfa Xu
Xincong Liu
Ying Wang
Haolin Qin
Yuqiang Fang
Jianan Li
89
7
0
07 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Chak Tou Leong
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
87
8
0
07 Mar 2024
AO-DETR: Anti-Overlapping DETR for X-Ray Prohibited Items Detection
Mingyuan Li
Tong Jia
Hao Wang
Bowen Ma
Shuyang Lin
Da Cai
Dongyue Chen
ViT
122
21
0
07 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
102
81
0
04 Mar 2024
FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs
Xuannan Liu
Peipei Li
Huaibo Huang
Zekun Li
Xing Cui
Jiahao Liang
Lixiong Qin
Weihong Deng
Zhaofeng He
60
3
0
04 Mar 2024