ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.09630
  4. Cited By
Generalized Intersection over Union: A Metric and A Loss for Bounding
  Box Regression

Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression

25 February 2019
S. Hamid Rezatofighi
Nathan Tsoi
JunYoung Gwak
Amir Sadeghian
Ian Reid
Silvio Savarese
ArXivPDFHTML

Papers citing "Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression"

50 / 1,082 papers shown
Title
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating,
  Mapping, and Multiple Object Tracking System
GSLAMOT: A Tracklet and Query Graph-based Simultaneous Locating, Mapping, and Multiple Object Tracking System
Shuo Wang
Yongcai Wang
Zhimin Xu
Yongyu Guo
Wanting Li
Zhe Huang
Xuewei Bai
Deying Li
VOT
38
2
0
17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
71
7
0
17 Aug 2024
Language-Driven Interactive Shadow Detection
Language-Driven Interactive Shadow Detection
Hongqiu Wang
Wei Wang
Haipeng Zhou
Huihui Xu
Shaozhi Wu
Lei Zhu
44
6
0
16 Aug 2024
RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking
RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking
Song Guo
Rujie Liu
N. Abe
VOT
39
0
0
14 Aug 2024
Unified-IoU: For High-Quality Object Detection
Unified-IoU: For High-Quality Object Detection
Xiangjie Luo
Zhihao Cai
Bo Shao
Yingxun Wang
NoLa
38
0
0
13 Aug 2024
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust
  Visual Question-Localized Answering in Robotic Surgery
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
Long Bai
Guankun Wang
Mobarakol Islam
Lalithkumar Seenivasan
An-Chi Wang
Hongliang Ren
54
13
0
09 Aug 2024
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context
  Relation Modeling
JARViS: Detecting Actions in Video Using Unified Actor-Scene Context Relation Modeling
Seok Hwan Lee
Taein Son
Soo Won Seo
Jisong Kim
Jun Won Choi
52
0
0
07 Aug 2024
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding
  from TV Dramas and Synopses
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Chaolei Tan
Zihang Lin
Junfu Pu
Zhongang Qi
Wei-Yi Pei
Zhi Qu
Yexin Wang
Ying Shan
Wei-Shi Zheng
Jianfang Hu
AI4TS
48
0
0
03 Aug 2024
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and
  Localization
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization
Vinaya Sree Katamneni
A. Rattani
43
4
0
02 Aug 2024
An Efficient and Effective Transformer Decoder-Based Framework for
  Multi-Task Visual Grounding
An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding
Wei Chen
Mahdieh Hatamian
Yu Wu
50
3
0
02 Aug 2024
Synthetic dual image generation for reduction of labeling efforts in
  semantic segmentation of micrographs with a customized metric function
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function
Matias Oscar Volman Stern
Dominic Hohs
Markos Diomataris
Michael J. Black
Gerhard Schneider
DiffM
47
0
0
01 Aug 2024
Classification Matters: Improving Video Action Detection with
  Class-Specific Attention
Classification Matters: Improving Video Action Detection with Class-Specific Attention
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
54
0
0
29 Jul 2024
Look Hear: Gaze Prediction for Speech-directed Human Attention
Look Hear: Gaze Prediction for Speech-directed Human Attention
Sounak Mondal
Seoyoung Ahn
Zhibo Yang
Niranjan Balasubramanian
Dimitris Samaras
G. Zelinsky
Minh Hoai
47
1
0
28 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
41
1
0
23 Jul 2024
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation
  for Video Moment Retrieval
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Yiyang Jiang
Wengyu Zhang
Xu-Lu Zhang
Xiaoyong Wei
Chang Wen Chen
Qing Li
46
4
0
21 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
31
3
0
20 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
44
2
0
19 Jul 2024
Improving Representation of High-frequency Components for Medical Visual Foundation Models
Improving Representation of High-frequency Components for Medical Visual Foundation Models
Yuetan Chu
Yilan Zhang
Zhongyi Han
Changchun Yang
Longxi Zhou
Gongning Luo
Chao Huang
Xin Gao
MedIm
51
1
0
19 Jul 2024
Temporally Grounding Instructional Diagrams in Unconstrained Videos
Temporally Grounding Instructional Diagrams in Unconstrained Videos
Jiahao Zhang
Frederic Z. Zhang
Cristian Rodriguez
Yizhak Ben-Shabat
A. Cherian
Stephen Gould
52
2
0
16 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model
  and Benchmark Dataset
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
36
4
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjD
VLM
32
1
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object
  Detection
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
76
1
0
13 Jul 2024
Visual Multi-Object Tracking with Re-Identification and Occlusion
  Handling using Labeled Random Finite Sets
Visual Multi-Object Tracking with Re-Identification and Occlusion Handling using Labeled Random Finite Sets
L. Ma
Tran Thien Dat Nguyen
Changbeom Shim
Du Yong Kim
Namkoo Ha
Moongu Jeon
VOT
47
10
0
11 Jul 2024
Bayesian Detector Combination for Object Detection with Crowdsourced
  Annotations
Bayesian Detector Combination for Object Detection with Crowdsourced Annotations
Zhi Qin Tan
Olga Isupova
Gustavo Carneiro
Xiatian Zhu
Yunpeng Li
ObjD
39
0
0
10 Jul 2024
Cross Domain Object Detection via Multi-Granularity Confidence Alignment
  based Mean Teacher
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher
Jiangming Chen
Li Liu
Wanxia Deng
Zhen Liu
Yu Liu
Yingmei Wei
Yongxiang Liu
59
0
0
10 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
50
2
0
10 Jul 2024
Described Spatial-Temporal Video Detection
Described Spatial-Temporal Video Detection
Wei Ji
Xiangyan Liu
Yingfei Sun
Jiajun Deng
You Qin
Ammar Nuwanna
Mengyao Qiu
Lina Wei
Roger Zimmermann
47
2
0
08 Jul 2024
Towards Reflected Object Detection: A Benchmark
Towards Reflected Object Detection: A Benchmark
Zhongtian Wang
You Wu
Hui Zhou
Shuiwang Li
ObjD
36
2
0
08 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene
  Synthesis
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
46
6
0
07 Jul 2024
Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit
  for Real-Time UAV Tracking
Learning Motion Blur Robust Vision Transformers with Dynamic Early Exit for Real-Time UAV Tracking
You Wu
Xucheng Wang
Dan Zeng
Hengzhou Ye
Xiaolan Xie
Qijun Zhao
Shuiwang Li
48
3
0
07 Jul 2024
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Multi-branch Collaborative Learning Network for 3D Visual Grounding
Zhipeng Qian
Yiwei Ma
Zhekai Lin
Jiayi Ji
Xiawu Zheng
Xiaoshuai Sun
Rongrong Ji
3DV
56
4
0
07 Jul 2024
POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part
  Segmentation
POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation
Arindam Dutta
Rohit Lal
Yash Garg
Calvin-Khang Ta
Dripta S. Raychaudhuri
Hannah Dela Cruz
Amit K. Roy-Chowdhury
37
1
0
04 Jul 2024
ACTRESS: Active Retraining for Semi-supervised Visual Grounding
ACTRESS: Active Retraining for Semi-supervised Visual Grounding
Weitai Kang
Mengxue Qu
Yunchao Wei
Yan Yan
46
6
0
03 Jul 2024
Visual Grounding with Attention-Driven Constraint Balancing
Visual Grounding with Attention-Driven Constraint Balancing
Weitai Kang
Luowei Zhou
Junyi Wu
Changchang Sun
Yan Yan
45
4
0
03 Jul 2024
SegVG: Transferring Object Bounding Box to Segmentation for Visual
  Grounding
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Weitai Kang
Gaowen Liu
Mubarak Shah
Yan Yan
ObjD
41
9
0
03 Jul 2024
Explainable vertebral fracture analysis with uncertainty estimation
  using differentiable rule-based classification
Explainable vertebral fracture analysis with uncertainty estimation using differentiable rule-based classification
Victor Wåhlstrand Skärström
L. Johansson
Jennifer Alvén
M. Lorentzon
Ida Häggström
23
1
0
03 Jul 2024
Similarity Distance-Based Label Assignment for Tiny Object Detection
Similarity Distance-Based Label Assignment for Tiny Object Detection
Shuohao Shi
Qiang Fang
Tong Zhao
Xin Xu
ObjD
54
2
0
02 Jul 2024
Fake News Detection and Manipulation Reasoning via Large Vision-Language
  Models
Fake News Detection and Manipulation Reasoning via Large Vision-Language Models
Ruihan Jin
Ruibo Fu
Zhengqi Wen
Shuai Zhang
Yukun Liu
Jianhua Tao
48
5
0
02 Jul 2024
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding
Jinghui Lu
Haiyang Yu
Yuanda Wang
Yongjie Ye
Jingqun Tang
...
Qi Liu
Hao Feng
Han Wang
Hao Liu
Can Huang
59
21
0
02 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
69
2
0
01 Jul 2024
eMoE-Tracker: Environmental MoE-based Transformer for Robust
  Event-guided Object Tracking
eMoE-Tracker: Environmental MoE-based Transformer for Robust Event-guided Object Tracking
Yucheng Chen
Lin Wang
49
3
0
28 Jun 2024
Weighted Circle Fusion: Ensembling Circle Representation from Different
  Object Detection Results
Weighted Circle Fusion: Ensembling Circle Representation from Different Object Detection Results
Jialin Yue
Tianyuan Yao
Ruining Deng
Quan Liu
Juming Xiong
Haichun Yang
Yuankai Huo
37
1
0
27 Jun 2024
Looking 3D: Anomaly Detection with 2D-3D Alignment
Looking 3D: Anomaly Detection with 2D-3D Alignment
A. Bhunia
Changjian Li
Hakan Bilen
45
3
0
27 Jun 2024
STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via
  Collaborating Self-Training and Adversarial Learning
STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning
Yanan Zhang
Chao Zhou
Di Huang
51
5
0
27 Jun 2024
VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning
  Challenges
VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
Jan van Gemert
VLM
44
0
0
26 Jun 2024
ScanFormer: Referring Expression Comprehension by Iteratively Scanning
ScanFormer: Referring Expression Comprehension by Iteratively Scanning
Wei Su
Peihan Miao
Huanzhang Dou
Xi Li
ObjD
50
7
0
26 Jun 2024
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring
  Video Object Segmentation
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Ci-Siang Lin
I-Jieh Liu
Min-Hung Chen
Chien-Yi Wang
Sifei Liu
Yu-Chiang Frank Wang
VOS
58
0
0
18 Jun 2024
Adaptively Bypassing Vision Transformer Blocks for Efficient Visual
  Tracking
Adaptively Bypassing Vision Transformer Blocks for Efficient Visual Tracking
Xiangyang Yang
Dan Zeng
Xucheng Wang
You Wu
Hengzhou Ye
Qijun Zhao
Shuiwang Li
64
3
0
12 Jun 2024
RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer
  Tracker
RGB-Sonar Tracking Benchmark and Spatial Cross-Attention Transformer Tracker
Yunfeng Li
Bo Wang
Jiuran Sun
Xueyi Wu
Ye Li
42
3
0
11 Jun 2024
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection
Qiang Chen
Xiangbo Su
Xinyu Zhang
Jian Wang
Jiahui Chen
...
Shan Zhang
Kun Yao
Errui Ding
Gang Zhang
Jingdong Wang
ViT
60
13
0
05 Jun 2024
Previous
12345...202122
Next