ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.01497
  4. Cited By
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
v1v2v3 (latest)

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

4 June 2015
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
    AIMatObjD
ArXiv (abs)PDFHTML

Papers citing "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks"

50 / 10,561 papers shown
Title
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding
Zhuoyuan Li
Zikun Yuan
Li Li
Dong Liu
Xiaohu Tang
Feng Wu
VOS
88
12
0
18 Mar 2024
Better (pseudo-)labels for semi-supervised instance segmentation
Better (pseudo-)labels for semi-supervised instance segmentation
Franccois Porcher
Camille Couprie
Marc Szafraniec
Jakob Verbeek
ISeg
70
1
0
18 Mar 2024
R2SNet: Scalable Domain Adaptation for Object Detection in Cloud-Based
  Robotic Ecosystems via Proposal Refinement
R2SNet: Scalable Domain Adaptation for Object Detection in Cloud-Based Robotic Ecosystems via Proposal Refinement
Michele Antonazzi
Matteo Luperto
N. A. Borghese
Nicola Basilico
97
2
0
18 Mar 2024
Circle Representation for Medical Instance Object Segmentation
Circle Representation for Medical Instance Object Segmentation
Juming Xiong
Ethan H. Nguyen
Yilin Liu
Ruining Deng
R. Tyree
...
Girish Hiremath
Yaohong Wang
Haichun Yang
Agnes B. Fogo
Yuankai Huo
48
2
0
18 Mar 2024
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient
  Motion Prediction
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Yang Zhou
Hao Shao
Letian Wang
Steven L. Waslander
Hongsheng Li
Yu Liu
97
18
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
121
7
0
18 Mar 2024
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object
  Detection under Unknown Degradations
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations
Yuwei Zhang
Yan Wu
Yanming Liu
Xinyue Peng
110
8
0
17 Mar 2024
GRA: Detecting Oriented Objects through Group-wise Rotating and
  Attention
GRA: Detecting Oriented Objects through Group-wise Rotating and Attention
Jiangshan Wang
Yifan Pu
Yizeng Han
Jiayi Guo
Yiru Wang
Xiu Li
Gao Huang
93
11
0
17 Mar 2024
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent
  Recognition and Out-of-scope Detection in Conversations
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations
Hanlei Zhang
Xin Wang
Hua Xu
Qianrui Zhou
Kai Gao
Jianhua Su
jinyue Zhao
Wenrui Li
Yanting Chen
155
6
0
16 Mar 2024
Cannabis Seed Variant Detection using Faster R-CNN
Cannabis Seed Variant Detection using Faster R-CNN
Toqi Tahamid Sarker
Taminul Islam
Khaled R Ahmed
72
2
0
15 Mar 2024
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized
  Scaled Prediction Consistency
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency
Soumyadeep Pal
Yuguang Yao
Ren Wang
Bingquan Shen
Sijia Liu
AAML
78
9
0
15 Mar 2024
Open Stamped Parts Dataset
Open Stamped Parts Dataset
Sara Antiles
S. Talathi
48
0
0
15 Mar 2024
SimPB: A Single Model for 2D and 3D Object Detection from Multiple
  Cameras
SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
Yingqi Tang
Zhaotie Meng
Guoliang Chen
Erkang Cheng
3DPC
65
1
0
15 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Generative Region-Language Pretraining for Open-Ended Object Detection
Chuang Lin
Yi Jiang
Zhuang Li
Zehuan Yuan
Jianfei Cai
ObjDVLM
86
23
0
15 Mar 2024
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for
  Long-Range 3D Perception
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception
Yiheng Li
Hongyang Li
Zehao Huang
Hong Chang
Naiyan Wang
100
3
0
15 Mar 2024
Attention-based Class-Conditioned Alignment for Multi-Source Domain
  Adaptation of Object Detectors
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors
Atif Belal
Akhil Meethal
Francisco Perdigon Romero
M. Pedersoli
Eric Granger
98
1
0
14 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and
  Instance Contrastive Optimization
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjDVLM
126
2
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained
  Detectors
Efficient Transferability Assessment for Selection of Pre-trained Detectors
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
81
0
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language
  Interface
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
119
13
0
14 Mar 2024
D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap
  for Domain-Adaptive Object Detection
D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object Detection
Dinh Phat Do
Taehoon Kim
Jaemin Na
Jiwon Kim
Keonho Lee
Kyunghwan Cho
Wonjun Hwang
106
8
0
14 Mar 2024
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling
  and Visual-Language Co-Referring
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Yufei Zhan
Yousong Zhu
Hongyin Zhao
Fan Yang
Ming Tang
Jinqiao Wang
ObjD
103
16
0
14 Mar 2024
Anatomical Structure-Guided Medical Vision-Language Pre-training
Anatomical Structure-Guided Medical Vision-Language Pre-training
Qingqiu Li
Xiaohan Yan
Jilan Xu
Runtian Yuan
Yuejie Zhang
Rui Feng
Quanli Shen
Xiaobo Zhang
Shujun Wang
105
6
0
14 Mar 2024
SHAN: Object-Level Privacy Detection via Inference on Scene
  Heterogeneous Graph
SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph
Zhuohang Jiang
Bingkui Tong
Xia Du
Ahmed Alhammadi
Jizhe Zhou
92
0
0
14 Mar 2024
MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation
MARVIS: Motion & Geometry Aware Real and Virtual Image Segmentation
Jiayi Wu
Xiao-sheng Lin
S. Negahdaripour
Cornelia Fermuller
Yiannis Aloimonos
219
3
0
14 Mar 2024
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with
  Focused Masked Autoencoders
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
Soumen Basu
Mayuna Gupta
Chetan Madan
Pankaj Gupta
Chetan Arora
90
7
0
13 Mar 2024
Data Augmentation in Human-Centric Vision
Data Augmentation in Human-Centric Vision
Wentao Jiang
Yige Zhang
Shaozhong Zheng
Si Liu
Shuicheng Yan
101
1
0
13 Mar 2024
Fast Inference of Removal-Based Node Influence
Fast Inference of Removal-Based Node Influence
Weikai Li
Zhiping Xiao
Xiao Luo
Yizhou Sun
AAML
109
1
0
13 Mar 2024
AutoDFP: Automatic Data-Free Pruning via Channel Similarity
  Reconstruction
AutoDFP: Automatic Data-Free Pruning via Channel Similarity Reconstruction
Siqi Li
Jun Chen
Jingyang Xiang
Chengrui Zhu
Yong-Jin Liu
88
1
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object
  Detection
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjDMLLMVLM
106
19
0
12 Mar 2024
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial
  Creation on Physical Tasks
TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks
Yuexi Chen
Vlad I. Morariu
Anh Truong
Zhicheng Liu
DiffMVGen
72
5
0
12 Mar 2024
MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation
MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation
Yuelong Li
Yafei Mao
Raja Bala
Sunil Hadap
115
12
0
12 Mar 2024
Aedes aegypti Egg Counting with Neural Networks for Object Detection
Aedes aegypti Egg Counting with Neural Networks for Object Detection
Micheli Nayara de Oliveira Vicente
G. Higa
João Vitor de Andrade Porto
Higor Henrique
Picoli Nucci
Asser Botelho Santana
K. R. A. Porto
A. R. Roel
H. Pistori
37
1
0
12 Mar 2024
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Han Qiu
Jiaxing Huang
Peng Gao
Lewei Lu
Xiaoqin Zhang
Shijian Lu
89
4
0
12 Mar 2024
Mondrian: On-Device High-Performance Video Analytics with Compressive
  Packed Inference
Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference
Changmin Jeon
Seonjun Kim
Juheon Yi
Youngki Lee
110
1
0
12 Mar 2024
Open-World Semantic Segmentation Including Class Similarity
Open-World Semantic Segmentation Including Class Similarity
Matteo Sodano
Federico Magistri
Lucas Nunes
Jens Behley
C. Stachniss
VLM
76
8
0
12 Mar 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing
  Objects in 3D Scenes
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
96
12
0
12 Mar 2024
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large
  Multimodal Models
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Yang Jiao
Shaoxiang Chen
Zequn Jie
Wenke Huang
Lin Ma
Yueping Jiang
MLLM
93
21
0
12 Mar 2024
Continual All-in-One Adverse Weather Removal with Knowledge Replay on a
  Unified Network Structure
Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure
De Cheng
Yanling Ji
Dong Gong
Yan Li
Nannan Wang
Junwei Han
Dingwen Zhang
CLL
96
19
0
12 Mar 2024
COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization
COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization
Aozhong Zhang
Zi Yang
Naigang Wang
Yingyong Qin
Jack Xin
Xin Li
Penghang Yin
VLMMQ
72
3
0
11 Mar 2024
Class Imbalance in Object Detection: An Experimental Diagnosis and Study
  of Mitigation Strategies
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies
Nieves Crasto
ObjD
104
7
0
11 Mar 2024
A cascaded deep network for automated tumor detection and segmentation
  in clinical PET imaging of diffuse large B-cell lymphoma
A cascaded deep network for automated tumor detection and segmentation in clinical PET imaging of diffuse large B-cell lymphoma
Shadab Ahamed
Natalia Dubljevic
I. Bloise
C. Gowdy
Patrick Martineau
Don C. Wilson
C. Uribe
Arman Rahmim
F. Yousefirizi
MedIm
77
4
0
11 Mar 2024
Memory-based Adapters for Online 3D Scene Perception
Memory-based Adapters for Online 3D Scene Perception
Xiuwei Xu
Chong Xia
Ziwei Wang
Linqing Zhao
Yueqi Duan
Jie Zhou
Jiwen Lu
3DPC
70
6
0
11 Mar 2024
Bayesian Diffusion Models for 3D Shape Reconstruction
Bayesian Diffusion Models for 3D Shape Reconstruction
Haiyang Xu
Yu Lei
Zeyuan Chen
Xiang Zhang
Yue Zhao
Yilin Wang
Zhuowen Tu
DiffM
108
10
0
11 Mar 2024
Real-time Transformer-based Open-Vocabulary Detection with Efficient
  Fusion Head
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head
Tiancheng Zhao
Peng Liu
Xuan He
Lu Zhang
Kyusong Lee
ObjD
77
10
0
11 Mar 2024
Genetic Learning for Designing Sim-to-Real Data Augmentations
Genetic Learning for Designing Sim-to-Real Data Augmentations
Bram Vanherle
Nick Michiels
F. Reeth
72
0
0
11 Mar 2024
Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
Chaoqun Du
Yulin Wang
Shiji Song
Gao Huang
109
33
0
11 Mar 2024
Car Damage Detection and Patch-to-Patch Self-supervised Image Alignment
Car Damage Detection and Patch-to-Patch Self-supervised Image Alignment
Hanxiao Chen
29
0
0
11 Mar 2024
Density-Guided Label Smoothing for Temporal Localization of Driving
  Actions
Density-Guided Label Smoothing for Temporal Localization of Driving Actions
Tunç Alkanat
Erkut Akdag
Egor Bondarev
Peter H. N. de With
76
4
0
11 Mar 2024
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for
  Distracted Driver Action Recognition
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition
Erkut Akdag
Zeqi Zhu
Egor Bondarev
Peter H. N. de With
ViT
94
5
0
11 Mar 2024
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale
  SAR Object Detection
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
Yuxuan Li
Xiang Li
Wei-Jang Li
Qibin Hou
Li Liu
Ming-Ming Cheng
Jian Yang
122
39
0
11 Mar 2024
Previous
123...303132...210211212
Next