ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.03144
  4. Cited By
Feature Pyramid Networks for Object Detection
v1v2 (latest)

Feature Pyramid Networks for Object Detection

9 December 2016
Nayeon Lee
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
    ObjD
ArXiv (abs)PDFHTML

Papers citing "Feature Pyramid Networks for Object Detection"

50 / 5,329 papers shown
Title
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
FlatFusion: Delving into Details of Sparse Transformer-based Camera-LiDAR Fusion for Autonomous Driving
Yutao Zhu
Xiaosong Jia
Xinyu Yang
Junchi Yan
ViT
87
6
0
01 Jul 2025
Open World Object Detection: A Survey
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
94
1
0
01 Jul 2025
Dense Feature Interaction Network for Image Inpainting Localization
Dense Feature Interaction Network for Image Inpainting Localization
Ye Yao
Tingfeng Han
Shan Jia
Siwei Lyu
78
1
0
01 Jul 2025
MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation
MedSegNet10: A Publicly Accessible Network Repository for Split Federated Medical Image Segmentation
C. Shiranthika
Zahra Hafezi Kafshgari
Hadi Hadizadeh
Parvaneh Saeedi
FedML
96
0
0
01 Jul 2025
A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving
A Synthetic Benchmark for Collaborative 3D Semantic Occupancy Prediction in V2X Autonomous Driving
Hanlin Wu
Pengfei Lin
Ehsan Javanmardi
Naren Bao
Bo Qian
Hao Si
Manabu Tsukada
3DV
14
0
0
20 Jun 2025
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
Sen Wang
Le Wang
Sanping Zhou
Jingyi Tian
Jiayi Li
Haowen Sun
Wei Tang
32
0
0
19 Jun 2025
Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset
Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset
Nikolaos Dionelis
Jente Bosmans
Riccardo Musto
Giancarlo Paoletti
Simone Sarti
Giacomo Cascarano
Casper Fibaek
Luke Camilleri
B. L. Saux
Nicolas Longépé
22
0
0
17 Jun 2025
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
Johannes Flotzinger
Fabian Deuser
Achref Jaziri
Heiko Neumann
Norbert Oswald
Visvanathan Ramesh
T. Braml
37
0
0
17 Jun 2025
ViSAGe: Video-to-Spatial Audio Generation
ViSAGe: Video-to-Spatial Audio Generation
Jaeyeon Kim
Heeseung Yun
Gunhee Kim
VGen
37
2
0
13 Jun 2025
Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling
Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling
Yunhan Ren
Ruihuang Li
Lingbo Liu
Changwen Chen
30
0
0
13 Jun 2025
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
Xiao Xu
L. Qin
Wanxiang Che
Min-Yen Kan
MoEVLM
36
0
0
13 Jun 2025
CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects
Tao Liu
Zhenchao Cui
71
0
0
11 Jun 2025
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective
Data-Efficient Challenges in Visual Inductive Priors: A Retrospective
Robert-Jan Bruintjes
A. Lengyel
O. Kayhan
Davide Zambrano
Nergis Tomen
Hadi Jamali Rad
Jan van Gemert
VLM
38
0
0
10 Jun 2025
TABLET: Table Structure Recognition using Encoder-only Transformers
TABLET: Table Structure Recognition using Encoder-only Transformers
Qiyu Hou
Jun Wang
ViTLMTD
22
0
0
08 Jun 2025
BePo: Leveraging Birds Eye View and Sparse Points for Efficient and Accurate 3D Occupancy Prediction
BePo: Leveraging Birds Eye View and Sparse Points for Efficient and Accurate 3D Occupancy Prediction
Yunxiao Shi
Hong Cai
Jisoo Jeong
Yinhao Zhu
Shizhong Han
Amin Ansari
Fatih Porikli
3DPC
24
0
0
08 Jun 2025
Robust sensor fusion against on-vehicle sensor staleness
Robust sensor fusion against on-vehicle sensor staleness
Meng Fan
Yifan Zuo
Patrick Blaes
Harley Montgomery
Subhasis Das
53
0
0
06 Jun 2025
Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Fan Yang
Per Frivik
David Hoeller
Chen Wang
Cesar Cadena
Marco Hutter
52
0
0
06 Jun 2025
Hierarchical Implicit Neural Emulators
Ruoxi Jiang
Xiao Zhang
Karan Jakhar
Peter Y. Lu
Pedram Hassanzadeh
Michael Maire
Rebecca Willett
AI4CE
102
0
0
05 Jun 2025
Deep Learning Reforms Image Matching: A Survey and Outlook
Shihua Zhang
Zizhuo Li
Kaining Zhang
Yifan Lu
Yuxin Deng
Linfeng Tang
Xingyu Jiang
Jiayi Ma
3DV
115
0
0
05 Jun 2025
VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
W. Li
Zhu Yu
Alexandre Alahi
3DPC
126
0
0
05 Jun 2025
PDSE: A Multiple Lesion Detector for CT Images using PANet and Deformable Squeeze-and-Excitation Block
PDSE: A Multiple Lesion Detector for CT Images using PANet and Deformable Squeeze-and-Excitation Block
Di Fan
Heng Yu
Zhiyuan Xu
MedIm
77
0
0
04 Jun 2025
Modelship Attribution: Tracing Multi-Stage Manipulations Across Generative Models
Modelship Attribution: Tracing Multi-Stage Manipulations Across Generative Models
Zhiya Tan
Xin Zhang
Joey Tianyi Zhou
74
0
0
03 Jun 2025
Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation
Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation
Mingjie Wei
Xuemei Xie
Yutong Zhong
Guangming Shi
3DH
76
2
0
03 Jun 2025
MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow
MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow
Jakob Schmid
Azin Jahedi
Noah Berenguel Senn
Andrés Bruhn
3DPC
58
0
0
02 Jun 2025
Quotient Network - A Network Similar to ResNet but Learning Quotients
Quotient Network - A Network Similar to ResNet but Learning Quotients
Peng Hui
Jiamuyang Zhao
Changxin Li
Qingzhen Zhu
OOD
31
0
0
01 Jun 2025
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting
Yuyuan Liu
Yuanhong Chen
Chong Wang
Junlin Han
Junde Wu
Can Peng
Jingkun Chen
Yu Tian
Gustavo Carneiro
VLM
54
0
0
01 Jun 2025
LiDAR Based Semantic Perception for Forklifts in Outdoor Environments
LiDAR Based Semantic Perception for Forklifts in Outdoor Environments
Benjamin Serfling
H. Reichert
Lorenzo Bayerlein
Konrad Doll
Kati Radkhah-Lens
38
0
0
28 May 2025
Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations
Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations
Yue Li Du
Ben Alexander
Mikhail Antonenka
Rohan Mahadev
Hao Wu
Dmitry Kislyuk
41
0
0
27 May 2025
Spatial RoboGrasp: Generalized Robotic Grasping Control Policy
Spatial RoboGrasp: Generalized Robotic Grasping Control Policy
Yiqi Huang
Travis Davies
Jiahuan Yan
Jiankai Sun
Xiang Chen
Luhui Hu
73
0
0
27 May 2025
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Wen Yin
Yong Wang
Guiduo Duan
Dongyang Zhang
Xin Hu
Yuan-Fang Li
Tao He
127
0
0
26 May 2025
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis
You Wang
Li Fang
Hao Zhu
Fei Hu
Long Ye
Zhan Ma
ViT
43
0
0
26 May 2025
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving
Chen Shi
Shaoshuai Shi
Kehua Sheng
Bo Zhang
Li Jiang
VGen
88
0
0
25 May 2025
Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling
Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling
Hojun Son
Asma Almutairi
Arpan Kusari
127
0
0
24 May 2025
Semantic segmentation with reward
Semantic segmentation with reward
Xie Ting
Ye Huang
Zhilin Liu
Lixin Duan
316
0
0
23 May 2025
Semantic Correspondence: Unified Benchmarking and a Strong Baseline
Semantic Correspondence: Unified Benchmarking and a Strong Baseline
Kaiyan Zhang
Xinghui Li
Jingyi Lu
Kai Han
3DV
87
1
0
23 May 2025
Deep Learning-Driven Ultra-High-Definition Image Restoration: A Survey
Deep Learning-Driven Ultra-High-Definition Image Restoration: A Survey
Liyan Wang
Weixiang Zhou
Cong Wang
Kin-Man Lam
Zhixun Su
Jinshan Pan
62
0
0
22 May 2025
gen2seg: Generative Models Enable Generalizable Instance Segmentation
gen2seg: Generative Models Enable Generalizable Instance Segmentation
Om Khangaonkar
Hamed Pirsiavash
DiffMVLM
149
0
0
21 May 2025
Deep Learning Enabled Segmentation, Classification and Risk Assessment of Cervical Cancer
Deep Learning Enabled Segmentation, Classification and Risk Assessment of Cervical Cancer
Abdul Samad Shaik
Shashaank Mattur Aswatha
Rahul Jashvantbhai Pandya
44
0
0
21 May 2025
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset
Hua Li
Shijie Lian
Zhiyuan Li
Runmin Cong
Sam Kwong
VLM
83
0
0
21 May 2025
Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey
Plane Geometry Problem Solving with Multi-modal Reasoning: A Survey
Seunghyuk Cho
Zhenyue Qin
Yang Liu
Youngbin Choi
Seungbeom Lee
Dongwoo Kim
LRM
108
0
0
20 May 2025
Rethinking Features-Fused-Pyramid-Neck for Object Detection
Rethinking Features-Fused-Pyramid-Neck for Object Detection
Hulin Li
221
0
0
19 May 2025
Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation
Robust Multimodal Segmentation with Representation Regularization and Hybrid Prototype Distillation
Jiaqi Tan
Xu Zheng
Yuhang Liu
79
0
0
19 May 2025
Cross-modal feature fusion for robust point cloud registration with ambiguous geometry
Cross-modal feature fusion for robust point cloud registration with ambiguous geometry
Zhaoyi Wang
Shengyu Huang
Jemil Avers Butt
Yuanzhou Cai
Matej Varga
A. Wieser
3DPC
103
1
0
19 May 2025
A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation
A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation
Jinke Li
Yue Wu
Xiaoyan Yang
70
0
0
16 May 2025
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
Ziyu Zhao
Xiaoguang Li
Linjia Shi
Nasrin Imanpour
Song Wang
VLM
77
0
0
16 May 2025
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures
Shun Inadumi
Nobuhiro Ueda
Koichiro Yoshino
ObjD
80
0
0
16 May 2025
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking
Sicheng Shen
Dongcheng Zhao
Linghao Feng
Zeyang Yue
Jindong Li
Tenglong Li
Guobin Shen
Yi Zeng
74
0
0
16 May 2025
Multi-view dense image matching with similarity learning and geometry priors
Multi-view dense image matching with similarity learning and geometry priors
Mohamed Ali Chebbi
Ewelina Rupnik
P. Lopes
M. Pierrot-Deseilligny
52
0
0
16 May 2025
GaussianFormer3D: Multi-Modal Gaussian-based Semantic Occupancy Prediction with 3D Deformable Attention
GaussianFormer3D: Multi-Modal Gaussian-based Semantic Occupancy Prediction with 3D Deformable Attention
Lingjun Zhao
Sizhe Wei
James Hays
Lu Gan
3DGS3DPC
77
0
0
15 May 2025
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
Bin-Bin Gao
VLM
196
0
0
14 May 2025
1234...105106107
Next