1612.03144
Feature Pyramid Networks for Object Detection
9 December 2016
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
Papers citing
"Feature Pyramid Networks for Object Detection"
50 / 5,330 papers shown
InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression
Dongchen Lu
Yuyao Sun
Zilu Zhang
Leping Huang
Jianliang Zeng
Mao Shu
Huo Cao
140
4
0
27 Mar 2025
Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders
Paul Koch
Jörg Krüger
Ankit Chowdhury
O. Heimann
MDE
99
0
0
25 Mar 2025
MaSS13K: A Matting-level Semantic Segmentation Benchmark
C. Xie
Minghan Li
Hui Zeng
Jun Luo
Lei Zhang
VLM
176
0
0
24 Mar 2025
Your ViT is Secretly an Image Segmentation Model
Tommie Kerssies
Niccolò Cavagnero
Alexander Hermans
Narges Norouzi
Giuseppe Averta
Bastian Leibe
Gijs Dubbelman
Daan de Geus
ViT
VLM
123
5
0
24 Mar 2025
Training-free Diffusion Acceleration with Bottleneck Sampling
Ye Tian
Xin Xia
Yuxi Ren
Shanchuan Lin
Xing Wang
Xuefeng Xiao
Yunhai Tong
L. Yang
Tengjiao Wang
126
2
0
24 Mar 2025
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Sara Al-Emadi
Yin Yang
Ferda Ofli
71
0
0
24 Mar 2025
FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation
Qingshan Hou
Ming Wang
Peng Cao
Zou Ke
Xiaoli Liu
Huazhu Fu
Osmar R. Zaiane
MedIm
120
2
0
22 Mar 2025
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
Xiyue Guo
Jiarui Hu
Junjie Hu
Hujun Bao
Guofeng Zhang
117
1
0
21 Mar 2025
Should we pre-train a decoder in contrastive learning for dense prediction tasks?
S. Quetin
Tapotosh Ghosh
Farhad Maleki
SSL
112
0
0
21 Mar 2025
You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection
Daniel Kuhse
Harun Teper
Sebastian Buschjäger
Chien-Yao Wang
Jian-Jia Chen
AAML
120
1
0
21 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
80
3
0
20 Mar 2025
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
Zechuan Li
Hongshan Yu
Yihao Ding
Jinhao Qiao
Basim Azam
Naveed Akhtar
3DPC
126
0
0
19 Mar 2025
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Gyeongrok Oh
Sungjune Kim
Heeju Ko
Hyung-Gun Chi
J. Kim
Dongwook Lee
Daehyun Ji
Sungjoon Choi
Sujin Jang
Sangpil Kim
93
1
0
19 Mar 2025
Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images
Nobuhiko Wakai
Satoshi Sato
Yasunori Ishii
Takayoshi Yamashita
122
0
0
18 Mar 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
143
0
0
18 Mar 2025
Is Discretization Fusion All You Need for Collaborative Perception?
Kang Yang
Tianci Bu
L. Li
Chunxu Li
Yanjie Wang
Deying Li
145
0
0
18 Mar 2025
Dynamic Relation Inference via Verb Embeddings
Omri Suissa
Muhiim Ali
Ariana Azarbal
Hui Shen
Shekhar Pradhan
102
0
0
17 Mar 2025
Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation
Nassim Ali Ousalah
Anis Kacem
Enjie Ghorbel
Emmanuel Koumandakis
Djamila Aouada
99
0
0
17 Mar 2025
AI-Powered Automated Model Construction for Patient-Specific CFD Simulations of Aortic Flows
P. Du
Delin An
Chaoli Wang
Jian-Xun Wang
MedIm
AI4CE
88
1
0
16 Mar 2025
Atlas: Multi-Scale Attention Improves Long Context Image Modeling
Kumar Krishna Agrawal
Long Lian
Lu Liu
Natalia Harguindeguy
Boyi Li
Alexander Bick
Maggie Chung
Trevor Darrell
Adam Yala
ViT
89
0
0
16 Mar 2025
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
Jiangning Wei
Lixiong Qin
Bo Yu
Tianjian Zou
Chuhan Yan
Dandan Xiao
Yang Yu
Lan Yang
Ke Li
Jun Liu
71
0
0
14 Mar 2025
DCAT: Dual Cross-Attention Fusion for Disease Classification in Radiological Images with Uncertainty Estimation
Jutika Borah
H. Singh
MedIm
170
0
0
14 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD
VLM
87
0
0
13 Mar 2025
HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer
Zhang Zhang
Chao Sun
Chao Yue
Da Wen
Yujie Chen
Tianze Wang
Jianghao Leng
ViT
92
1
0
13 Mar 2025
Implicit Contrastive Representation Learning with Guided Stop-gradient
Byeongchan Lee
Sehyun Lee
SSL
272
2
0
12 Mar 2025
TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues
Hannah VanderHoeven
Brady Bhalla
Ibrahim Khebour
Austin Youngren
Videep Venkatesha
...
Yifan Zhu
Kenneth Lai
Changsoo Jung
James Pustejovsky
Nikhil Krishnaswamy
82
2
0
12 Mar 2025
CL-MVSNet: Unsupervised Multi-view Stereo with Dual-level Contrastive Learning
K. Xiong
Rui Peng
Zhe Zhang
Tianxing Feng
Jianbo Jiao
Feng Gao
Ronggang Wang
124
14
0
11 Mar 2025
PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net
Jun Yin
Yangfan He
Miao Zhang
Pengyu Zeng
Tianyi Wang
Shuai Lu
Xueqian Wang
DiffM
145
7
0
11 Mar 2025
Bridge Frame and Event: Common Spatiotemporal Fusion for High-Dynamic Scene Optical Flow
Hanyu Zhou
Haonan Wang
Haoyue Liu
Yuxing Duan
Yi Chang
Luxin Yan
102
0
0
10 Mar 2025
AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection
Jialin Lu
Junjie Shan
Ziqi Zhao
Ka-Ho Chow
AAML
160
0
0
09 Mar 2025
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion
Meng Wang
Huilong Pi
Ruihui Li
Yunchuan Qin
Zhuo Tang
KenLi Li
92
2
0
08 Mar 2025
Vision-based 3D Semantic Scene Completion via Capture Dynamic Representations
Meng Wang
Fan Wu
Yunchuan Qin
Ruihui Li
Zhuo Tang
KenLi Li
3DPC
146
0
0
08 Mar 2025
CoinRobot: Generalized End-to-end Robotic Learning for Physical Intelligence
Yue Zhao
Huxian Liu
Xiang Chen
Jiankai Sun
Jiahuan Yan
Luhui Hu
115
1
0
07 Mar 2025
EDM: Efficient Deep Feature Matching
Xi Li
Tong Rao
Cihui Pan
96
0
0
07 Mar 2025
Periodontal Bone Loss Analysis via Keypoint Detection With Heuristic Post-Processing
Ryan Banks
Vishal Thengane
María Eugenia Guerrero
Nelly Maria García-Madueño
Yunpeng Li
Hongying Tang
A. Chaurasia
85
0
0
05 Mar 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
284
0
0
05 Mar 2025
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Suhwan Cho
Seunghoon Lee
Minhyeok Lee
Jungho Lee
Sangyoun Lee
VOS
168
0
0
05 Mar 2025
DarkDeblur: Learning single-shot image deblurring in low-light condition
S. Sharif
R. A. Naqvi
Farman Alic
Mithun Biswas
VLM
174
21
0
04 Mar 2025
Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance
Jiayi Zhao
Fei Teng
Kai Luo
Guoqiang Zhao
Hui Yuan
Xu Zheng
Kailun Yang
VLM
127
7
0
04 Mar 2025
Out-of-Distribution Segmentation in Autonomous Driving: Problems and State of the Art
Youssef Shoeb
Azarm Nowzad
Hanno Gottschalk
UQCV
286
2
0
04 Mar 2025
Aerial Infrared Health Monitoring of Solar Photovoltaic Farms at Scale
Isaac Corley
Conor Wallace
Sourav Agrawal
Burton Putrah
Jonathan Lwowski
89
1
0
03 Mar 2025
An Efficient Approach to Detecting Lung Nodules Using Swin Transformer
Saeed Shakuri
Alireza Rezvanian
ViT
MedIm
97
1
0
03 Mar 2025
Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR
Muhammad Musab Ansari
55
0
0
03 Mar 2025
Wavelet-Driven Masked Image Modeling: A Path to Efficient Visual Representation
Wenzhao Xiang
Chang Liu
Hongyang Yu
Xilin Chen
77
0
0
02 Mar 2025
RFWNet: A Lightweight Remote Sensing Object Detector Integrating Multiscale Receptive Fields and Foreground Focus Mechanism
Yujie Lei
Wenjie Sun
Sen Jia
Qingquan Li
Jie Zhang
86
0
0
01 Mar 2025
Transformers with Joint Tokens and Local-Global Attention for Efficient Human Pose Estimation
K. A. Kinfu
René Vidal
ViT
62
0
0
28 Feb 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
ObjD
VLM
80
0
0
28 Feb 2025
BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground
Yufei Wei
Sha Lu
Wangtao Lu
R. Xiong
Yansen Wang
114
0
0
27 Feb 2025
Automatic Temporal Segmentation for Post-Stroke Rehabilitation: A Keypoint Detection and Temporal Segmentation Approach for Small Datasets
Jisoo Lee
Tamim Ahmed
Thanassis Rikakis
Pavan Turaga
77
0
0
27 Feb 2025
HDM: Hybrid Diffusion Model for Unified Image Anomaly Detection
Zekang Weng
Jinjin Shi
Jinwei Wang
Zeming Han
127
0
0
26 Feb 2025