ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.02446
  4. Cited By
Panoptic Feature Pyramid Networks

Panoptic Feature Pyramid Networks

8 January 2019
Alexander Kirillov
Ross B. Girshick
Kaiming He
Piotr Dollár
    ISeg
    SSeg
ArXivPDFHTML

Papers citing "Panoptic Feature Pyramid Networks"

50 / 247 papers shown
Title
TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection
TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection
Wenkui Yang
Zhida Zhang
Xiaoqiang Zhou
Junxian Duan
Jie Cao
DiffM
30
0
0
13 May 2025
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation
Yulong Guo
Zilun Zhang
Yongheng Shang
Tiancheng Zhao
Shuiguang Deng
Yingchun Yang
Jianwei Yin
68
0
0
28 Apr 2025
Is Pre-training Applicable to the Decoder for Dense Prediction?
Is Pre-training Applicable to the Decoder for Dense Prediction?
Chao Ning
Wanshui Gan
Weihao Xuan
Naoto Yokoya
48
0
0
05 Mar 2025
DarkDeblur: Learning single-shot image deblurring in low-light condition
S. Sharif
R. A. Naqvi
Farman Alic
Mithun Biswas
VLM
92
18
0
04 Mar 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
72
0
0
26 Jan 2025
PolaFormer: Polarity-aware Linear Attention for Vision Transformers
Weikang Meng
Yadan Luo
Xin Li
D. Jiang
Zheng Zhang
159
0
0
25 Jan 2025
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
Yunxiang Fu
Meng Lou
Yizhou Yu
115
1
0
16 Dec 2024
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
115
3
0
22 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
47
1
0
12 Nov 2024
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Triplane Grasping: Efficient 6-DoF Grasping with Single RGB Images
Yiming Li
Hanchi Ren
Jingjing Deng
Jingjing Deng
Xianghua Xie
36
0
0
21 Oct 2024
Task Consistent Prototype Learning for Incremental Few-shot Semantic
  Segmentation
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation
Wenbo Xu
Yanan Wu
Haoran Jiang
Yang Wang
Qiang Wu
Jian Zhang
CLL
VLM
30
0
0
16 Oct 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
132
0
0
19 Sep 2024
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Brain-Inspired Stepwise Patch Merging for Vision Transformers
Yonghao Yu
Dongcheng Zhao
Guobin Shen
Yiting Dong
Yi Zeng
58
0
0
11 Sep 2024
Accuracy Improvement of Cell Image Segmentation Using Feedback Former
Accuracy Improvement of Cell Image Segmentation Using Feedback Former
Hinako Mitsuoka
Kazuhiro Hotta
ViT
MedIm
44
0
0
23 Aug 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
46
3
0
31 Jul 2024
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
Kazi Sajeed Mehrab
M. Maruf
Arka Daw
Harish Babu Manogaran
Abhilash Neog
...
Paula Mabee
Wasila Dahdul
Anuj Karpatne
Wasila M Dahdul
Anuj Karpatne
41
4
0
10 Jul 2024
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning
Xiangyu Zhao
Xiangtai Li
Haodong Duan
Haian Huang
Yining Li
Kai Chen
Hua Yang
VLM
MLLM
45
10
0
25 Jun 2024
LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery
LOGCAN++: Adaptive Local-global class-aware network for semantic segmentation of remote sensing imagery
Xiaowen Ma
Rongrong Lian
Zhenkai Wu
Hongbo Guo
Mengting Ma
Sensen Wu
Zhenhong Du
Siyang Song
Wei Zhang
47
4
0
24 Jun 2024
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part
  Representations
Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations
Daan de Geus
Gijs Dubbelman
40
0
0
14 Jun 2024
Vision Transformer with Sparse Scan Prior
Vision Transformer with Sparse Scan Prior
Qihang Fan
Huaibo Huang
Mingrui Chen
Ran He
ViT
48
5
0
22 May 2024
Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic
  Segmentation
Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation
G. Abati
J. C. V. Soares
V. S. Medeiros
M. Meggiolaro
Claudio Semini
29
2
0
03 May 2024
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster
  Pre-training on Web-scale Image-Text Data
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Sachin Mehta
Maxwell Horton
Fartash Faghri
Mohammad Hossein Sekhavat
Mahyar Najibi
Mehrdad Farajtabar
Oncel Tuzel
Mohammad Rastegari
VLM
CLIP
44
6
0
24 Apr 2024
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic
  Dataset in Crowded Human Environments
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Duy-Tho Le
Chenhui Gou
Stavya Datta
Hengcan Shi
Ian Reid
Jianfei Cai
Hamid Rezatofighi
32
2
0
02 Apr 2024
Efficient Modulation for Vision Networks
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
43
17
0
29 Mar 2024
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Ting Yao
Yehao Li
Yingwei Pan
Tao Mei
ViT
31
15
0
18 Mar 2024
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Hongyuan Yu
Cheng Wan
Mengchen Liu
Dongdong Chen
Bin Xiao
Xiyang Dai
Yan Huang
Yuan Lu
Liang Wang
73
5
0
15 Mar 2024
Open-World Semantic Segmentation Including Class Similarity
Open-World Semantic Segmentation Including Class Similarity
Matteo Sodano
Federico Magistri
Lucas Nunes
Jens Behley
C. Stachniss
VLM
42
8
0
12 Mar 2024
Benchmarking the Robustness of Panoptic Segmentation for Automated
  Driving
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Yiting Wang
Haonan Zhao
Daniel Gummadi
M. Dianati
Kurt Debattista
Valentina Donzella
36
2
0
23 Feb 2024
FViT: A Focal Vision Transformer with Gabor Filter
FViT: A Focal Vision Transformer with Gabor Filter
Yulong Shi
Mingwei Sun
Yongshuai Wang
Rui Wang
60
4
0
17 Feb 2024
Gyroscope-Assisted Motion Deblurring Network
Gyroscope-Assisted Motion Deblurring Network
Simin Luan
Cong Yang
Zeyd Boukhers
Xue Qin
Dongfeng Cheng
Wei Sui
Zhijun Li
20
0
0
10 Feb 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask
  Inpainting
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
46
11
0
18 Jan 2024
SCHEME: Scalable Channel Mixer for Vision Transformers
SCHEME: Scalable Channel Mixer for Vision Transformers
Deepak Sridhar
Yunsheng Li
Nuno Vasconcelos
47
0
0
01 Dec 2023
JPPF: Multi-task Fusion for Consistent Panoptic-Part Segmentation
JPPF: Multi-task Fusion for Consistent Panoptic-Part Segmentation
Shishir Muralidhara
Sravan Kumar Jagadeesh
René Schuster
Didier Stricker
29
1
0
30 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
Yizhou Yu
ViT
44
36
0
30 Oct 2023
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Multimodal Variational Auto-encoder based Audio-Visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yiran Zhong
Yuchao Dai
40
34
0
12 Oct 2023
Causal Unsupervised Semantic Segmentation
Causal Unsupervised Semantic Segmentation
Junho Kim
Byung-Kwan Lee
Yonghyun Ro
33
18
0
11 Oct 2023
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention
Yulong Shi
Mingwei Sun
Yongshuai Wang
Hui Sun
Zengqiang Chen
34
4
0
10 Oct 2023
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and
  Favorable Transferability For ViTs
CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs
Ao Wang
Hui Chen
Zijia Lin
Sicheng Zhao
J. Han
Guiguang Ding
ViT
34
6
0
27 Sep 2023
Multi-label affordance mapping from egocentric vision
Multi-label affordance mapping from egocentric vision
Lorenzo Mur-Labadia
Jose J. Guerrero
Ruben Martinez-Cantin
EgoV
31
14
0
05 Sep 2023
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen
  Convolutional CLIP
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VLM
CLIP
45
136
0
04 Aug 2023
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Contrastive Conditional Latent Diffusion for Audio-visual Segmentation
Yuxin Mao
Jing Zhang
Mochu Xiang
Yun-Qiu Lv
Yiran Zhong
Yuchao Dai
DiffM
43
28
0
31 Jul 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth
  Estimation
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
31
13
0
27 Jul 2023
Towards Deeply Unified Depth-aware Panoptic Segmentation with
  Bi-directional Guidance Learning
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning
Ju He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Jinpeng Lan
Bin Luo
Yifeng Geng
Xuansong Xie
MDE
23
8
0
27 Jul 2023
On Point Affiliation in Feature Upsampling
On Point Affiliation in Feature Upsampling
Wenze Liu
Hao Lu
Yuliang Liu
Zhiguo Cao
3DPC
26
2
0
17 Jul 2023
Towards Building Self-Aware Object Detectors via Reliable Uncertainty
  Quantification and Calibration
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration
Kemal Oksuz
Thomas Joy
P. Dokania
UQCV
23
16
0
03 Jul 2023
Efficient Multi-Task Scene Analysis with RGB-D Transformers
Efficient Multi-Task Scene Analysis with RGB-D Transformers
Söhnke Benedikt Fischedick
Daniel Seichter
Robin M. Schmidt
Leonard Rabes
H. Groß
25
9
0
08 Jun 2023
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image
  Interpretation in the Agricultural Domain
PhenoBench -- A Large Dataset and Benchmarks for Semantic Image Interpretation in the Agricultural Domain
J. Weyler
Federico Magistri
E. Marks
Yue Linn Chong
Matteo Sodano
Gianmarco Roggiolani
Nived Chebrolu
C. Stachniss
Jens Behley
32
30
0
07 Jun 2023
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Xiuye Gu
Huayu Chen
Jonathan Huang
Abdullah M. Rashwan
Boxin Wang
...
Golnaz Ghiasi
Weicheng Kuo
Huizhong Chen
Liang-Chieh Chen
David A. Ross
ISeg
28
26
0
02 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
50
28
0
01 Jun 2023
On the Importance of Backbone to the Adversarial Robustness of Object Detectors
On the Importance of Backbone to the Adversarial Robustness of Object Detectors
Xiao-Li Li
Hang Chen
Xiaolin Hu
AAML
38
4
0
27 May 2023
12345
Next