ResearchTrend.AI
arXiv:2312.14733
Harnessing Diffusion Models for Visual Perception with Meta Prompts

22 December 2023
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
    MDE
    VLM

Papers citing "Harnessing Diffusion Models for Visual Perception with Meta Prompts"

16 / 16 papers shown
What is the Added Value of UDA in the VFM Era?
B. B. Englert
Tommie Kerssies
Gijs Dubbelman
44
0
0
25 Apr 2025
PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes
Xinhua Xu
Hong Liu
Jianbing Wu
Jinfu Liu
DiffM
59
0
0
24 Mar 2025
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Ruicheng Wang
Sicheng Xu
Cassie Dai
Jianfeng Xiang
Yu Deng
Xin Tong
Jiaolong Yang
TPM
3DH
MDE
60
30
0
24 Oct 2024
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features
Benyuan Meng
Qianqian Xu
Zitai Wang
Xiaochun Cao
Qingming Huang
37
2
0
04 Oct 2024
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage
Denis Zavadski
Damjan Kalšan
Carsten Rother
DiffM
MDE
22
5
0
13 Sep 2024
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation
Nischal Khanal
Shivanand Venkanna Sheshappanavar
MDE
42
0
0
10 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
43
1
0
05 Sep 2024
Image Segmentation in Foundation Model Era: A Survey
Tianfei Zhou
Fei Zhang
Boyu Chang
Wenguan Wang
Ye Yuan
E. Konukoglu
Daniel Cremers
VLM
42
4
0
23 Aug 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
78
6
0
30 May 2024
Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Kangyang Xie
Binbin Yang
Hao Chen
Meng Wang
Cheng Zou
Hui Xue
Ming Yang
Chunhua Shen
DiffM
32
1
0
17 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
63
6
0
14 Mar 2024
MeSa: Masked, Geometric, and Supervised Pre-training for Monocular Depth Estimation
Muhammad Osama Khan
Junbang Liang
Chun-Kai Wang
Shan Yang
Yu Lou
MDE
49
4
0
06 Oct 2023
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation
Yiqun Duan
Xianda Guo
Zhengbiao Zhu
DiffM
MDE
48
68
0
09 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
160
217
0
03 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
270
4,244
0
30 Jan 2023
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie
Zigang Geng
Jingcheng Hu
Zheng-Wei Zhang
Han Hu
Yue Cao
VLM
191
139
0
26 May 2022