Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.09616
Cited By
Explore In-Context Segmentation via Latent Diffusion Models
14 March 2024
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Explore In-Context Segmentation via Latent Diffusion Models"
50 / 99 papers shown
Title
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Bolin Lai
F. Xu
Miao Liu
Xiaoliang Dai
Nikhil Mehta
...
Zeyi Huang
James M. Rehg
Sangmin Lee
Ning Zhang
Tong Xiao
98
3
0
02 Dec 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
Muzhi Zhu
Yang Liu
Zekai Luo
Chenchen Jing
Hao Chen
Guangkai Xu
Xinlong Wang
Chunhua Shen
DiffM
VLM
49
6
0
03 Oct 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
104
6
0
30 May 2024
OMG-Seg: Is One Model Good Enough For All Segmentation?
Xiangtai Li
Haobo Yuan
Wei Li
Henghui Ding
Size Wu
Wenwei Zhang
Yining Li
Kai Chen
Chen Change Loy
VLM
MLLM
ViT
113
55
0
18 Jan 2024
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting
Wouter Van Gansbeke
Bert De Brabandere
DiffM
85
11
0
18 Jan 2024
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Jianzong Wu
Xiangtai Li
Chenyang Si
Shangchen Zhou
Jingkang Yang
...
Yining Li
Kai Chen
Yunhai Tong
Ziwei Liu
Chen Change Loy
VGen
DiffM
MLLM
86
17
0
18 Jan 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
Haobo Yuan
Xiangtai Li
Chong Zhou
Yining Li
Kai Chen
Chen Change Loy
VLM
63
51
0
05 Jan 2024
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Qiang Wan
Zilong Huang
Bingyi Kang
Jiashi Feng
Li Zhang
MDE
VLM
66
16
0
22 Dec 2023
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
Chong Zhou
Xiangtai Li
Chen Change Loy
Bo Dai
VLM
63
46
0
11 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
VLM
CLIP
98
84
0
06 Dec 2023
UniGS: Unified Representation for Image Generation and Segmentation
Lu Qi
Lehan Yang
Weidong Guo
Yu-Syuan Xu
Bo Du
Varun Jampani
Ming-Hsuan Yang
64
18
0
04 Dec 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
56
20
0
26 Oct 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
100
36
0
22 Sep 2023
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng
Binxin Yang
Tiankai Hang
Chen Li
Shuyang Gu
...
Jianmin Bao
Zheng Zhang
Han Hu
DongDong Chen
Baining Guo
DiffM
VLM
73
98
0
07 Sep 2023
SLiMe: Segment Like Me
Aliasghar Khani
Saeid Asgari Taghanaki
Aditya Sanghi
Ali Mahdavi-Amiri
Ghassan Hamarneh
VLM
52
30
0
06 Sep 2023
DiffusionTrack: Diffusion Model For Multi-Object Tracking
Run Luo
Zikai Song
Lintao Ma
Ji Wei
Wei-Guo Yang
Min Yang
DiffM
66
29
0
19 Aug 2023
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Chen Change Loy
VOS
75
106
0
16 Aug 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
175
2,242
0
04 Jul 2023
DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
Weimin Tan
Siyuan Chen
Bo Yan
DiffM
37
25
0
03 Jul 2023
Towards Open Vocabulary Learning: A Survey
Jianzong Wu
Xiangtai Li
Shilin Xu
Haobo Yuan
Henghui Ding
...
Jiangning Zhang
Yu Tong
Xudong Jiang
Guohao Li
Dacheng Tao
ObjD
VLM
110
144
0
28 Jun 2023
Explore In-Context Learning for 3D Point Cloud Understanding
Zhongbin Fang
Xiangtai Li
Xia Li
J. M. Buhmann
Chen Change Loy
Mengyuan Liu
3DPC
40
25
0
14 Jun 2023
Towards In-context Scene Understanding
Ivana Balazevic
David Steiner
Nikhil Parthasarathy
Relja Arandjelović
Olivier J. Hénaff
66
29
0
02 Jun 2023
Personalize Segment Anything Model with One Shot
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Peng Gao
Hongsheng Li
MLLM
VLM
90
212
0
04 May 2023
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yi Ding
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
109
73
0
01 May 2023
Transformer-Based Visual Segmentation: A Survey
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
79
136
0
19 Apr 2023
Exploring Effective Factors for Improving Visual In-Context Learning
Yanpeng Sun
Qiang Chen
Jian Wang
Jingdong Wang
Zechao Li
LRM
VLM
81
24
0
10 Apr 2023
SegGPT: Segmenting Everything In Context
Xinlong Wang
Xiaosong Zhang
Yue Cao
Wen Wang
Chunhua Shen
Tiejun Huang
VOS
MLLM
VLM
63
203
0
06 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
302
7,047
0
05 Apr 2023
Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
Donggyun Kim
Jinwoo Kim
Seongwoong Cho
Chong Luo
Seunghoon Hong
VLM
84
23
0
27 Mar 2023
MaskDiff: Modeling Mask Distribution with Diffusion Probabilistic Model for Few-Shot Instance Segmentation
Minh-Quan Le
Tam V. Nguyen
Trung-Nghia Le
Thanh-Toan Do
Minh N. Do
M. Tran
DiffM
55
13
0
09 Mar 2023
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
Jiarui Xu
Sifei Liu
Arash Vahdat
Wonmin Byeon
Xiaolong Wang
Shalini De Mello
VLM
254
327
0
08 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
213
225
0
03 Mar 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
89
4,015
1
10 Feb 2023
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding
Chang Liu
Shuting He
Xudong Jiang
Philip Torr
S. Bai
VOS
67
139
0
03 Feb 2023
What Makes Good Examples for Visual In-Context Learning?
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
MLLM
VPVLM
VLM
LRM
51
110
0
31 Jan 2023
Open-vocabulary Object Segmentation with Diffusion Models
Ziyi Li
Qinye Zhou
Xiaoyun Zhang
Ya Zhang
Yanfeng Wang
Weidi Xie
VLM
98
65
0
12 Jan 2023
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
73
2,182
0
19 Dec 2022
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
Haoxing Chen
Zhuoer Xu
Jun Lan
Changhua Meng
Weiqiang Wang
DiffM
49
67
0
06 Dec 2022
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
Xinlong Wang
Wen Wang
Yue Cao
Chunhua Shen
Tiejun Huang
VLM
MLLM
89
249
0
05 Dec 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
165
1,764
0
17 Nov 2022
DiffusionDet: Diffusion Model for Object Detection
Shoufa Chen
Pei Sun
Yibing Song
Ping Luo
99
450
0
17 Nov 2022
Feature-Proxy Transformer for Few-Shot Segmentation
Jianwei Zhang
Yifan Sun
Yi Yang
Wei Chen
ViT
46
62
0
13 Oct 2022
A Generalist Framework for Panoptic Segmentation of Images and Videos
Ting-Li Chen
Lala Li
Saurabh Saxena
Geoffrey E. Hinton
David J. Fleet
VGen
MLLM
51
102
0
12 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
100
492
0
12 Oct 2022
Visual Prompting via Image Inpainting
Amir Bar
Yossi Gandelsman
Trevor Darrell
Amir Globerson
Alexei A. Efros
VLM
VPVLM
46
204
0
01 Sep 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
141
1,746
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
100
1,837
0
02 Aug 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
136
3,830
0
26 Jul 2022
Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation
Sunghwan Hong
Seokju Cho
Jisu Nam
Stephen Lin
Seung Wook Kim
ViT
61
128
0
22 Jul 2022
Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
Xinyu Shi
Dong Wei
Yu Zhang
Donghuan Lu
Munan Ning
Jiashun Chen
Kai Ma
Yefeng Zheng
41
102
0
18 Jul 2022
1
2
Next