Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.08332
Cited By
v1
v2
v3
v4 (latest)
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1328★)
Papers citing
"Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"
50 / 143 papers shown
Title
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
53
0
0
31 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
91
5
0
24 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
121
45
0
17 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen
DiffM
MDE
69
0
0
11 Jul 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
75
0
0
18 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe Lin
Rita Singh
Bhiksha Raj
DiffM
93
27
0
14 Jun 2024
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Jiayi Guo
Junhao Zhao
Chunjiang Ge
Chaoqun Du
Zanlin Ni
Shiji Song
Humphrey Shi
Gao Huang
TTA
DiffM
84
6
0
06 Jun 2024
Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design
Ye Wang
Nicole B. Damen
Thomas Gale
Voho Seo
Hooman Shayani
AI4CE
68
3
0
06 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Liang Feng
86
0
0
03 Jun 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedIm
AI4CE
90
0
0
28 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
73
1
0
26 May 2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Gwanghyun Kim
Alonso Martinez
Yu-Chuan Su
Brendan Jou
José Lezama
...
Lijun Yu
Lu Jiang
A. Jansen
Jacob Walker
Krishna Somandepalli
77
9
0
22 May 2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li
Xinyao Wang
Sijie Zhu
Chia-Wen Kuo
Lu Xu
Fan Chen
Jitesh Jain
Humphrey Shi
Longyin Wen
MLLM
MoE
100
33
0
09 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhen Lei
Zhaoxiang Zhang
Zhen Lei
Qing Li
EGVM
221
22
0
09 May 2024
Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey
Minrui Xu
Dusit Niyato
Jiawen Kang
Zehui Xiong
Abbas Jamalipour
Yuguang Fang
Dong In Kim
Xuemin
X. Shen
44
6
0
25 Apr 2024
UVMap-ID: A Controllable and Personalized UV Map Generative Model
Weijie Wang
Jichao Zhang
Chang Liu
Xia Li
Xingqian Xu
Humphrey Shi
N. Sebe
Bruno Lepri
91
3
0
22 Apr 2024
Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Maria Mihaela Truşcǎ
Wolf Nuyts
Jonathan Thomm
Robert Honig
Thomas Hofmann
Tinne Tuytelaars
Marie-Francine Moens
42
5
0
21 Apr 2024
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Yuchi Wang
Shuhuai Ren
Rundong Gao
Linli Yao
Qingyan Guo
Kaikai An
Jianhong Bai
Xu Sun
DiffM
VLM
106
9
0
16 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
88
7
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
96
18
0
15 Apr 2024
MindBridge: A Cross-Subject Brain Decoding Framework
Shizun Wang
Songhua Liu
Zhenxiong Tan
Xinchao Wang
AI4CE
145
29
0
11 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
74
9
0
10 Apr 2024
Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI
Hugo Caselles-Dupré
Charles Mellerio
Paul Hérent
Alizée Lopez-Persem
Benoit Béranger
Mathieu Soularue
Pierre Fautrel
Gauthier Vernier
Matthieu Cord
VGen
MedIm
DiffM
59
1
0
08 Apr 2024
Dynamic Prompt Optimizing for Text-to-Image Generation
Wenyi Mo
Tianyu Zhang
Yalong Bai
Fuchun Sun
Ji-Rong Wen
Qing Yang
84
13
0
05 Apr 2024
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity
Ruijie Quan
Wenguan Wang
Zhibo Tian
Fan Ma
Yi Yang
82
13
0
29 Mar 2024
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
Jingyang Huo
Yikai Wang
Xuelin Qian
Yun Wang
Chong Li
Jianfeng Feng
Yanwei Fu
DiffM
MedIm
77
10
0
27 Mar 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
173
27
0
20 Mar 2024
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Paul S. Scotti
Mihir Tripathy
Cesar Kadir Torrico Villanueva
Reese Kneeland
Tong Chen
...
Charan Santhirasegaran
Jonathan Xu
Thomas Naselaris
Kenneth A. Norman
Tanishq Mathew Abraham
96
45
0
17 Mar 2024
See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
Yulong Liu
Yongqiang Ma
Guibo Zhu
Haodong Jing
Nanning Zheng
55
4
0
11 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
132
38
0
07 Mar 2024
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
128
51
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan
Artur Shatveryan
David Kocharyan
Zhangyang Wang
Humphrey Shi
EGVM
131
6
0
15 Feb 2024
Closed-Loop Unsupervised Representation Disentanglement with
β
β
β
-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin
Bo Li
Baao Xie
Wenyao Zhang
Jinming Liu
Ziqiang Li
Tao Yang
Wenjun Zeng
DRL
DiffM
CoGe
91
8
0
04 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
80
11
0
03 Feb 2024
CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion
Nisha Huang
Weiming Dong
Yuxin Zhang
Fan Tang
Ronghui Li
Chongyang Ma
Xiu Li
Tong-Yee Lee
Changsheng Xu
DiffM
83
7
0
25 Jan 2024
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
177
11
0
31 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
73
4
0
27 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
97
31
0
21 Dec 2023
Brain-optimized inference improves reconstructions of fMRI brain activity
Reese Kneeland
Jordyn Ojeda
Ghislain St-Yves
Thomas Naselaris
AI4CE
58
6
0
12 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
10
0
11 Dec 2023
Offloading and Quality Control for AI Generated Content Services in 6G Mobile Edge Computing Networks
Yi-Ting Wang
Chang Liu
Jun Zhao
47
1
0
11 Dec 2023
Diffusion for Natural Image Matting
Yihan Hu
Yiheng Lin
Wei Wang
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
103
9
0
10 Dec 2023
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Jiayi Guo
Xingqian Xu
Yifan Pu
Zanlin Ni
Chaofei Wang
Manushree Vasu
Shiji Song
Gao Huang
Humphrey Shi
DiffM
76
32
0
07 Dec 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
97
125
0
28 Nov 2023
UGG: Unified Generative Grasping
Jiaxin Lu
Hao Kang
Haoxiang Li
Bo Liu
Yiding Yang
Qixing Huang
Gang Hua
102
25
0
28 Nov 2023
Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net
Zizhao Hu
Shaochong Jia
Mohammad Rostami
DiffM
MedIm
50
0
0
28 Nov 2023
Gaussian Mixture Solvers for Diffusion Models
Hanzhong Guo
Cheng Lu
Fan Bao
Tianyu Pang
Shuicheng Yan
Chao Du
Chongxuan Li
72
11
0
02 Nov 2023
Diversity and Diffusion: Observations on Synthetic Image Distributions with Stable Diffusion
David Marwood
S. Baluja
Y. Alon
DiffM
96
6
0
31 Oct 2023
Previous
1
2
3
Next