2211.08332
Cited By
v4 (latest)
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
Github (1328★)
Papers citing
"Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"
50 / 143 papers shown
Title
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Kevin Rojas
Yuchen Zhu
Sichen Zhu
Felix X.-F. Ye
Molei Tao
DiffM
21
0
0
09 Jun 2025
NSD-Imagery: A benchmark dataset for extending fMRI vision decoding methods to mental imagery
Reese Kneeland
Paul S. Scotti
Ghislain St-Yves
Jesse Breedlove
Kendrick Norris Kay
Thomas Naselaris
18
0
0
07 Jun 2025
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers
Yan Gong
Yiren Song
Yicheng Li
Chenglin Li
Yin Zhang
KELM
60
0
0
03 Jun 2025
HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image
Junyi Guo
Jingxuan Zhang
Fangyu Wu
Huanda Lu
Qiufeng Wang
Wenmian Yang
Eng Gee Lim
Dongming Lu
DiffM
17
0
0
29 May 2025
Improving Brain-to-Image Reconstruction via Fine-Grained Text Bridging
Runze Xia
Shuo Feng
Renzhi Wang
Congchi Yin
Xuyun Wen
Piji Li
DiffM
42
0
0
28 May 2025
Exploring The Visual Feature Space for Multimodal Neural Decoding
Weihao Xia
Cengiz Öztireli
77
0
0
21 May 2025
Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI
Marlène Careil
Yohann Benchetrit
Jean-Rémi King
194
0
0
20 May 2025
FreqSelect: Frequency-Aware fMRI-to-Image Reconstruction
Junliang Ye
Lei Wang
Md Zakir Hossain
DiffM
65
0
0
18 May 2025
Bootstrapping Diffusion: Diffusion Model Training Leveraging Partial and Corrupted Data
Xudong Ma
76
0
0
17 May 2025
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
80
0
0
13 May 2025
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Amara Tariq
Rimita Lahiri
Charles Kahn
Imon Banerjee
66
0
0
12 May 2025
Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation
Daniele Molino
Francesco Di Feola
Linlin Shen
Paolo Soda
V. Guarrasi
MedIm
LM&MA
132
1
0
02 May 2025
The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning
Siyi Chen
Yimeng Zhang
Sijia Liu
Q. Qu
AAML
422
0
0
30 Apr 2025
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Junxi Chen
Junhao Dong
Xiaohua Xie
89
0
0
08 Apr 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Xian Zhong
Feifei Zhang
Rubing Huang
Chia-Wen Lin
65
0
0
04 Apr 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
184
1
0
26 Mar 2025
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
Pengyu Liu
Guohua Dong
D. Guo
Kun Li
Fengling Li
Xun Yang
Meng Wang
Xiaomin Ying
AI4CE
90
0
0
20 Mar 2025
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
Leyang Wang
Joice Lin
DiffM
116
0
0
20 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-Jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yue Yang
221
2
0
16 Mar 2025
EQ-TAA: Equivariant Traffic Accident Anticipation via Diffusion-Based Accident Video Synthesis
Jianwu Fang
Lei-lei Li
Zhedong Zheng
Hongkai Yu
Jianru Xue
Zhengguo Li
Tat-Seng Chua
26
0
0
16 Mar 2025
Make Optimization Once and for All with Fine-grained Guidance
Mingjia Shi
Ruihan Lin
Xuxi Chen
Yuhao Zhou
Zezhen Ding
...
Tong Wang
Kai Wang
Zhangyang Wang
Jing Zhang
Tianlong Chen
121
1
0
14 Mar 2025
Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Cagri Gungor
Derek Eppinger
Adriana Kovashka
97
0
0
10 Mar 2025
SEED: Towards More Accurate Semantic Evaluation for Visual Brain Decoding
Juhyeon Park
P. Y. Kim
Jiook Cha
Shinjae Yoo
Taesup Moon
95
0
0
09 Mar 2025
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach
Soumyadeep Ro
Sanapala Satwika
Pamarthi Yasoda Gayathri
Mohmmad Ghaith Balsha
Aysegul Ucar
VLM
ObjD
150
0
0
06 Mar 2025
Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation
Zhichao Yang
Leida Li
Pengfei Chen
Jinjian Wu
Giuseppe Valenzise
108
0
0
04 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
104
1
0
04 Mar 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
133
0
0
07 Feb 2025
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Zhibo Tian
Ruijie Quan
Fan Ma
Kun Zhan
Yi Yang
113
1
0
24 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
106
1
0
20 Jan 2025
XGeM: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
Daniele Molino
Francesco Di Feola
E. Faiella
Deborah Fazzini
D. Santucci
Linlin Shen
V. Guarrasi
Paolo Soda
SyDa
MedIm
127
1
0
08 Jan 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
149
13
0
31 Dec 2024
D-Judge: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance
Renyang Liu
Ziyu Lyu
Wei Zhou
See-Kiong Ng
EGVM
85
0
0
23 Dec 2024
Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data
Lorenzo Veronese
Andrea Moglia
Luca Mainardi
Pietro Cerveri
DiffM
128
0
0
17 Dec 2024
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models
Yuning Han
Bingyin Zhao
Rui Chu
Feng Luo
Biplab Sikdar
Yingjie Lao
DiffM
AAML
203
1
0
16 Dec 2024
COBRA: A Continual Learning Approach to Vision-Brain Understanding
Xuan-Bac Nguyen
Arabinda Kumar Choudhary
Pawan Sinha
Xin Li
Khoa Luu
CLL
141
0
0
25 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
164
9
0
25 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
Hao Zhang
Yueting Zhuang
DiffM
236
29
0
24 Nov 2024
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Hoang-Quan Nguyen
Xuan-Bac Nguyen
Hugh Churchill
Arabinda Kumar Choudhary
Pawan Sinha
S. Khan
Khoa Luu
163
1
0
20 Nov 2024
Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models
Yanchen Wang
Adam Turnbull
Tiange Xiang
Yunlong Xu
Sa Zhou
Adnan Masoud
Shekoofeh Azizi
F. Lin
Ehsan Adeli
87
1
0
11 Nov 2024
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
David Mayo
Christopher Wang
Asa Harbin
Abdulrahman Alabdulkareem
Albert Eaton Shaw
Boris Katz
Andrei Barbu
DiffM
105
2
0
05 Nov 2024
MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts
Jie Zhu
Yukang Chen
Mingyu Ding
Ping Luo
Leye Wang
Jingdong Wang
DiffM
69
5
0
30 Oct 2024
David and Goliath: Small One-step Model Beats Large Diffusion with Score Post-training
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
96
4
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
114
9
0
24 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
124
13
0
19 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
162
19
0
10 Oct 2024
Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance
Jaehoon Joo
Taejin Jeong
Seongjae Hwang
DiffM
88
3
0
18 Sep 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
89
2
0
15 Sep 2024
Spiking Diffusion Models
Jiahang Cao
Hanzhong Guo
Ziqing Wang
Deming Zhou
Hao Cheng
Qiang Zhang
Renjing Xu
DiffM
95
3
0
29 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
Fahad Shahbaz Khan
Hideki Koike
DiffM
64
0
0
14 Aug 2024
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
123
2
0
13 Aug 2024