arXiv:2211.08332
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
Papers citing "Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"
50 / 140 papers shown
Title
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
29
0
0
13 May 2025
Position: Restructuring of Categories and Implementation of Guidelines Essential for VLM Adoption in Healthcare
Amara Tariq
Rimita Lahiri
Charles Kahn
Imon Banerjee
26
0
0
12 May 2025
Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation
Daniele Molino
Francesco Di Feola
Linlin Shen
Paolo Soda
V. Guarrasi
MedIm
LM&MA
67
0
0
02 May 2025
The Dual Power of Interpretable Token Embeddings: Jailbreaking Attacks and Defenses for Diffusion Model Unlearning
Siyi Chen
Yimeng Zhang
Sijia Liu
Q. Qu
AAML
144
0
0
30 Apr 2025
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Junxi Chen
Junhao Dong
Xiaohua Xie
35
0
0
08 Apr 2025
QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning
Quanxing Xu
Ling Zhou
Xian Zhong
Feifei Zhang
Rubing Huang
Chia-Wen Lin
36
0
0
04 Apr 2025
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Prin Phunyaphibarn
Phillip Y. Lee
Jaihoon Kim
Minhyuk Sung
DiffM
89
0
0
26 Mar 2025
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
Pengyu Liu
Guohua Dong
D. Guo
Kun Li
Fengling Li
Xun Yang
Meng Wang
Xiaomin Ying
AI4CE
41
0
0
20 Mar 2025
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
Leyang Wang
Joice Lin
DiffM
63
0
0
20 Mar 2025
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
Tsu-jui Fu
Yusu Qian
Chen Chen
Wenze Hu
Zhe Gan
Yuqing Yang
97
1
0
16 Mar 2025
Make Optimization Once and for All with Fine-grained Guidance
Mingjia Shi
Ruihan Lin
Xuxi Chen
Yuhao Zhou
Zezhen Ding
...
Tong Wang
Kai Wang
Zhangyang Wang
Jingyang Zhang
Tianlong Chen
55
1
0
14 Mar 2025
Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Cagri Gungor
Derek Eppinger
Adriana Kovashka
56
0
0
10 Mar 2025
SEED: Towards More Accurate Semantic Evaluation for Visual Brain Decoding
Juhyeon Park
P. Y. Kim
Jiook Cha
Shinjae Yoo
Taesup Moon
48
0
0
09 Mar 2025
Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach
Soumyadeep Ro
Sanapala Satwika
Pamarthi Yasoda Gayathri
Mohmmad Ghaith Balsha
Aysegul Ucar
VLM
ObjD
56
0
0
06 Mar 2025
Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation
Zhichao Yang
Leida Li
Pengfei Chen
Jinjian Wu
Giuseppe Valenzise
68
0
0
04 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
36
0
0
04 Mar 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
65
0
0
07 Feb 2025
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
Zhibo Tian
Ruijie Quan
Fan Ma
Kun Zhan
Yi Yang
31
1
0
24 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
44
1
0
20 Jan 2025
MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
Daniele Molino
Francesco Di Feola
E. Faiella
Deborah Fazzini
D. Santucci
Linlin Shen
V. Guarrasi
Paolo Soda
SyDa
MedIm
41
0
0
10 Jan 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
51
9
0
31 Dec 2024
D-Judge: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance
Renyang Liu
Ziyu Lyu
Wei Zhou
See-Kiong Ng
EGVM
35
0
0
23 Dec 2024
Optimized two-stage AI-based Neural Decoding for Enhanced Visual Stimulus Reconstruction from fMRI Data
Lorenzo Veronese
Andrea Moglia
Luca Mainardi
Pietro Cerveri
DiffM
64
0
0
17 Dec 2024
UIBDiffusion: Universal Imperceptible Backdoor Attack for Diffusion Models
Yuning Han
Bingyin Zhao
Rui Chu
Feng Luo
Biplab Sikdar
Yingjie Lao
DiffM
AAML
77
1
0
16 Dec 2024
COBRA: A Continual Learning Approach to Vision-Brain Understanding
Xuan-Bac Nguyen
Arabinda Kumar Choudhary
Pawan Sinha
Xin Li
Khoa Luu
CLL
76
0
0
25 Nov 2024
One Diffusion to Generate Them All
Duong H. Le
Tuan Pham
Sangho Lee
Christopher Clark
Aniruddha Kembhavi
Stephan Mandt
Ranjay Krishna
Jiasen Lu
VLM
74
5
0
25 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
106
15
0
24 Nov 2024
Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding
Hoang-Quan Nguyen
Xuan-Bac Nguyen
Hugh Churchill
Arabinda Kumar Choudhary
Pawan Sinha
S. Khan
Khoa Luu
69
1
0
20 Nov 2024
Decoding Visual Experience and Mapping Semantics through Whole-Brain Analysis Using fMRI Foundation Models
Yanchen Wang
Adam Turnbull
Tiange Xiang
Yunlong Xu
Sa Zhou
Adnan Masoud
Shekoofeh Azizi
F. Lin
Ehsan Adeli
32
0
0
11 Nov 2024
BrainBits: How Much of the Brain are Generative Reconstruction Methods Using?
David Mayo
Christopher Wang
Asa Harbin
Abdulrahman Alabdulkareem
Albert Eaton Shaw
Boris Katz
Andrei Barbu
DiffM
44
0
0
05 Nov 2024
MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts
Jie Zhu
Y. Chen
Mingyu Ding
Ping Luo
Leye Wang
Jingdong Wang
DiffM
39
3
0
30 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
28
3
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
36
6
0
24 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
41
11
0
19 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
63
13
0
10 Oct 2024
Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance
Jaehoon Joo
Taejin Jeong
Seongjae Hwang
DiffM
32
2
0
18 Sep 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
47
1
0
15 Sep 2024
Spiking Diffusion Models
Jiahang Cao
Hanzhong Guo
Ziqing Wang
Deming Zhou
Hao Cheng
Qiang Zhang
Renjing Xu
DiffM
43
3
0
29 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
F. Khan
Hideki Koike
DiffM
39
0
0
14 Aug 2024
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
39
2
0
13 Aug 2024
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
29
0
0
31 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
42
4
0
24 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
60
30
0
17 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen
DiffM
MDE
22
0
0
11 Jul 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
38
0
0
18 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe-nan Lin
Rita Singh
Bhiksha Raj
DiffM
43
21
0
14 Jun 2024
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Jiayi Guo
Junhao Zhao
Chunjiang Ge
Chaoqun Du
Zanlin Ni
Shiji Song
Humphrey Shi
Gao Huang
TTA
DiffM
42
5
0
06 Jun 2024
Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design
Ye Wang
Nicole B. Damen
Thomas Gale
Voho Seo
Hooman Shayani
AI4CE
34
3
0
06 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Liang Feng
42
0
0
03 Jun 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedIm
AI4CE
25
0
0
28 May 2024