ResearchTrend.AI
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
    DiffM
ArXiv (abs) | PDF | HTML | GitHub (1328★)

Papers citing "Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"

50 / 143 papers shown
Title
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen, DiffM
53
0
0
31 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
91
5
0
24 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
121
45
0
17 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen, DiffM, MDE
69
0
0
11 Jul 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
75
0
0
18 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe Lin
Rita Singh
Bhiksha Raj
DiffM
93
27
0
14 Jun 2024
Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Jiayi Guo
Junhao Zhao
Chunjiang Ge
Chaoqun Du
Zanlin Ni
Shiji Song
Humphrey Shi
Gao Huang
TTA, DiffM
84
6
0
06 Jun 2024
Inspired by AI? A Novel Generative AI System To Assist Conceptual Automotive Design
Ye Wang
Nicole B. Damen
Thomas Gale
Voho Seo
Hooman Shayani
AI4CE
68
3
0
06 Jun 2024
Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras
Lingen Li
Mingde Yao
Xingyu Meng
Muquan Yu
Tianfan Xue
Liang Feng
86
0
0
03 Jun 2024
MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI
Inhwa Han
Jaayeon Lee
Jong Chul Ye
MedIm, AI4CE
90
0
0
28 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
73
1
0
26 May 2024
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation
Gwanghyun Kim
Alonso Martinez
Yu-Chuan Su
Brendan Jou
José Lezama
...
Lijun Yu
Lu Jiang
A. Jansen
Jacob Walker
Krishna Somandepalli
77
9
0
22 May 2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Jiachen Li
Xinyao Wang
Sijie Zhu
Chia-Wen Kuo
Lu Xu
Fan Chen
Jitesh Jain
Humphrey Shi
Longyin Wen
MLLM, MoE
100
33
0
09 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Jiaxin Wu
Zhen Lei
Zhaoxiang Zhang
Qing Li
EGVM
221
22
0
09 May 2024
Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey
Minrui Xu
Dusit Niyato
Jiawen Kang
Zehui Xiong
Abbas Jamalipour
Yuguang Fang
Dong In Kim
Xuemin Shen
44
6
0
25 Apr 2024
UVMap-ID: A Controllable and Personalized UV Map Generative Model
Weijie Wang
Jichao Zhang
Chang Liu
Xia Li
Xingqian Xu
Humphrey Shi
N. Sebe
Bruno Lepri
91
3
0
22 Apr 2024
Object-Attribute Binding in Text-to-Image Generation: Evaluation and Control
Maria Mihaela Truşcǎ
Wolf Nuyts
Jonathan Thomm
Robert Honig
Thomas Hofmann
Tinne Tuytelaars
Marie-Francine Moens
42
5
0
21 Apr 2024
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Yuchi Wang
Shuhuai Ren
Rundong Gao
Linli Yao
Qingyan Guo
Kaikai An
Jianhong Bai
Xu Sun
DiffM, VLM
106
9
0
16 Apr 2024
MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
MoMe
88
7
0
15 Apr 2024
Magic Clothing: Controllable Garment-Driven Image Synthesis
Weifeng Chen
Tao Gu
Yuhao Xu
Chengcai Chen
96
18
0
15 Apr 2024
MindBridge: A Cross-Subject Brain Decoding Framework
Shizun Wang
Songhua Liu
Zhenxiong Tan
Xinchao Wang
AI4CE
145
29
0
11 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
74
9
0
10 Apr 2024
Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI
Hugo Caselles-Dupré
Charles Mellerio
Paul Hérent
Alizée Lopez-Persem
Benoit Béranger
Mathieu Soularue
Pierre Fautrel
Gauthier Vernier
Matthieu Cord
VGen, MedIm, DiffM
59
1
0
08 Apr 2024
Dynamic Prompt Optimizing for Text-to-Image Generation
Wenyi Mo
Tianyu Zhang
Yalong Bai
Fuchun Sun
Ji-Rong Wen
Qing Yang
84
13
0
05 Apr 2024
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity
Ruijie Quan
Wenguan Wang
Zhibo Tian
Fan Ma
Yi Yang
82
13
0
29 Mar 2024
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
Jingyang Huo
Yikai Wang
Xuelin Qian
Yun Wang
Chong Li
Jianfeng Feng
Yanwei Fu
DiffM, MedIm
77
10
0
27 Mar 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
Yibo Wang
Ruiyuan Gao
Kai Chen
Kaiqiang Zhou
Yingjie Cai
...
Zhenguo Li
Lihui Jiang
Dit-Yan Yeung
Qiang Xu
Kai Zhang
DiffM
173
27
0
20 Mar 2024
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Paul S. Scotti
Mihir Tripathy
Cesar Kadir Torrico Villanueva
Reese Kneeland
Tong Chen
...
Charan Santhirasegaran
Jonathan Xu
Thomas Naselaris
Kenneth A. Norman
Tanishq Mathew Abraham
96
45
0
17 Mar 2024
See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
Yulong Liu
Yongqiang Ma
Guibo Zhu
Haodong Jing
Nanning Zheng
55
4
0
11 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
132
38
0
07 Mar 2024
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
128
51
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
EGVM
263
103
0
27 Feb 2024
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan
Artur Shatveryan
David Kocharyan
Zhangyang Wang
Humphrey Shi
EGVM
131
6
0
15 Feb 2024
Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin
Bo Li
Baao Xie
Wenyao Zhang
Jinming Liu
Ziqiang Li
Tao Yang
Wenjun Zeng
DRL, DiffM, CoGe
91
8
0
04 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
80
11
0
03 Feb 2024
CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion
Nisha Huang
Weiming Dong
Yuxin Zhang
Fan Tang
Ronghui Li
Chongyang Ma
Xiu Li
Tong-Yee Lee
Changsheng Xu
DiffM
83
7
0
25 Jan 2024
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
177
11
0
31 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
73
4
0
27 Dec 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
97
31
0
21 Dec 2023
Brain-optimized inference improves reconstructions of fMRI brain activity
Reese Kneeland
Jordyn Ojeda
Ghislain St-Yves
Thomas Naselaris
AI4CE
58
6
0
12 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
10
0
11 Dec 2023
Offloading and Quality Control for AI Generated Content Services in 6G Mobile Edge Computing Networks
Yi-Ting Wang
Chang Liu
Jun Zhao
47
1
0
11 Dec 2023
Diffusion for Natural Image Matting
Yihan Hu
Yiheng Lin
Wei Wang
Yao-Min Zhao
Yunchao Wei
Humphrey Shi
103
9
0
10 Dec 2023
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Jiayi Guo
Xingqian Xu
Yifan Pu
Zanlin Ni
Chaofei Wang
Manushree Vasu
Shiji Song
Gao Huang
Humphrey Shi
DiffM
76
32
0
07 Dec 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM, VGen
97
125
0
28 Nov 2023
UGG: Unified Generative Grasping
Jiaxin Lu
Hao Kang
Haoxiang Li
Bo Liu
Yiding Yang
Qixing Huang
Gang Hua
102
25
0
28 Nov 2023
Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net
Zizhao Hu
Shaochong Jia
Mohammad Rostami
DiffM, MedIm
50
0
0
28 Nov 2023
Gaussian Mixture Solvers for Diffusion Models
Hanzhong Guo
Cheng Lu
Fan Bao
Tianyu Pang
Shuicheng Yan
Chao Du
Chongxuan Li
72
11
0
02 Nov 2023
Diversity and Diffusion: Observations on Synthetic Image Distributions with Stable Diffusion
David Marwood
S. Baluja
Y. Alon
DiffM
96
6
0
31 Oct 2023