ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.13592
  4. Cited By
Multimodal Image Synthesis and Editing: The Generative AI Era

Multimodal Image Synthesis and Editing: The Generative AI Era

27 December 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
    EGVM
ArXivPDFHTML

Papers citing "Multimodal Image Synthesis and Editing: The Generative AI Era"

50 / 314 papers shown
Title
A Bias-Free Training Paradigm for More General AI-generated Image Detection
A Bias-Free Training Paradigm for More General AI-generated Image Detection
Fabrizio Guillaro
Giada Zingarini
Ben Usman
Avneesh Sud
D. Cozzolino
L. Verdoliva
DiffM
108
4
0
23 Dec 2024
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Xinrui Zhou
Yuhao Huang
Haoran Dou
Shijing Chen
Ao Chang
...
Jie Jessie Ren
Ruobing Huang
Jun Cheng
Wufeng Xue
Dong Ni
MedIm
290
0
0
25 Sep 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
101
14
0
17 Jul 2024
Local 3D Editing via 3D Distillation of CLIP Knowledge
Local 3D Editing via 3D Distillation of CLIP Knowledge
J. Hyung
Sung Ju Hwang
Daejin Kim
Hyunji Lee
Jaegul Choo
38
23
0
21 Jun 2023
Improving visual image reconstruction from human brain activity using
  latent diffusion models via multiple decoded inputs
Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs
Yu Takagi
Shinji Nishimoto
AI4CE
DiffM
45
23
0
20 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human
  Preferences of Text-to-Image Synthesis
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
69
274
0
15 Jun 2023
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with
  Variational Score Distillation
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang
Cheng Lu
Yikai Wang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
149
837
0
25 May 2023
UniControl: A Unified Diffusion Model for Controllable Visual Generation
  In the Wild
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Can Qin
Shu Zhen Zhang
Ning Yu
Yihao Feng
Xinyi Yang
...
Caiming Xiong
Silvio Savarese
Stefano Ermon
Yun Fu
Ran Xu
71
128
0
18 May 2023
Drag Your GAN: Interactive Point-based Manipulation on the Generative
  Image Manifold
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
Xingang Pan
A. Tewari
Thomas Leimkuhler
Lingjie Liu
Abhimitra Meka
Christian Theobalt
DiffM
70
238
0
18 May 2023
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation
  with Neural Radiance Fields
OR-NeRF: Object Removing from 3D Scenes Guided by Multiview Segmentation with Neural Radiance Fields
Youtan Yin
Zhoujie Fu
Fan Yang
Guosheng Lin
70
29
0
17 May 2023
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image
  Generation
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation
Yuval Kirstain
Adam Polyak
Uriel Singer
Shahbuland Matiana
Joe Penna
Omer Levy
EGVM
185
375
0
02 May 2023
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking
  Face Generation
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation
Zhenhui Ye
Jinzheng He
Ziyue Jiang
Rongjie Huang
Jia-Bin Huang
Jinglin Liu
Yixiang Ren
Xiang Yin
Zejun Ma
Zhou Zhao
CVBM
82
29
0
01 May 2023
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial
  Animations
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations
Rongliang Wu
Yingchen Yu
Fangneng Zhan
Jiahui Zhang
Xiaoqin Zhang
Shijian Lu
CVBM
28
9
0
18 Apr 2023
SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing
  Field
SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field
Chong Bao
Yinda Zhang
Bangbang Yang
Tianxing Fan
Zesong Yang
Hujun Bao
Guofeng Zhang
Zhaopeng Cui
DiffM
219
96
0
23 Mar 2023
SKED: Sketch-guided Text-based 3D Editing
SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili
Or Perel
Mehdi Safaee
Daniel Cohen-Or
Ali Mahdavi-Amiri
DiffM
53
66
0
19 Mar 2023
Regularized Vector Quantization for Tokenized Image Synthesis
Regularized Vector Quantization for Tokenized Image Synthesis
Jiahui Zhang
Fangneng Zhan
Christian Theobalt
Shijian Lu
DiffM
MQ
61
30
0
11 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Scaling up GANs for Text-to-Image Synthesis
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
61
463
0
09 Mar 2023
Consistency Models
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLM
DiffM
90
930
0
02 Mar 2023
3D generation on ImageNet
3D generation on ImageNet
Ivan Skorokhodov
Aliaksandr Siarohin
Yinghao Xu
Jian Ren
Hsin-Ying Lee
Peter Wonka
Sergey Tulyakov
91
55
0
02 Mar 2023
3D-aware Conditional Image Synthesis
3D-aware Conditional Image Synthesis
Kangle Deng
Gengshan Yang
Deva Ramanan
Sitong Su
75
30
0
16 Feb 2023
LayoutDiffuse: Adapting Foundational Diffusion Models for
  Layout-to-Image Generation
LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation
Jiaxin Cheng
Xiao Liang
Xingjian Shi
Tong He
Tianjun Xiao
Mu Li
DiffM
57
67
0
16 Feb 2023
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation
Omer Bar-Tal
Lior Yariv
Y. Lipman
Tali Dekel
71
377
1
16 Feb 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
70
4,015
1
10 Feb 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face
  Synthesis
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
Zhenhui Ye
Ziyue Jiang
Yi Ren
Jinglin Liu
Jinzheng He
Zhou Zhao
CVBM
66
125
0
31 Jan 2023
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven
  Portraits Animation
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
Shuai Shen
Wenliang Zhao
Zibin Meng
Wanhua Li
Zhengbiao Zhu
Jie Zhou
Jiwen Lu
DiffM
VGen
58
104
0
10 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
227
539
0
02 Jan 2023
Exploring Transformer Backbones for Image Diffusion Models
Exploring Transformer Backbones for Image Diffusion Models
Princy Chahal
23
3
0
27 Dec 2022
Removing Objects From Neural Radiance Fields
Removing Objects From Neural Radiance Fields
Silvan Weder
Guillermo Garcia-Hernando
Áron Monszpart
Marc Pollefeys
Gabriel J. Brostow
Michael Firman
Sara Vicente
96
60
0
22 Dec 2022
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and
  Video Generation
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan
Yi Ma
Huan Yang
Huiguo He
Bei Liu
Jianlong Fu
Nicholas Jing Yuan
Qin Jin
B. Guo
DiffM
VGen
78
183
0
19 Dec 2022
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
101
156
0
08 Dec 2022
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive
  Pre-Training
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training
Ling Yang
Zhilin Huang
Yang Song
Shenda Hong
Ge Li
Wentao Zhang
Tengjiao Wang
Guohao Li
Ming-Hsuan Yang
61
54
0
21 Nov 2022
EDGE: Editable Dance Generation From Music
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
Chenxi Liu
55
231
0
19 Nov 2022
Magic3D: High-Resolution Text-to-3D Content Creation
Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin
Jun Gao
Luming Tang
Towaki Takikawa
Fangyin Wei
Xun Huang
Karsten Kreis
Sanja Fidler
Ming-Yuan Liu
Nayeon Lee
132
1,141
0
18 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
158
1,745
0
17 Nov 2022
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via
  Audio-Lip Memory
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Se Jin Park
Minsu Kim
Joanna Hong
J. Choi
Y. Ro
CVBM
93
87
0
02 Nov 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
51
1,064
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation
  image-text models
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
125
3,355
0
16 Oct 2022
Mind Reader: Reconstructing complex images from brain activities
Mind Reader: Reconstructing complex images from brain activities
Sikun Lin
Thomas C. Sprague
Ambuj K. Singh
DiffM
133
89
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
116
2,359
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
66
1,385
0
29 Sep 2022
GET3D: A Generative Model of High Quality 3D Textured Shapes Learned
  from Images
GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images
Jun Gao
Tianchang Shen
Zian Wang
Wenzheng Chen
K. Yin
Daiqing Li
Or Litany
Zan Gojcic
Sanja Fidler
61
442
0
22 Sep 2022
User-Controllable Latent Transformer for StyleGAN Image Layout Editing
User-Controllable Latent Transformer for StyleGAN Image Layout Editing
Yuki Endo
40
40
0
26 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
196
2,789
0
25 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
128
1,746
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
89
1,837
0
02 Aug 2022
Classifier-Free Diffusion Guidance
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
86
3,830
0
26 Jul 2022
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head
  Synthesis
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis
Shuai Shen
Wanhua Li
Zhengbiao Zhu
Yueqi Duan
Jie Zhou
Jiwen Lu
CVBM
62
106
0
24 Jul 2022
Panoptic Scene Graph Generation
Panoptic Scene Graph Generation
Jingkang Yang
Yi Zhe Ang
Zujin Guo
Kaiyang Zhou
Wayne Zhang
Ziwei Liu
94
111
0
22 Jul 2022
Auto-regressive Image Synthesis with Integrated Quantization
Auto-regressive Image Synthesis with Integrated Quantization
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Kai Cui
Changgong Zhang
Shijian Lu
77
10
0
21 Jul 2022
Towards Counterfactual Image Manipulation via CLIP
Towards Counterfactual Image Manipulation via CLIP
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jiahui Zhang
Shijian Lu
Miaomiao Cui
Xuansong Xie
Xiansheng Hua
Chunyan Miao
CLIP
70
31
0
06 Jul 2022
1234567
Next