Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,355 papers shown
Interpreting and Improving Diffusion Models from an Optimization Perspective
Frank Permenter
Chenyang Yuan
DiffM
30
4
0
08 Jun 2023
WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Changhoon Kim
Kyle Min
Maitreya Patel
Sheng Cheng
Yezhou Yang
WIGM
37
28
0
07 Jun 2023
ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models
Maitreya Patel
Tejas Gokhale
Chitta Baral
Yezhou Yang
CoGe
32
16
0
07 Jun 2023
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models
G. Stein
Jesse C. Cresswell
Rasa Hosseinzadeh
Yi Sui
Brendan Leigh Ross
Valentin Villecroze
Zhaoyan Liu
Anthony L. Caterini
J. E. T. Taylor
Gabriel Loaiza-Ganem
EGVM
59
97
0
07 Jun 2023
Designing a Better Asymmetric VQGAN for StableDiffusion
Zixin Zhu
Xuelu Feng
DongDong Chen
Jianmin Bao
Le Wang
Yinpeng Chen
Lu Yuan
Gang Hua
DiffM
51
34
0
07 Jun 2023
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections
Chunfeng Yao
Amit Raj
Wei-Chih Hung
Yuanzhen Li
Michael Rubinstein
Ming-Hsuan Yang
Varun Jampani
DiffM
39
19
0
07 Jun 2023
On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang
George Alex Koulieris
Hubert P. H. Shum
DiffM
34
56
0
07 Jun 2023
Multi-modal Latent Diffusion
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
DiffM
43
13
0
07 Jun 2023
Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance
Gihyun Kwon
Jong Chul Ye
DiffM
30
2
0
07 Jun 2023
Generative Semantic Communication: Diffusion Models Beyond Bit Recovery
Eleonora Grassucci
Sergio Barbarossa
Danilo Comminiello
DiffM
47
55
0
07 Jun 2023
Phoenix: A Federated Generative Diffusion Model
Fiona Victoria Stanley Jothiraj
A. Mashhadi
DiffM
64
18
0
07 Jun 2023
ATT3D: Amortized Text-to-3D Object Synthesis
Jonathan Lorraine
Kevin Xie
Fangyin Wei
Chen-Hsuan Lin
Towaki Takikawa
Nicholas Sharp
Nayeon Lee
Xuan Li
Sanja Fidler
James Lucas
DiffM
54
87
0
06 Jun 2023
Towards Visual Foundational Models of Physical Scenes
Chethan Parameshwara
Alessandro Achille
Matthew Trager
Xiaolong Li
Jiawei Mo
...
A. Swaminathan
C. Taylor
D. Venkatraman
Xiaohan Fei
Stefano Soatto
DiffM
46
4
0
06 Jun 2023
On the Role of Attention in Prompt-tuning
Samet Oymak
A. S. Rawat
Mahdi Soltanolkotabi
Christos Thrampoulidis
MLT
LRM
33
41
0
06 Jun 2023
DreamSparse: Escaping from Plato's Cave with 2D Frozen Diffusion Model Given Sparse Views
Paul D. Yoo
Jiaxian Guo
Yutaka Matsuo
S. Gu
46
24
0
06 Jun 2023
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Yochai Yemini
Aviv Shamsian
Lior Bracha
Sharon Gannot
Ethan Fetaya
DiffM
52
11
0
05 Jun 2023
Zero-shot CAD Program Re-Parameterization for Interactive Manipulation
Milin Kodnongbua
Benjamin T. Jones
Maaz Bin Safeer Ahmad
Vladimir G. Kim
Adriana Schulz
56
4
0
05 Jun 2023
MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
C. Jiang
Andre Cornman
C. Park
Benjamin Sapp
Yin Zhou
Drago Anguelov
DiffM
56
136
0
05 Jun 2023
HeadSculpt: Crafting 3D Head Avatars with Text
Xiaoping Han
Yukang Cao
Kai Han
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
Kwan-Yee K. Wong
DiffM
32
46
0
05 Jun 2023
User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques
Sunwoo Kim
Wooseok Jang
Hyunsung Kim
Junho Kim
Yunjey Choi
Seung Wook Kim
Gayeong Lee
DiffM
47
6
0
05 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen
DiffM
19
13
0
05 Jun 2023
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
Yizhe Zhang
Jiatao Gu
Zhuofeng Wu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
DiffM
72
26
0
05 Jun 2023
Temporal Dynamic Quantization for Diffusion Models
Junhyuk So
Jungwon Lee
Daehyun Ahn
Hyungjun Kim
Eunhyeok Park
DiffM
MQ
36
61
0
04 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
39
9
0
04 Jun 2023
Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
Hidetaka Kamigaito
Katsuhiko Hayashi
Taro Watanabe
VLM
25
1
0
03 Jun 2023
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng
Fei Yin
Xiaoke Huang
Xintong Yu
Jiaxiang Liu
Shi Feng
Yujiu Yang
Yansong Tang
DiffM
39
4
0
03 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
56
324
0
03 Jun 2023
Invisible Image Watermarks Are Provably Removable Using Generative AI
Xuandong Zhao
Kexun Zhang
Zihao Su
Saastha Vasan
Ilya Grishchenko
Christopher Kruegel
Giovanni Vigna
Yu Wang
Lei Li
WIGM
48
50
0
02 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
61
0
0
02 Jun 2023
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Saurabh Saxena
Charles Herrmann
Junhwa Hur
Abhishek Kar
Mohammad Norouzi
Deqing Sun
David J. Fleet
DiffM
56
79
0
02 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGen
DiffM
70
24
0
02 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
49
8
0
02 Jun 2023
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling
Zeqiang Lai
Yuchen Duan
Jifeng Dai
Ziheng Li
Ying Fu
Hongsheng Li
Yu Qiao
Wen Wang
DiffM
44
17
0
02 Jun 2023
PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models
Jiacheng Chen
Ruizhi Deng
Yasutaka Furukawa
DiffM
70
24
0
02 Jun 2023
Privacy Distillation: Reducing Re-identification Risk of Multimodal Diffusion Models
Virginia Fernandez
Pedro Sanchez
W. H. Pinaya
Grzegorz Jacenków
Sotirios A. Tsaftaris
Jorge Cardoso
42
18
0
02 Jun 2023
White-Box Transformers via Sparse Rate Reduction
Yaodong Yu
Sam Buchanan
Druv Pai
Tianzhe Chu
Ziyang Wu
Shengbang Tong
B. Haeffele
Yi Ma
ViT
57
81
0
01 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
49
246
0
01 Jun 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian
Lijie Fan
Phillip Isola
Huiwen Chang
Dilip Krishnan
VLM
DiffM
57
145
0
01 Jun 2023
StyleDrop: Text-to-Image Generation in Any Style
Kihyuk Sohn
Nataniel Ruiz
Kimin Lee
Daniel Castro Chin
Irina Blok
...
Yuanzhen Li
Yuan Hao
Irfan Essa
Michael Rubinstein
Dilip Krishnan
18
146
0
01 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
43
153
0
01 Jun 2023
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search
Qihao Liu
Adam Kortylewski
Yutong Bai
Song Bai
Alan Yuille
DiffM
47
12
0
01 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
40
39
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
39
10
0
01 Jun 2023
The Hidden Language of Diffusion Models
Hila Chefer
Oran Lang
Mor Geva
Volodymyr Polosukhin
Assaf Shocher
Michal Irani
Inbar Mosseri
Lior Wolf
DiffM
47
27
0
01 Jun 2023
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation
Minghui Hu
Jianbin Zheng
Daqing Liu
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
47
9
0
01 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
47
86
0
01 Jun 2023
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft
Shalev Lifshitz
Keiran Paster
Harris Chan
Jimmy Ba
Sheila A. McIlraith
LM&Ro
45
69
0
01 Jun 2023
Inserting Anybody in Diffusion Models via Celeb Basis
Genlan Yuan
Xiaodong Cun
Yong Zhang
Maomao Li
Chenyang Qi
Xintao Wang
Ying Shan
Huicheng Zheng
DiffM
25
52
0
01 Jun 2023
T2IAT: Measuring Valence and Stereotypical Biases in Text-to-Image Generation
Jialu Wang
Xinyue Liu
Zonglin Di
Yongxu Liu
Xin Eric Wang
34
32
0
01 Jun 2023
FigGen: Text to Scientific Figure Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
DiffM
29
6
0
01 Jun 2023