ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTML

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 1,364 papers shown
Title
FETA: Towards Specializing Foundation Models for Expert Task
  Applications
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
90
20
0
08 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with
  Rectified Flow
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
265
1,056
0
07 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffMMedIm
485
1,420
0
02 Sep 2022
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D
  Object Sets
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D Object Sets
Kristofer Schlachter
Benjamin Ahlbrand
Zhu Wang
V. Ortenzi
Ken Perlin
DiffM3DV
53
7
0
01 Sep 2022
FLAME: Free-form Language-based Motion Synthesis & Editing
FLAME: Free-form Language-based Motion Synthesis & Editing
Jihoon Kim
Jiseob Kim
Sungjoon Choi
VGen
125
213
0
01 Sep 2022
A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images
A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images
Dominik Jens Elias Waibel
Ernst Rooell
Bastian Rieck
Raja Giryes
Carsten Marr
DiffMMedIm
93
41
0
30 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
92
97
0
29 Aug 2022
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
Bjorn Deiseroth
P. Schramowski
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
EGVMDiffM
52
2
0
29 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
304
2,904
0
25 Aug 2022
Understanding Diffusion Models: A Unified Perspective
Understanding Diffusion Models: A Unified Perspective
Calvin Luo
DiffM
102
347
0
25 Aug 2022
AI and 6G into the Metaverse: Fundamentals, Challenges and Future
  Research Trends
AI and 6G into the Metaverse: Fundamentals, Challenges and Future Research Trends
Muhammad Zawish
Fayaz Ali Dharejo
Sunder Ali Khowaja
Saleem Raza
Steven Davy
Kapal Dev
P. Bellavista
82
68
0
23 Aug 2022
Text to Image Generation: Leaving no Language Behind
Text to Image Generation: Leaving no Language Behind
Pedro Reviriego
Elena Merino-Gómez
VLM
49
13
0
19 Aug 2022
Discovering Bugs in Vision Models using Off-the-shelf Image Generation
  and Captioning
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning
Olivia Wiles
Isabela Albuquerque
Sven Gowal
VLM
72
47
0
18 Aug 2022
Enhancing Diffusion-Based Image Synthesis with Robust Classifier
  Guidance
Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance
Bahjat Kawar
Roy Ganz
Michael Elad
DiffM
91
39
0
18 Aug 2022
Multimodal foundation models are better simulators of the human brain
Multimodal foundation models are better simulators of the human brain
Haoyu Lu
Qiongyi Zhou
Nanyi Fei
Zhiwu Lu
Mingyu Ding
...
Changde Du
Xin Zhao
Haoran Sun
Huiguang He
J. Wen
AI4CE
85
13
0
17 Aug 2022
Layout-Bridging Text-to-Image Synthesis
Layout-Bridging Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
EGVM
98
15
0
12 Aug 2022
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
B. Guo
VGen
102
13
0
11 Aug 2022
Quality Not Quantity: On the Interaction between Dataset Design and
  Robustness of CLIP
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP
Thao Nguyen
Gabriel Ilharco
Mitchell Wortsman
Sewoong Oh
Ludwig Schmidt
CLIPVLM
180
108
0
10 Aug 2022
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern
  Hopfield Networks
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
Yonghao Xu
Weikang Yu
Pedram Ghamisi
Michael K Kopp
Sepp Hochreiter
66
34
0
08 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with
  Self-Conditioning
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
132
313
0
08 Aug 2022
Adversarial Attacks on Image Generation With Made-Up Words
Adversarial Attacks on Image Generation With Made-Up Words
Raphael Milliere
90
39
0
04 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image
  transformers
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
107
22
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
247
1,796
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
176
1,903
0
02 Aug 2022
Restoring Vision in Adverse Weather Conditions with Patch-Based
  Denoising Diffusion Models
Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
Ozan Özdenizci
Robert Legenstein
DiffM
119
270
0
29 Jul 2022
Testing Relational Understanding in Text-Guided Image Generation
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
225
66
0
29 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa3DGS
90
139
0
27 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented
  Diffusion Models
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
83
71
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion
  Localization
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. OÑeil
Sotirios A. Tsaftaris
MedImDiffM
101
68
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
104
31
0
25 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
45
10
0
22 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for
  Infinite Visual Synthesis
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
89
74
0
20 Jul 2022
Sparse Relational Reasoning with Object-Centric Representations
Sparse Relational Reasoning with Object-Centric Representations
Alex F Spies
Alessandra Russo
Murray Shanahan
OCLNAI
50
3
0
15 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language,
  Vision, and Action
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
260
470
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
108
18
0
09 Jul 2022
Big Learning
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
91
0
0
08 Jul 2022
Distilling Model Failures as Directions in Latent Space
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
Aleksander Madry
97
90
0
29 Jun 2022
Neural Neural Textures Make Sim2Real Consistent
Neural Neural Textures Make Sim2Real Consistent
R. Burgert
Jinghuan Shang
Xiang Li
Michael S. Ryoo
64
6
0
27 Jun 2022
ProGen2: Exploring the Boundaries of Protein Language Models
ProGen2: Exploring the Boundaries of Protein Language Models
Erik Nijkamp
Jeffrey A. Ruffolo
Eli N. Weinstein
Nikhil Naik
Ali Madani
AI4TS
76
315
0
27 Jun 2022
Text-Driven Stylization of Video Objects
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge Belongie
Sagie Benaim
VGenDiffM
84
17
0
24 Jun 2022
The ArtBench Dataset: Benchmarking Generative Models with Artworks
The ArtBench Dataset: Benchmarking Generative Models with Artworks
Peiyuan Liao
Xiuyu Li
Xihui Liu
Kurt Keutzer
76
49
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
223
1,134
0
22 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
145
120
0
21 Jun 2022
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for
  Inverse Problems
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
BDL
107
15
0
18 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale
  Knowledge
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
150
388
0
17 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
74
51
0
11 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
171
393
0
06 Jun 2022
Discovering the Hidden Vocabulary of DALLE-2
Discovering the Hidden Vocabulary of DALLE-2
Giannis Daras
A. Dimakis
189
68
0
01 Jun 2022
Decomposing NeRF for Editing via Feature Field Distillation
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
266
343
0
31 May 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via
  Transformers
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
343
632
0
29 May 2022
Previous
123...262728
Next