2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,897 papers shown
Title
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
132
313
0
08 Aug 2022
Sampling Based On Natural Image Statistics Improves Local Surrogate Explainers
Ricardo Kleinlein
Alexander Hepburn
Raúl Santos-Rodríguez
Fernando Fernández-Martínez
AAML
FAtt
44
2
0
08 Aug 2022
Creative Wand: A System to Study Effects of Communications in Co-Creative Settings
Zhiyu Lin
Rohan Agarwal
Mark O. Riedl
60
9
0
04 Aug 2022
Adversarial Attacks on Image Generation With Made-Up Words
Raphael Milliere
90
39
0
04 Aug 2022
Masked Vision and Language Modeling for Multi-modal Representation Learning
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
90
68
0
03 Aug 2022
Pyramidal Denoising Diffusion Probabilistic Models
Dohoon Ryu
Jong Chul Ye
109
25
0
03 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
107
22
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
247
1,796
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
176
1,903
0
02 Aug 2022
AI Augmented Edge and Fog Computing: Trends and Challenges
Shreshth Tuli
Fatemeh Mirhakimi
Samodha Pallewatta
Syed Zawad
G. Casale
B. Javadi
Feng Yan
Rajkumar Buyya
N. Jennings
65
59
0
01 Aug 2022
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
225
66
0
29 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
83
71
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. O'Neil
Sotirios A. Tsaftaris
MedIm
DiffM
101
68
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
104
31
0
25 Jul 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
112
106
0
23 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
45
10
0
22 Jul 2022
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
Ming-Yu Liu
Yuxiang Wei
Xiaohe Wu
Wangmeng Zuo
Lei Zhang
96
1
0
21 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
111
306
0
20 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
89
74
0
20 Jul 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
122
79
0
19 Jul 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
75
17
0
19 Jul 2022
Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis
Sangyun Lee
Hyungjin Chung
Jaehyeon Kim
Jong Chul Ye
DiffM
85
46
0
16 Jul 2022
How to Reuse and Compose Knowledge for a Lifetime of Tasks: A Survey on Continual Learning and Functional Composition
Jorge Armando Mendez Mendez
Eric Eaton
KELM
CLL
121
29
0
15 Jul 2022
WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation
Mengping Yang
Zhe Wang
Ziqiu Chi
Wenyi Feng
87
51
0
15 Jul 2022
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
Jinbin Bai
Chunhui Liu
Feiyue Ni
Haofan Wang
Mengying Hu
Xiaofeng Guo
Lele Cheng
102
11
0
11 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
260
470
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
108
18
0
09 Jul 2022
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
Matteo Manica
Jannis Born
Joris Cadow
Dimitrios Christofidellis
A. Dave
...
Lauren N. McHugh
Alexy Khrabrov
Payel Das
Seiji Takeda
John Smith
95
27
0
08 Jul 2022
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
91
0
0
08 Jul 2022
Exploring Generative Adversarial Networks for Text-to-Image Generation with Evolution Strategies
Victor G. Turrisi da Costa
Nuno Lourenço
João Correia
Penousal Machado
GAN
34
3
0
06 Jul 2022
Can Language Understand Depth?
Renrui Zhang
Ziyao Zeng
Ziyu Guo
Yafeng Li
VLM
MDE
108
75
0
03 Jul 2022
American == White in Multimodal Language-and-Image AI
Robert Wolfe
Aylin Caliskan
VLM
85
51
0
01 Jul 2022
Deep Learning and Symbolic Regression for Discovering Parametric Equations
Michael Zhang
Samuel Kim
Peter Y. Lu
M. Soljačić
72
21
0
01 Jul 2022
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
Aleksander Madry
97
90
0
29 Jun 2022
Beyond neural scaling laws: beating power law scaling via data pruning
Ben Sorscher
Robert Geirhos
Shashank Shekhar
Surya Ganguli
Ari S. Morcos
122
447
0
29 Jun 2022
Memory Safe Computations with XLA Compiler
A. Artemev
Tilman Roeder
Mark van der Wilk
56
8
0
28 Jun 2022
Studying Generalization Through Data Averaging
C. Gomez-Uribe
FedML
133
0
0
28 Jun 2022
Perspective (In)consistency of Paint by Text
Hany Farid
DiffM
77
37
0
27 Jun 2022
Repository-Level Prompt Generation for Large Language Models of Code
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
103
143
0
26 Jun 2022
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge Belongie
Sagie Benaim
VGen
DiffM
84
17
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
156
304
0
23 Jun 2022
The ArtBench Dataset: Benchmarking Generative Models with Artworks
Peiyuan Liao
Xiuyu Li
Xihui Liu
Kurt Keutzer
76
49
0
22 Jun 2022
A Study on the Evaluation of Generative Models
Eyal Betzalel
Coby Penso
Aviv Navon
Ethan Fetaya
EGVM
125
52
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
223
1,134
0
22 Jun 2022
EpiGRAF: Rethinking training of 3D GANs
Ivan Skorokhodov
Sergey Tulyakov
Yiqun Wang
Peter Wonka
DiffM
117
126
0
21 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
141
120
0
21 Jun 2022
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang
Joonghyuk Shin
Jaesik Park
EGVM
66
70
0
19 Jun 2022
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
BDL
107
15
0
18 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
171
412
0
17 Jun 2022
Lossy Compression with Gaussian Diffusion
Lucas Theis
Tim Salimans
Matthew D. Hoffman
Fabian Mentzer
DiffM
112
82
0
17 Jun 2022