2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,897 papers shown
Title
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy
Michael J. Smith
James E. Geach
76
36
0
07 Nov 2022
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale
Federico Bianchi
Pratyusha Kalluri
Esin Durmus
Faisal Ladhak
Myra Cheng
Debora Nozza
Tatsunori Hashimoto
Dan Jurafsky
James Zou
Aylin Caliskan
DiffM
VLM
143
323
0
07 Nov 2022
Image Completion with Heterogeneously Filtered Spectral Hints
Xingqian Xu
Shant Navasardyan
Vahram Tadevosyan
Andranik Sargsyan
Yadong Mu
Humphrey Shi
65
24
0
07 Nov 2022
Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation
Firas Khader
Gustav Mueller-Franzes
Soroosh Tayebi Arasteh
T. Han
Christoph Haarburger
...
Johannes Stegmaier
Christiane Kuhl
S. Nebelung
Jakob Nikolas Kather
Daniel Truhn
DiffM
MedIm
180
68
0
07 Nov 2022
I Hear Your True Colors: Image Guided Audio Generation
Roy Sheffer
Yossi Adi
VLM
85
76
0
06 Nov 2022
Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion
Marin Bilos
Kashif Rasul
Anderson Schneider
Yuriy Nevmyvaka
Stephan Günnemann
DiffM
111
36
0
04 Nov 2022
CASA: Category-agnostic Skeletal Animal Reconstruction
Yuefan Wu
Ze-Yin Chen
Shao-Wei Liu
Zhongzheng Ren
Shenlong Wang
93
31
0
04 Nov 2022
Rickrolling the Artist: Injecting Backdoors into Text Encoders for Text-to-Image Synthesis
Lukas Struppek
Dominik Hintersdorf
Kristian Kersting
SILM
130
40
0
04 Nov 2022
Large Language Models Are Human-Level Prompt Engineers
Yongchao Zhou
Andrei Ioan Muresanu
Ziwen Han
Keiran Paster
Silviu Pitis
Harris Chan
Jimmy Ba
ALM
LLMAG
195
904
0
03 Nov 2022
Evaluating a Synthetic Image Dataset Generated with Stable Diffusion
Andreas Stöckl
82
23
0
03 Nov 2022
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Aditya Sanghi
Rao Fu
Vivian Liu
Karl Willis
Hooman Shayani
Amir Hosein Khasahmadi
Srinath Sridhar
Daniel E. Ritchie
90
55
0
02 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM
MoE
213
832
0
02 Nov 2022
Verifying And Interpreting Neural Networks using Finite Automata
Marco Sälzer
Eric Alsmann
Florian Bruse
M. Lange
AAML
81
3
0
02 Nov 2022
Spot the fake lungs: Generating Synthetic Medical Images using Neural Diffusion Models
Hazrat Ali
Shafaq Murad
Zubair Shah
DiffM
MedIm
119
53
0
02 Nov 2022
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models
Cheng Lu
Yuhao Zhou
Fan Bao
Jianfei Chen
Chongxuan Li
Jun Zhu
DiffM
249
616
0
02 Nov 2022
On the detection of synthetic images generated by diffusion models
Riccardo Corvi
D. Cozzolino
Giada Zingarini
Giovanni Poggi
Koki Nagano
L. Verdoliva
207
240
0
01 Nov 2022
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
Junde Wu
Rao Fu
Huihui Fang
Yu Zhang
Yehui Yang
Haoyi Xiong
Huiying Liu
Yanwu Xu
MedIm
VLM
DiffM
240
254
0
01 Nov 2022
Kuaipedia: a Large-scale Multi-modal Short-video Encyclopedia
Haojie Pan
Zepeng Zhai
Yuzhou Zhang
Ruiji Fu
Ming Liu
Yangqiu Song
Zhongyuan Wang
Bing Qin
96
6
0
28 Oct 2022
MagicMix: Semantic Mixing with Diffusion Models
Jun Hao Liew
Hanshu Yan
Daquan Zhou
Jiashi Feng
DiffM
233
64
0
28 Oct 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
232
30
0
28 Oct 2022
Deep Generative Models on 3D Representations: A Survey
Zifan Shi
Sida Peng
Yinghao Xu
Andreas Geiger
Yiyi Liao
Yujun Shen
MedIm
3DV
96
0
0
27 Oct 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
71
18
0
27 Oct 2022
Explaining the Explainers in Graph Neural Networks: a Comparative Study
Antonio Longa
Steve Azzolin
G. Santin
G. Cencetti
Pietro Lio
Bruno Lepri
Andrea Passerini
107
31
0
27 Oct 2022
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation
Zhaorui Tan
Xi Yang
Zihan Ye
Qiufeng Wang
Yuyao Yan
Anh Nguyen
Kaizhu Huang
EGVM
76
3
0
27 Oct 2022
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
Hritik Bansal
Da Yin
Masoud Monajatipoor
Kai-Wei Chang
116
103
0
27 Oct 2022
Conversing with Copilot: Exploring Prompt Engineering for Solving CS1 Problems Using Natural Language
Paul Denny
Viraj Kumar
Nasser Giacaman
78
249
0
27 Oct 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
136
305
0
26 Oct 2022
A Sign That Spells: DALL-E 2, Invisual Images and The Racial Politics of Feature Space
Fabian Offert
Thao Phan
44
16
0
26 Oct 2022
Categorical SDEs with Simplex Diffusion
Pierre Harvey Richemond
Sander Dieleman
Arnaud Doucet
DiffM
72
26
0
26 Oct 2022
Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker
Simon Damm
Thorsten Holz
Asja Fischer
DiffM
129
107
0
26 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Yufan Zhou
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
108
11
0
25 Oct 2022
A Survey on Artificial Intelligence for Music Generation: Agents, Domains and Perspectives
Carlos Hernandez-Olivan
Javier Hernandez-Olivan
J. R. Beltrán
MGen
93
7
0
25 Oct 2022
DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality
Ankur Handa
Arthur Allshire
Viktor Makoviychuk
Aleksei Petrenko
Ritvik Singh
...
Balakumar Sundaralingam
Yashraj S. Narang
Jean-Francois Lafleche
Dieter Fox
Gavriel State
136
157
0
25 Oct 2022
Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing
Tuhin Chakrabarty
Vishakh Padmakumar
Hengxing He
82
82
0
25 Oct 2022
Vitruvio: 3D Building Meshes via Single Perspective Sketches
Alberto Tono
Heyaojing Huang
Ashwin Agrawal
Martin Fischer
47
5
0
24 Oct 2022
High-Resolution Image Editing via Multi-Stage Blended Diffusion
J. Ackermann
Minjun Li
DiffM
68
16
0
24 Oct 2022
DALL-E 2 Fails to Reliably Capture Common Syntactic Processes
Evelina Leivada
Elliot Murphy
G. Marcus
193
38
0
23 Oct 2022
Deep Equilibrium Approaches to Diffusion Models
Ashwini Pokle
Zhengyang Geng
Zico Kolter
DiffM
94
43
0
23 Oct 2022
Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model
Zhiyuan Ren
Zhihong Pan
Xingfa Zhou
Le Kang
VGen
DiffM
117
39
0
22 Oct 2022
Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks
A. S. Bansal
Yoonjin Lee
Kyle Hilburn
I. Ebert‐Uphoff
AI4TS
94
2
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
99
22
0
21 Oct 2022
Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi
P. Mayer
Josue Casco-Rodriguez
Ali Siahkoohi
Richard G. Baraniuk
DiffM
108
20
0
21 Oct 2022
Evolution of Neural Tangent Kernels under Benign and Adversarial Training
Noel Loo
Ramin Hasani
Alexander Amini
Daniela Rus
AAML
86
13
0
21 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
88
126
0
20 Oct 2022
Composing Ensembles of Pre-trained Models via Iterative Consensus
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
Igor Mordatch
MoMe
73
25
0
20 Oct 2022
The Natural Robotics Contest: Crowdsourced Biomimetic Design
Robert Siddall
R. Zufferey
S. Armanini
Ketao Zhang
S. Sareh
Elisavetha Sergeev
AI4CE
13
8
0
20 Oct 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
151
515
0
20 Oct 2022
Representation Learning with Diffusion Models
Jeremias Traub
DiffM
97
8
0
20 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
108
7
0
19 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
152
20
0
19 Oct 2022