2204.06125
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,897 papers shown
Title
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer
Elad Levi
Eli Brosh
Mykola Mykhailych
Meir Perez
DiffM
96
17
0
07 Mar 2023
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers
Sitan Chen
Giannis Daras
A. Dimakis
DiffM
89
65
0
06 Mar 2023
Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language
Philipp Seidl
Andreu Vall
Sepp Hochreiter
Günter Klambauer
140
41
0
06 Mar 2023
StyO: Stylize Your Face in Only One-Shot
Bonan Li
Zicheng Zhang
Xuecheng Nie
Congying Han
Yinhan Hu
Tiande Guo
DiffM
125
6
0
06 Mar 2023
Choice Over Control: How Users Write with Large Language Models using Diegetic and Non-Diegetic Prompting
Hai Dang
Sven Goller
Florian Lehmann
Daniel Buschek
AI4CE
148
77
0
06 Mar 2023
Towards Zero-Shot Functional Compositionality of Language Models
Hangyeol Yu
Myeongho Jeong
Jamin Shin
Hyeongdon Moon
Juneyoung Park
Seungtaek Choi
69
1
0
06 Mar 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Wei Li
Linchao Zhu
Longyin Wen
Yi Yang
VLM
107
89
0
06 Mar 2023
Learning multi-scale local conditional probability models of images
Zahra Kadkhodaie
Florentin Guth
S. Mallat
Eero P. Simoncelli
DiffM
103
19
0
06 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
116
7
0
06 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Victor C. Dibia
VLM
134
90
0
06 Mar 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Xu Ju
Ailing Zeng
Jianan Wang
Qian Xu
Lei Zhang
3DH
89
48
0
05 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
249
233
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
79
28
0
03 Mar 2023
Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion
Shaheer U. Saeed
Tom Syer
Wen Yan
Qianye Yang
M. Emberton
S. Punwani
Matthew J. Clarkson
D. Barratt
Yipeng Hu
DiffM
MedIm
99
11
0
03 Mar 2023
Word-As-Image for Semantic Typography
Shira Iluz
Yael Vinker
Amir Hertz
Daniel Berio
Daniel Cohen-Or
Ariel Shamir
DiffM
99
64
0
03 Mar 2023
A Complete Recipe for Diffusion Generative Models
Kushagra Pandey
Stephan Mandt
DiffM
67
9
0
03 Mar 2023
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax
Zachary Huemann
Xin Tie
Junjie Hu
Tyler Bradshaw
59
17
0
02 Mar 2023
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
63
0
0
02 Mar 2023
Dropout Reduces Underfitting
Zhuang Liu
Zhi-Qin John Xu
Joseph Jin
Zhiqiang Shen
Trevor Darrell
160
42
0
02 Mar 2023
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLM
DiffM
115
982
0
02 Mar 2023
3D generation on ImageNet
Ivan Skorokhodov
Aliaksandr Siarohin
Yinghao Xu
Jian Ren
Hsin-Ying Lee
Peter Wonka
Sergey Tulyakov
121
55
0
02 Mar 2023
A Pathway Towards Responsible AI Generated Content
Chen Chen
Jie Fu
Lingjuan Lyu
106
72
0
02 Mar 2023
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation
Rui Zhao
Wei Li
Zhipeng Hu
Lincheng Li
Zhengxia Zou
Z. Shi
Changjie Fan
66
19
0
02 Mar 2023
DSD²: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
Victor Quétu
Enzo Tartaglione
85
7
0
02 Mar 2023
X&Fuse: Fusing Visual Information in Text-to-Image Generation
Yuval Kirstain
Omer Levy
Adam Polyak
DiffM
50
6
0
02 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation
Diederik P. Kingma
Ruiqi Gao
DiffM
100
144
0
01 Mar 2023
Continuous-Time Functional Diffusion Processes
Giulio Franzese
Dario Rossi
Simone Rossi
Markus Heinonen
Maurizio Filippone
Pietro Michiardi
114
27
0
01 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
111
12
0
01 Mar 2023
Rethinking Efficient Tuning Methods from a Unified Perspective
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Yiliang Lv
Deli Zhao
Jingren Zhou
85
11
0
01 Mar 2023
Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models
Luca Guarnera
O. Giudice
Sebastiano Battiato
111
30
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
94
2
0
01 Mar 2023
Understanding Natural Language Understanding Systems. A Critical Analysis
Alessandro Lenci
ELM
61
12
0
01 Mar 2023
Collage Diffusion
Vishnu Sarukkai
Linden Li
Arden Ma
Christopher Ré
Kayvon Fatahalian
DiffM
82
27
0
01 Mar 2023
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning
Zhenmei Shi
Jiefeng Chen
Kunyang Li
Jayaram Raghuram
Xi Wu
Yingyu Liang
S. Jha
SSL
71
20
0
28 Feb 2023
Monocular Depth Estimation using Diffusion Models
Saurabh Saxena
Abhishek Kar
Mohammad Norouzi
David J. Fleet
DiffM
VLM
MDE
109
86
0
28 Feb 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
115
15
0
28 Feb 2023
Synthesizing Mixed-type Electronic Health Records using Diffusion Models
T. Ceritli
Ghadeer O. Ghosheh
V. Chauhan
T. Zhu
Andrew P. Creagh
David Clifton
MedIm
DiffM
92
19
0
28 Feb 2023
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
105
11
0
28 Feb 2023
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation
Chaohui Yu
Qiang-feng Zhou
Jingliang Li
Jia-Chao Yuan
Zhibin Wang
Fan Wang
VLM
CLL
69
14
0
28 Feb 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
157
7
0
28 Feb 2023
AVscript: Accessible Video Editing with Audio-Visual Scripts
Mina Huh
Saelyne Yang
Yi-Hao Peng
Xiang 'Anthony' Chen
Young-Ho Kim
Amy Pavel
64
34
0
27 Feb 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei
Yabo Zhang
Zhilong Ji
Jinfeng Bai
Lei Zhang
W. Zuo
DiffM
107
329
0
27 Feb 2023
Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Jiyoung Lee
Joon Son Chung
Soo-Whan Chung
DiffM
101
31
0
27 Feb 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
109
4
0
26 Feb 2023
MetaAID 2.0: An Extensible Framework for Developing Metaverse Applications via Human-controllable Pre-trained Models
Hongyin Zhu
56
6
0
25 Feb 2023
Directed Diffusion: Direct Control of Object Placement through Attention Guidance
W. Ma
J. P. Lewis
Avisek Lahiri
Thomas Leung
W. Kleijn
DiffM
106
68
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
139
160
0
25 Feb 2023
Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties
Nikolaos N. Vlassis
WaiChing Sun
AI4CE
DiffM
116
51
0
24 Feb 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
102
24
0
24 Feb 2023
SurvivalGAN: Generating Time-to-Event Data for Survival Analysis
Alexander Norcliffe
B. Cebere
F. Imrie
Pietro Lio
M. Schaar
SyDa
72
15
0
24 Feb 2023