Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
47 / 4,897 papers shown
Title
CCMB: A Large-scale Chinese Cross-modal Benchmark
Chunyu Xie
Heng Cai
Jincheng Li
Fanjing Kong
Xiaoyu Wu
...
Xiangzheng Zhang
Dawei Leng
Baochang Zhang
Xiangyang Ji
Yafeng Deng
MLLM
VLM
76
12
0
08 May 2022
BlobGAN: Spatially Disentangled Scene Representations
Dave Epstein
Taesung Park
Richard Y. Zhang
Eli Shechtman
Alexei A. Efros
GAN
SSL
OCL
99
43
0
05 May 2022
Language Models Can See: Plugging Visual Controls in Text Generation
Yixuan Su
Tian Lan
Yahui Liu
Fangyu Liu
Dani Yogatama
Yan Wang
Lingpeng Kong
Nigel Collier
VLM
MLLM
102
98
0
05 May 2022
A Computational Inflection for Scientific Discovery
Tom Hope
Doug Downey
Oren Etzioni
Daniel S. Weld
Eric Horvitz
AI4CE
85
34
0
04 May 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
86
8
0
03 May 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
420
3,617
0
29 Apr 2022
Fast Sampling of Diffusion Models with Exponential Integrator
Qinsheng Zhang
Yongxin Chen
DiffM
109
439
0
29 Apr 2022
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
Ming Ding
Wendi Zheng
Wenyi Hong
Jie Tang
VLM
144
335
0
28 Apr 2022
Can deep learning match the efficiency of human visual long-term memory in storing object details?
Emin Orhan
VLM
OCL
111
0
0
27 Apr 2022
An Overview of Recent Work in Media Forensics: Methods and Threats
Kratika Bhagtani
A. Yadav
Emily R. Bartusiak
Ziyue Xiang
Ruiting Shao
Sriram Baireddy
Edward J. Delp
AAML
89
25
0
26 Apr 2022
A very preliminary analysis of DALL-E 2
G. Marcus
E. Davis
S. Aaronson
104
139
0
25 Apr 2022
Semi-Parametric Neural Image Synthesis
A. Blattmann
Robin Rombach
Kaan Oktay
Jonas Muller
Bjorn Ommer
DiffM
100
31
0
25 Apr 2022
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
134
171
0
25 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
Hao Fei
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
96
89
0
20 Apr 2022
A Taxonomy of Prompt Modifiers for Text-To-Image Generation
J. Oppenlaender
101
107
0
20 Apr 2022
Opal: Multimodal Image Generation for News Illustration
Vivian Liu
Han Qiao
Lydia B. Chilton
114
103
0
19 Apr 2022
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
Gabriel Loaiza-Ganem
Brendan Leigh Ross
Jesse C. Cresswell
Anthony L. Caterini
GAN
DRL
104
31
0
14 Apr 2022
Synthesizing Adversarial Visual Scenarios for Model-Based Robotic Control
Shubhankar Agarwal
Sandeep Chinchali
AAML
87
4
0
13 Apr 2022
Contrastive language and vision learning of general fashion concepts
P. Chia
Giuseppe Attanasio
Federico Bianchi
Silvia Terragni
A. Magalhães
Diogo Gonçalves
C. Greco
Jacopo Tagliabue
CLIP
115
44
0
08 Apr 2022
KNN-Diffusion: Image Generation via Large-Scale Retrieval
Shelly Sheynin
Oron Ashual
Adam Polyak
Uriel Singer
Oran Gafni
Eliya Nachmani
Yaniv Taigman
VLM
SyDa
DiffM
82
124
0
06 Apr 2022
CLIP-Mesh: Generating textured meshes from text using pretrained image-text models
N. Khalid
Tianhao Xie
Eugene Belilovsky
Tiberiu Popa
CLIP
102
302
0
24 Mar 2022
Complex Scene Image Editing by Scene Graph Comprehension
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
68
6
0
24 Mar 2022
How well does CLIP understand texture?
Chenyun Wu
Subhransu Maji
67
7
0
22 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
157
267
0
16 Mar 2022
The Role of ImageNet Classes in Fréchet Inception Distance
Tuomas Kynkaanniemi
Tero Karras
M. Aittala
Timo Aila
J. Lehtinen
EGVM
VLM
142
212
0
11 Mar 2022
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
ViT
85
10
0
09 Mar 2022
Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V4
William Berrios
Arturo Deza
MedIm
ViT
84
13
0
08 Mar 2022
A Typology for Exploring the Mitigation of Shortcut Behavior
Felix Friedrich
Wolfgang Stammer
P. Schramowski
Kristian Kersting
LLMAG
62
7
0
04 Mar 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Junseok Oh
Donghwee Yoon
Injung Kim
66
1
0
28 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
ViT
241
193
0
08 Feb 2022
When Do Flat Minima Optimizers Work?
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
134
64
0
01 Feb 2022
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control
Dimitri von Rutte
Luca Biggio
Yannic Kilcher
Thomas Hofmann
74
0
0
26 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
192
51
0
27 Dec 2021
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Hideyuki Tachibana
Mocho Go
Muneyoshi Inahara
Yotaro Katayama
Yotaro Watanabe
DiffM
64
3
0
26 Dec 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
154
172
0
11 Oct 2021
An Explainable-AI approach for Diagnosis of COVID-19 using MALDI-ToF Mass Spectrometry
V. Seethi
Z. LaCasse
P. Chivte
Joshua Bland
Shrihari S. Kadkol
E. Gaillard
Pratool Bharti
Hamed Alhoori
31
10
0
28 Sep 2021
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?
Emin Orhan
OOD
100
4
0
23 Sep 2021
Systematic human learning and generalization from a brief tutorial with explanatory feedback
A. Nam
James L. McClelland
38
1
0
10 Jul 2021
Visual Probing: Cognitive Framework for Explaining Self-Supervised Image Representations
Witold Oleszkiewicz
Dominika Basaj
Igor Sieradzki
Michal Górszczak
Barbara Rychalska
K. Lewandowska
Tomasz Trzciñski
Bartosz Zieliñski
SSL
75
3
0
21 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM
MedIm
108
386
0
16 Jun 2021
Communicating Natural Programs to Humans and Machines
Samuel Acquaviva
Yewen Pu
Marta Kryven
Theo Sechopoulos
Catherine Wong
Gabrielle Ecanow
Maxwell Nye
Michael Henry Tessler
J. Tenenbaum
92
42
0
15 Jun 2021
Neural Monge Map estimation and its applications
JiaoJiao Fan
Shu Liu
Shaojun Ma
Haomin Zhou
Yongxin Chen
OT
120
27
0
07 Jun 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
129
43
0
06 Apr 2021
Structure Inducing Pre-Training
Matthew B. A. McDermott
Brendan Yap
Peter Szolovits
Marinka Zitnik
85
21
0
18 Mar 2021
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
229
2,276
0
23 Dec 2020
RainNet: A Large-Scale Imagery Dataset and Benchmark for Spatial Precipitation Downscaling
Xuanhong Chen
Kairui Feng
Naiyuan Liu
Bingbing Ni
Yifan Lu
Zhengyan Tong
Ziang Liu
70
11
0
17 Dec 2020
Model-Based Deep Learning
Nir Shlezinger
Jay Whang
Yonina C. Eldar
A. Dimakis
122
327
0
15 Dec 2020
Previous
1
2
3
...
96
97
98