ResearchTrend.AI
Hierarchical Text-Conditional Image Generation with CLIP Latents


13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

44 / 4,744 papers shown
Title
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
Ming Ding
Wendi Zheng
Wenyi Hong
Jie Tang
VLM
41
322
0
28 Apr 2022
Can deep learning match the efficiency of human visual long-term memory in storing object details?
Emin Orhan
VLM
OCL
28
0
0
27 Apr 2022
An Overview of Recent Work in Media Forensics: Methods and Threats
Kratika Bhagtani
A. Yadav
Emily R. Bartusiak
Ziyue Xiang
Ruiting Shao
Sriram Baireddy
Edward J. Delp
AAML
52
25
0
26 Apr 2022
A very preliminary analysis of DALL-E 2
G. Marcus
E. Davis
S. Aaronson
16
134
0
25 Apr 2022
Semi-Parametric Neural Image Synthesis
A. Blattmann
Robin Rombach
Kaan Oktay
Jonas Muller
Bjorn Ommer
DiffM
36
28
0
25 Apr 2022
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
33
157
0
25 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
Hao Fei
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
32
82
0
20 Apr 2022
A Taxonomy of Prompt Modifiers for Text-To-Image Generation
J. Oppenlaender
28
102
0
20 Apr 2022
Opal: Multimodal Image Generation for News Illustration
Vivian Liu
Han Qiao
Lydia B. Chilton
19
101
0
19 Apr 2022
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
G. Loaiza-Ganem
Brendan Leigh Ross
Jesse C. Cresswell
Anthony L. Caterini
GAN
DRL
19
28
0
14 Apr 2022
Synthesizing Adversarial Visual Scenarios for Model-Based Robotic Control
Shubhankar Agarwal
Sandeep P. Chinchali
AAML
37
4
0
13 Apr 2022
Contrastive language and vision learning of general fashion concepts
P. Chia
Giuseppe Attanasio
Federico Bianchi
Silvia Terragni
A. Magalhães
Diogo Gonçalves
C. Greco
Jacopo Tagliabue
CLIP
21
42
0
08 Apr 2022
KNN-Diffusion: Image Generation via Large-Scale Retrieval
Shelly Sheynin
Oron Ashual
Adam Polyak
Uriel Singer
Oran Gafni
Eliya Nachmani
Yaniv Taigman
VLM
SyDa
DiffM
21
113
0
06 Apr 2022
CLIP-Mesh: Generating textured meshes from text using pretrained image-text models
N. Khalid
Tianhao Xie
Eugene Belilovsky
Tiberiu Popa
CLIP
13
291
0
24 Mar 2022
Complex Scene Image Editing by Scene Graph Comprehension
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
30
6
0
24 Mar 2022
How well does CLIP understand texture?
Chenyun Wu
Subhransu Maji
30
6
0
22 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
59
257
0
16 Mar 2022
The Role of ImageNet Classes in Fréchet Inception Distance
Tuomas Kynkaanniemi
Tero Karras
M. Aittala
Timo Aila
J. Lehtinen
EGVM
VLM
38
200
0
11 Mar 2022
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
ViT
32
10
0
09 Mar 2022
Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V4
William Berrios
Arturo Deza
MedIm
ViT
30
13
0
08 Mar 2022
A Typology for Exploring the Mitigation of Shortcut Behavior
Felix Friedrich
Wolfgang Stammer
P. Schramowski
Kristian Kersting
LLMAG
20
9
0
04 Mar 2022
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
Zihao Wang
Wei Liu
Qian He
Xin-ru Wu
Zili Yi
CLIP
VLM
201
73
0
01 Mar 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Junseok Oh
Donghwee Yoon
Injung Kim
34
1
0
28 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
ViT
145
170
0
08 Feb 2022
When Do Flat Minima Optimizers Work?
Jean Kaddour
Linqing Liu
Ricardo M. A. Silva
Matt J. Kusner
ODL
24
58
0
01 Feb 2022
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control
Dimitri von Rutte
Luca Biggio
Yannic Kilcher
Thomas Hofmann
33
0
0
26 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
29
48
0
27 Dec 2021
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Hideyuki Tachibana
Mocho Go
Muneyoshi Inahara
Yotaro Katayama
Yotaro Watanabe
DiffM
27
3
0
26 Dec 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
G. Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
207
102
0
21 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
37
163
0
11 Oct 2021
An Explainable-AI approach for Diagnosis of COVID-19 using MALDI-ToF Mass Spectrometry
V. Seethi
Z. LaCasse
P. Chivte
Joshua Bland
Shrihari S. Kadkol
E. Gaillard
Pratool Bharti
Hamed Alhoori
24
9
0
28 Sep 2021
How much human-like visual experience do current self-supervised learning algorithms need in order to achieve human-level object recognition?
Emin Orhan
OOD
38
4
0
23 Sep 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
202
405
0
13 Jul 2021
Systematic human learning and generalization from a brief tutorial with explanatory feedback
A. Nam
James L. McClelland
16
1
0
10 Jul 2021
Visual Probing: Cognitive Framework for Explaining Self-Supervised Image Representations
Witold Oleszkiewicz
Dominika Basaj
Igor Sieradzki
Michal Górszczak
Barbara Rychalska
K. Lewandowska
Tomasz Trzciński
Bartosz Zieliński
SSL
37
3
0
21 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM
MedIm
23
366
0
16 Jun 2021
Communicating Natural Programs to Humans and Machines
Samuel Acquaviva
Yewen Pu
Marta Kryven
Theo Sechopoulos
Catherine Wong
Gabrielle Ecanow
Maxwell Nye
Michael Henry Tessler
J. Tenenbaum
33
40
0
15 Jun 2021
Neural Monge Map estimation and its applications
JiaoJiao Fan
Shu Liu
Shaojun Ma
Haomin Zhou
Yongxin Chen
OT
30
23
0
07 Jun 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
34
40
0
06 Apr 2021
Structure Inducing Pre-Training
Matthew B. A. McDermott
Brendan Yap
Peter Szolovits
Marinka Zitnik
42
18
0
18 Mar 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,796
0
24 Feb 2021
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
18
2,130
0
23 Dec 2020
RainNet: A Large-Scale Imagery Dataset and Benchmark for Spatial Precipitation Downscaling
Xuanhong Chen
Kairui Feng
Naiyuan Liu
Bingbing Ni
Yifan Lu
Zhengyan Tong
Ziang Liu
27
10
0
17 Dec 2020
Model-Based Deep Learning
Nir Shlezinger
Jay Whang
Yonina C. Eldar
A. Dimakis
28
317
0
15 Dec 2020