ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLMDiffM
ArXiv (abs)PDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,897 papers shown
Title
DLT: Conditioned layout generation with Joint Discrete-Continuous
  Diffusion Layout Transformer
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer
Elad Levi
Eli Brosh
Mykola Mykhailych
Meir Perez
DiffM
96
17
0
07 Mar 2023
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic
  Analysis For DDIM-Type Samplers
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers
Sitan Chen
Giannis Daras
A. Dimakis
DiffM
89
65
0
06 Mar 2023
Enhancing Activity Prediction Models in Drug Discovery with the Ability
  to Understand Human Language
Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language
Philipp Seidl
Andreu Vall
Sepp Hochreiter
Günter Klambauer
140
41
0
06 Mar 2023
StyO: Stylize Your Face in Only One-Shot
StyO: Stylize Your Face in Only One-Shot
Bonan li
Zicheng Zhang
Xuecheng Nie
Congying Han
Yinhan Hu
Tiande Guo
DiffM
125
6
0
06 Mar 2023
Choice Over Control: How Users Write with Large Language Models using
  Diegetic and Non-Diegetic Prompting
Choice Over Control: How Users Write with Large Language Models using Diegetic and Non-Diegetic Prompting
Hai Dang
Sven Goller
Florian Lehmann
Daniel Buschek
AI4CE
148
77
0
06 Mar 2023
Towards Zero-Shot Functional Compositionality of Language Models
Towards Zero-Shot Functional Compositionality of Language Models
Hangyeol Yu
Myeongho Jeong
Jamin Shin
Hyeongdon Moon
Juneyoung Park
Seungtaek Choi
69
1
0
06 Mar 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only
  Training
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Wei Li
Linchao Zhu
Longyin Wen
Yi Yang
VLM
107
89
0
06 Mar 2023
Learning multi-scale local conditional probability models of images
Learning multi-scale local conditional probability models of images
Zahra Kadkhodaie
Florentin Guth
S. Mallat
Eero P. Simoncelli
DiffM
103
19
0
06 Mar 2023
FoundationTTS: Text-to-Speech for ASR Customization with Generative
  Language Model
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model
Rui Xue
Yanqing Liu
Lei He
Xuejiao Tan
Linquan Liu
Ed Lin
Sheng Zhao
116
7
0
06 Mar 2023
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations
  and Infographics using Large Language Models
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
Victor C. Dibia
VLM
134
90
0
06 Mar 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and
  Artificial Scenes
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Xu Ju
Ailing Zeng
Jianan Wang
Qian Xu
Lei Zhang
3DH
89
48
0
05 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjDVLMMDE
249
233
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
79
28
0
03 Mar 2023
Bi-parametric prostate MR image synthesis using pathology and
  sequence-conditioned stable diffusion
Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion
Shaheer U. Saeed
Tom Syer
Wen Yan
Qianye Yang
M. Emberton
S. Punwani
Matthew J. Clarkson
D. Barratt
Yipeng Hu
DiffMMedIm
99
11
0
03 Mar 2023
Word-As-Image for Semantic Typography
Word-As-Image for Semantic Typography
Shira Iluz
Yael Vinker
Amir Hertz
Daniel Berio
Daniel Cohen-Or
Ariel Shamir
DiffM
99
64
0
03 Mar 2023
A Complete Recipe for Diffusion Generative Models
A Complete Recipe for Diffusion Generative Models
Kushagra Pandey
Stephan Mandt
DiffM
67
9
0
03 Mar 2023
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of
  Pneumothorax
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax
Zachary Huemann
Xin Tie
Junjie Hu
Tyler Bradshaw
59
17
0
02 Mar 2023
Counterfactual Edits for Generative Evaluation
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
63
0
0
02 Mar 2023
Dropout Reduces Underfitting
Dropout Reduces Underfitting
Zhuang Liu
Zhi-Qin John Xu
Joseph Jin
Zhiqiang Shen
Trevor Darrell
160
42
0
02 Mar 2023
Consistency Models
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLMDiffM
115
982
0
02 Mar 2023
3D generation on ImageNet
3D generation on ImageNet
Ivan Skorokhodov
Aliaksandr Siarohin
Yinghao Xu
Jian Ren
Hsin-Ying Lee
Peter Wonka
Sergey Tulyakov
121
55
0
02 Mar 2023
A Pathway Towards Responsible AI Generated Content
A Pathway Towards Responsible AI Generated Content
Chen Chen
Jie Fu
Lingjuan Lyu
106
72
0
02 Mar 2023
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation
Rui Zhao
Wei Li
Zhipeng Hu
Lincheng Li
Zhengxia Zou
Z. Shi
Changjie Fan
66
19
0
02 Mar 2023
DSD$^2$: Can We Dodge Sparse Double Descent and Compress the Neural
  Network Worry-Free?
DSD2^22: Can We Dodge Sparse Double Descent and Compress the Neural Network Worry-Free?
Victor Quétu
Enzo Tartaglione
85
7
0
02 Mar 2023
X&Fuse: Fusing Visual Information in Text-to-Image Generation
X&Fuse: Fusing Visual Information in Text-to-Image Generation
Yuval Kirstain
Omer Levy
Adam Polyak
DiffM
50
6
0
02 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data
  Augmentation
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation
Diederik P. Kingma
Ruiqi Gao
DiffM
100
144
0
01 Mar 2023
Continuous-Time Functional Diffusion Processes
Continuous-Time Functional Diffusion Processes
Giulio Franzese
Dario Rossi
Simone Rossi
Markus Heinonen
Maurizio Filippone
Pietro Michiardi
114
27
0
01 Mar 2023
StraIT: Non-autoregressive Generation with Stratified Image Transformer
StraIT: Non-autoregressive Generation with Stratified Image Transformer
Shengju Qian
Huiwen Chang
Yuanzhen Li
Zizhao Zhang
Jiaya Jia
Han Zhang
111
12
0
01 Mar 2023
Rethinking Efficient Tuning Methods from a Unified Perspective
Rethinking Efficient Tuning Methods from a Unified Perspective
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Yiliang Lv
Deli Zhao
Jingren Zhou
85
11
0
01 Mar 2023
Level Up the Deepfake Detection: a Method to Effectively Discriminate
  Images Generated by GAN Architectures and Diffusion Models
Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models
Luca Guarnera
O. Giudice
Sebastiano Battiato
111
30
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge
  Collaborative AutoML System
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
94
2
0
01 Mar 2023
Understanding Natural Language Understanding Systems. A Critical
  Analysis
Understanding Natural Language Understanding Systems. A Critical Analysis
Alessandro Lenci
ELM
61
12
0
01 Mar 2023
Collage Diffusion
Collage Diffusion
Vishnu Sarukkai
Linden Li
Arden Ma
Christopher Ré
Kayvon Fatahalian
DiffM
82
27
0
01 Mar 2023
The Trade-off between Universality and Label Efficiency of
  Representations from Contrastive Learning
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning
Zhenmei Shi
Jiefeng Chen
Kunyang Li
Jayaram Raghuram
Xi Wu
Yingyu Liang
S. Jha
SSL
71
20
0
28 Feb 2023
Monocular Depth Estimation using Diffusion Models
Monocular Depth Estimation using Diffusion Models
Saurabh Saxena
Abhishek Kar
Mohammad Norouzi
David J. Fleet
DiffMVLMMDE
109
86
0
28 Feb 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
115
15
0
28 Feb 2023
Synthesizing Mixed-type Electronic Health Records using Diffusion Models
Synthesizing Mixed-type Electronic Health Records using Diffusion Models
T. Ceritli
Ghadeer O. Ghosheh
V. Chauhan
T. Zhu
Andrew P. Creagh
David Clifton
MedImDiffM
92
19
0
28 Feb 2023
Benchmarking Deepart Detection
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
105
11
0
28 Feb 2023
Foundation Model Drives Weakly Incremental Learning for Semantic
  Segmentation
Foundation Model Drives Weakly Incremental Learning for Semantic Segmentation
Chaohui Yu
Qiang-feng Zhou
Jingliang Li
Jia-Chao Yuan
Zhibin Wang
Fan Wang
VLMCLL
69
14
0
28 Feb 2023
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
Wonwoong Cho
Hareesh Ravi
Midhun Harikumar
V. Khuc
Krishna Kumar Singh
Jingwan Lu
David I. Inouye
Ajinkya Kale
DiffM
157
7
0
28 Feb 2023
AVscript: Accessible Video Editing with Audio-Visual Scripts
AVscript: Accessible Video Editing with Audio-Visual Scripts
Mina Huh
Saelyne Yang
Yi-Hao Peng
Xiang Ánthony' Chen
Young-Ho Kim
Amy Pavel
64
34
0
27 Feb 2023
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized
  Text-to-Image Generation
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation
Yuxiang Wei
Yabo Zhang
Zhilong Ji
Jinfeng Bai
Lei Zhang
W. Zuo
DiffM
107
329
0
27 Feb 2023
Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Jiyoung Lee
Joon Son Chung
Soo-Whan Chung
DiffM
101
31
0
27 Feb 2023
Spatial-temporal Transformer-guided Diffusion based Data Augmentation
  for Efficient Skeleton-based Action Recognition
Spatial-temporal Transformer-guided Diffusion based Data Augmentation for Efficient Skeleton-based Action Recognition
Yifan Jiang
Han Chen
Hanseok Ko
DiffM
109
4
0
26 Feb 2023
MetaAID 2.0: An Extensible Framework for Developing Metaverse
  Applications via Human-controllable Pre-trained Models
MetaAID 2.0: An Extensible Framework for Developing Metaverse Applications via Human-controllable Pre-trained Models
Hongyin Zhu
56
6
0
25 Feb 2023
Directed Diffusion: Direct Control of Object Placement through Attention
  Guidance
Directed Diffusion: Direct Control of Object Placement through Attention Guidance
W. Ma
J. P. Lewis
Avisek Lahiri
Thomas Leung
W. Kleijn
DiffM
106
68
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
139
160
0
25 Feb 2023
Denoising diffusion algorithm for inverse design of microstructures with
  fine-tuned nonlinear material properties
Denoising diffusion algorithm for inverse design of microstructures with fine-tuned nonlinear material properties
Nikolaos N. Vlassis
WaiChing Sun
AI4CEDiffM
116
51
0
24 Feb 2023
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Modulating Pretrained Diffusion Models for Multimodal Image Synthesis
Cusuh Ham
James Hays
Jingwan Lu
Krishna Kumar Singh
Zhifei Zhang
Tobias Hinz
DiffM
102
24
0
24 Feb 2023
SurvivalGAN: Generating Time-to-Event Data for Survival Analysis
SurvivalGAN: Generating Time-to-Event Data for Survival Analysis
Alexander Norcliffe
B. Cebere
F. Imrie
Pietro Lio
M. Schaar
SyDa
72
15
0
24 Feb 2023
Previous
123...828384...969798
Next