2204.06125
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,759 papers shown
Title
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
32
18
0
08 Jul 2024
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis
Emaad Khwaja
Abdullah Rashwan
Ting Chen
Oliver Wang
Suraj Kothawade
Yeqing Li
DiffM
48
0
0
08 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
84
26
0
08 Jul 2024
Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder
Jia Liu
Changlin Li
Qirui Sun
Jiahui Ming
Chen Fang
Jue Wang
Bing Zeng
Shuaicheng Liu
DiffM
42
3
0
08 Jul 2024
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Misha Sra
Pradeep Sen
42
0
0
08 Jul 2024
Image-Conditional Diffusion Transformer for Underwater Image Enhancement
Xingyang Nie
Su Pan
Xiaoyu Zhai
Shifei Tao
Fengzhong Qu
Biao Wang
Huilin Ge
Guojie Xiao
49
2
0
07 Jul 2024
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang
Ruohan Dong
Jiayi Ji
Yiwei Ma
Haowei Wang
Xiaoshuai Sun
Rongrong Ji
62
3
0
07 Jul 2024
An Improved Method for Personalizing Diffusion Models
Yan Zeng
Masanori Suganuma
Takayuki Okatani
DiffM
52
1
0
07 Jul 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao
Xiaojian Ma
Liang Chen
Shuzheng Si
Rujie Wu
Kaikai An
Peiyu Yu
Minjia Zhang
Qing Li
Baobao Chang
71
46
0
07 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
103
8
0
07 Jul 2024
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Cheng-Han Chiang
Wei-Chih Chen
Chun-Yi Kuan
Chienchou Yang
Hung-yi Lee
ELM
AI4Ed
54
5
0
07 Jul 2024
Synthetic Data Aided Federated Learning Using Foundation Models
Fatima Abacha
Sin G. Teo
Lucas C. Cordeiro
Mustafa A. Mustafa
FedML
44
2
0
06 Jul 2024
FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning
Boyu Fan
Chenrui Wu
Xiang Su
Pan Hui
FedML
69
2
0
06 Jul 2024
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
Zhekai Chen
Wen Wang
Zhen Yang
Zeqing Yuan
Hao Chen
Chunhua Shen
DiffM
63
1
0
06 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
MLLM
76
30
0
05 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
56
6
0
05 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
54
7
0
05 Jul 2024
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu
Chaohui Yu
Chenjie Cao
Wen Qian
Fan Wang
DiffM
47
3
0
05 Jul 2024
GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction
Yuxuan Mu
Wei Ji
Chuan Guo
Yilin Wang
Juwei Lu
Xiaofeng Wu
Songcen Xu
Peng Dai
Youliang Yan
Li Cheng
3DGS
67
5
0
05 Jul 2024
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection
Federico Girella
Ziyue Liu
Franco Fummi
Francesco Setti
Marco Cristani
Luigi Capogrosso
64
3
0
04 Jul 2024
Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration
Yuhong Zhang
Hengsheng Zhang
Xinning Chai
Zhengxue Cheng
Rong Xie
Li Song
Wenjun Zhang
DiffM
55
4
0
04 Jul 2024
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer
Dheeraj Vattikonda
Luis Lara
Varun Jampani
Eva Portelance
Christopher Pal
Siva Reddy
EGVM
VGen
54
4
0
03 Jul 2024
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu
Gabriele Corso
Tommi Jaakkola
Arash Vahdat
Karsten Kreis
49
13
0
03 Jul 2024
Improved Noise Schedule for Diffusion Training
Tiankai Hang
Shuyang Gu
DiffM
39
9
0
03 Jul 2024
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation
Xiang Gao
Zhengbo Xu
Junhan Zhao
Jiaying Liu
DiffM
42
8
0
03 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
49
11
0
02 Jul 2024
Meta 3D Gen
Raphael Bensadoun
Tom Monnier
Yanir Kleiman
Filippos Kokkinos
Yawar Siddiqui
...
Antoine Toisoul
David Novotny
Oran Gafni
Natalia Neverova
Andrea Vedaldi
54
1
0
02 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
70
7
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
92
35
0
02 Jul 2024
Meta 3D TextureGen: Fast and Consistent Texture Generation for 3D Objects
Raphael Bensadoun
Yanir Kleiman
Idan Azuri
Omri Harosh
Andrea Vedaldi
Natalia Neverova
Oran Gafni
50
27
0
02 Jul 2024
Text-Aware Diffusion for Policy Learning
Calvin Luo
Mandy He
Zilai Zeng
Chen Sun
40
4
0
02 Jul 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
VLM
DiffM
97
6
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Li
Fan Ma
Zongxin Yang
Yue Yang
104
11
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
64
0
0
01 Jul 2024
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
Xiyuan Wei
Fanjiang Ye
Ori Yonay
Xingyu Chen
Baixi Sun
Dingwen Tao
Tianbao Yang
VLM
CLIP
79
2
0
01 Jul 2024
An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
Weimin Bai
Yifei Wang
Wenzheng Chen
He Sun
61
9
0
01 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
57
5
0
01 Jul 2024
StyleShot: A Snapshot on Any Style
Junyao Gao
Yanchen Liu
Yanan Sun
Yinhao Tang
Yanhong Zeng
Kai Chen
Cairong Zhao
TTA
3DH
VLM
84
15
0
01 Jul 2024
Controlling Face's Frame generation in StyleGAN's latent space operations: Modifying faces to deceive our memory
Agustín Roca
Nicolás Ignacio Britos
CVBM
38
0
0
30 Jun 2024
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
Mushui Liu
Yuhang Ma
Yang Zhen
Jun Dan
Yunlong Yu
Zeng Zhao
Zhipeng Hu
Bai Liu
Changjie Fan
VLM
DiffM
73
14
0
30 Jun 2024
Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation
Yuchuan Tian
Jianhong Han
Hanting Chen
Yuanyuan Xi
Guoyang Zhang
Jie Hu
Chao Xu
Yunhe Wang
ViT
VLM
57
8
0
30 Jun 2024
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
Ayush Ranjan
Daniel Wen
Karthik Bhat
39
0
0
30 Jun 2024
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
Peng Dai
Feitong Tan
Qiangeng Xu
David Futschik
Ruofei Du
S. Fanello
Xiaojuan Qi
Yinda Zhang
VGen
43
5
0
29 Jun 2024
Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization
Taeyoung Yun
Sujin Yun
Jaewoo Lee
Jinkyoo Park
OffRL
62
5
0
29 Jun 2024
SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting
S. Sabour
Lily Goli
George Kopanas
Mark J. Matthews
Dmitry Lagun
Leonidas Guibas
Alec Jacobson
David J. Fleet
Andrea Tagliasacchi
59
18
0
28 Jun 2024
Wavelets Are All You Need for Autoregressive Image Generation
Wael Mattar
Idan Levy
Nir Sharon
S. Dekel
55
3
0
28 Jun 2024
Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs
S. Jeong
Mingwei Li
Matthew Berger
Shusen Liu
72
0
0
28 Jun 2024
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Nila Masrourisaadat
Nazanin Sedaghatkish
Fatemeh Sarshartehrani
Edward A. Fox
56
6
0
28 Jun 2024
PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
Shufan Li
Harkanwar Singh
Aditya Grover
EGVM
70
2
0
28 Jun 2024
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su
Man Luo
Kris W Pan
Tien Pei Chou
Vasudev Lal
Phillip Howard
68
4
0
28 Jun 2024