Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,750 papers shown
Title
Adversarial Transform Particle Filters
Chengxin Gong
Wei Lin
Cheng Zhang
59
0
0
10 Feb 2025
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation
Vera Soboleva
M. Nakhodnov
Aibek Alanov
52
0
0
09 Feb 2025
Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
Marco Mistretta
Alberto Baldrati
Lorenzo Agnolucci
Marco Bertini
Andrew D. Bagdanov
CLIP
VLM
104
2
0
06 Feb 2025
A Periodic Bayesian Flow for Material Generation
Hanlin Wu
Yuxuan Song
Jingjing Gong
Ziyao Cao
Y. Ouyang
Jianbing Zhang
Hao Zhou
Wei-Ying Ma
Jingjing Liu
DiffM
73
2
0
04 Feb 2025
Information-Theoretic Proofs for Diffusion Sampling
Galen Reeves
H. Pfister
DiffM
100
0
0
04 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
60
0
0
03 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
66
1
0
02 Feb 2025
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Liangchen Li
Caoliwen Wang
Yuqi Zhou
Bailin Deng
Juyong Zhang
3DV
37
0
0
01 Feb 2025
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Nuwan T. Attygalle
M. Kljun
Aaron Quigley
Klen Copic Pucihar
Jens Grubert
...
Juri Yoneyama
Alice Toniolo
Angela Miguel
Hirokazu Kato
M. Weerasinghe
DiffM
83
0
0
28 Jan 2025
An analysis of the noise schedule for score-based generative models
SU StanislasStrasman
Antonio Ocello
Claire Boyer Lpsm
Sylvain Le Corff Lpsm
Vincent Lemaire
DiffM
103
4
0
28 Jan 2025
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Xiaoyu Xiang
Liat Sless Gorelik
Yuchen Fan
Omri Armstrong
Forrest N. Iandola
Yilei Li
Ita Lifshitz
Rakesh Ranjan
3DGS
DiffM
109
2
0
28 Jan 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Daniel Gehrig
Yuqian Fu
Luc Van Gool
62
6
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
38
0
0
28 Jan 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Jinwei Dong
Xinsheng Wang
Qirong Mao
63
0
0
28 Jan 2025
Visual Generation Without Guidance
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
57
0
0
28 Jan 2025
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
Adil Kaan Akan
Yucel Yemez
DiffM
OCL
42
0
0
27 Jan 2025
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance
Yinan Zheng
Ruiming Liang
Kexin Zheng
Jinliang Zheng
Liyuan Mao
...
Weihao Gu
Rui Ai
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
71
6
0
26 Jan 2025
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
Zehong Yan
Peng Qi
W. Hsu
M. Lee
47
0
0
24 Jan 2025
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols
John Joon Young Chung
Melissa Roemmele
Max Kreminski
VGen
75
0
0
23 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
86
0
0
22 Jan 2025
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
M. Gwilliam
Han Cai
Di Wu
Abhinav Shrivastava
Zhiyu Cheng
92
0
0
22 Jan 2025
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
MLLM
VLM
LRM
109
6
0
21 Jan 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi
Shahar Yadin
Roni Paiss
Omer Tov
Shiran Zada
Ariel Ephrat
T. Michaeli
Inbar Mosseri
Tali Dekel
DiffM
103
2
0
21 Jan 2025
Block Flow: Learning Straight Flow on Data Blocks
Zibin Wang
Zhiyuan Ouyang
Xiangyun Zhang
45
0
0
20 Jan 2025
Nested Annealed Training Scheme for Generative Adversarial Networks
Chang Wan
Ming-Hsuan Yang
Minglu Li
Yunliang Jiang
Zhonglong Zheng
GAN
43
0
0
20 Jan 2025
DPCL-Diff: The Temporal Knowledge Graph Reasoning Based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning
Yukun Cao
Lisheng Wang
Luobing Huang
DiffM
47
0
0
20 Jan 2025
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Haohan Wang
Wei Feng
Yang Lu
Yaoyu Li
Zheng Zhang
Jingjing Lv
Xin Zhu
Jun-Jun Shen
DiffM
83
5
0
20 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
79
4
0
20 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
44
1
0
20 Jan 2025
Model Synthesis for Zero-Shot Model Attribution
Tianyun Yang
Juan Cao
Danding Wang
Chang Xu
WIGM
78
4
0
20 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
39
1
0
20 Jan 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
76
14
0
17 Jan 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
85
63
0
17 Jan 2025
TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping
Despina Konstantinidou
C. Koutlis
Symeon Papadopoulos
81
2
0
17 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
105
18
0
17 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DH
MDE
48
0
0
15 Jan 2025
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints
Jonathan Nöther
Adish Singla
Goran Radanović
AAML
59
0
0
14 Jan 2025
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
Tharun Anand
Aryan Garg
Kaushik Mitra
VGen
DiffM
52
0
0
13 Jan 2025
Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning
Maomao Li
Lijian Lin
Yunfei Liu
Ye Zhu
Yu Li
DiffM
VGen
44
0
0
11 Jan 2025
Has an AI model been trained on your images?
Matyáš Boháček
Hany Farid
40
0
0
11 Jan 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
J. T. Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
52
1
0
10 Jan 2025
TextToucher: Fine-Grained Text-to-Touch Generation
Jiahang Tu
Hao Fu
Fengyu Yang
Hanbin Zhao
Chao Zhang
Hui Qian
VLM
DiffM
83
8
0
10 Jan 2025
MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
Daniele Molino
Francesco Di Feola
E. Faiella
Deborah Fazzini
D. Santucci
Linlin Shen
V. Guarrasi
Paolo Soda
SyDa
MedIm
44
0
0
10 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
88
31
0
10 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
Xinyu Wang
DiffM
43
5
0
08 Jan 2025
Concept Matching with Agent for Out-of-Distribution Detection
YuXiao Lee
Xiaofeng Cao
Jingcai Guo
Wei Ye
Qing Guo
Yi Chang
69
0
0
08 Jan 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Zhengyuan Yang
P. Wang
Junyang Lin
Xinyu Wang
Wenyu Liu
DiffM
45
0
0
08 Jan 2025
On the Mode-Seeking Properties of Langevin Dynamics
Xiwei Cheng
Kexin Fu
Farzan Farnia
67
0
0
08 Jan 2025
Unity by Diversity: Improved Representation Learning in Multimodal VAEs
Thomas M. Sutter
Yang Meng
Andrea Agostini
Daphné Chopard
Norbert Fortin
Julia E. Vogt
Bahbak Shahbaba
Stephan Mandt
SSL
54
2
0
08 Jan 2025
Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance
Marta Gentiloni-Silveri
Antonio Ocello
40
2
0
04 Jan 2025
Previous
1
2
3
...
8
9
10
...
93
94
95
Next