2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,753 papers shown
Title
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Yaopei Zeng
Yuanpu Cao
Bochuan Cao
Yurui Chang
Jinghui Chen
Lu Lin
DiffM
46
3
0
28 Oct 2024
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Viacheslav Surkov
Chris Wendler
Mikhail Terekhov
Justin Deschenaux
Robert West
Çağlar Gülçehre
VLM
43
14
0
28 Oct 2024
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
Zhendong Wang
Zhiyu Li
Ajay Mandlekar
Zhenjia Xu
Jiaojiao Fan
...
Yuke Zhu
Yogesh Balaji
Mingyuan Zhou
Xuan Li
Yu Zeng
45
16
0
28 Oct 2024
Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models
Wenda Li
Huijie Zhang
Qing Qu
WIGM
54
2
0
28 Oct 2024
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
51
7
0
28 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
30
4
0
28 Oct 2024
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments
Yuzhe Yang
Yipeng Du
Ahmad Farhan
Claudio Angione
Yue Zhao
Harry Yang
Fielding Johnston
James Buban
Patrick Colangelo
29
0
0
28 Oct 2024
On learning higher-order cumulants in diffusion models
Gert Aarts
Diaa E. Habibi
Lei Wang
K. Zhou
31
4
0
28 Oct 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
52
2
0
28 Oct 2024
Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models!
Arash Marioriyad
Mohammadali Banayeeanzade
Reza Abbasi
M. Rohban
M. Baghshah
DiffM
78
3
0
28 Oct 2024
Image Generation from Image Captioning -- Invertible Approach
Nandakishore S Menon
Chandramouli Kamanchi
Raghuram Bharadwaj Diddigi
19
0
0
26 Oct 2024
Diff-CXR: Report-to-CXR generation through a disease-knowledge enhanced diffusion model
Peng Huang
Bowen Guo
Shuyu Liang
Junhu Fu
Yuanyuan Wang
Yi Guo
DiffM
MedIm
42
1
0
26 Oct 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Liulei Li
Wenguan Wang
Yue Yang
42
7
0
26 Oct 2024
Super-resolved virtual staining of label-free tissue using diffusion models
Yijie Zhang
Luzhe Huang
N. Pillar
Yuan Li
Hanlong Chen
Aydogan Ozcan
37
3
0
26 Oct 2024
Transferable Adversarial Attacks on SAM and Its Downstream Models
Song Xia
Wenhan Yang
Yi Yu
Xun Lin
Henghui Ding
Lingyu Duan
Xudong Jiang
AAML
SILM
66
6
0
26 Oct 2024
Adversarial Environment Design via Regret-Guided Diffusion Models
Hojun Chung
Junseo Lee
Minsoo Kim
Dohyeong Kim
Songhwai Oh
31
0
0
25 Oct 2024
DiffGS: Functional Gaussian Splatting Diffusion
Junsheng Zhou
Weiqi Zhang
Yu-Shen Liu
3DGS
42
15
0
25 Oct 2024
Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series
Ilan Naiman
Nimrod Berman
Itai Pemper
Idan Arbiv
Gal Fadlon
Omri Azencot
32
12
0
25 Oct 2024
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
Z. Gong
Guangyin Bao
Qi Zhang
Zhongwei Wan
Duoqian Miao
...
Changwei Wang
Rongtao Xu
Liang Hu
Ke Liu
Yu Zhang
DiffM
VGen
53
9
0
25 Oct 2024
Generative Diffusion Models for Sequential Recommendations
Sharare Zolghadr
Ole Winther
Paul Jeha
DiffM
30
0
0
25 Oct 2024
Flow Generator Matching
Zemin Huang
Zhengyang Geng
Weijian Luo
Guo-jun Qi
49
9
0
25 Oct 2024
Non-rigid Relative Placement through 3D Dense Diffusion
Eric Cai
Octavian A. Donca
Ben Eisner
David Held
36
0
0
25 Oct 2024
Structured Diffusion Models with Mixture of Gaussians as Prior Distribution
Nanshan Jia
Tingyu Zhu
Haoyu Liu
Zeyu Zheng
DiffM
28
2
0
24 Oct 2024
BIFRÖST: 3D-Aware Image compositing with Language Instructions
Lingxiao Li
Kaixiong Gong
Weihong Li
Xili Dai
Tao Chen
Xiaojun Yuan
Xiangyu Yue
31
2
0
24 Oct 2024
Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Junyi Chen
Di Huang
Weicai Ye
Wanli Ouyang
Tong He
LRM
41
2
0
24 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
40
6
0
24 Oct 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
Haonan Lin
Mengmeng Wang
Jiahao Wang
Wenbin An
Yan Chen
Yong Liu
Feng Tian
Guang Dai
Jingdong Wang
Qianying Wang
DiffM
53
8
0
24 Oct 2024
FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
Christopher T. H. Teo
Milad Abdollahzadeh
Xinda Ma
Ngai-man Cheung
DiffM
26
1
0
24 Oct 2024
Beyond Color and Lines: Zero-Shot Style-Specific Image Variations with Coordinated Semantics
Jinghao Hu
Yuhe Zhang
Guohua Geng
Liuyuxin Yang
JiaRui Yan
Jingtao Cheng
YaDong Zhang
Kang Li
DiffM
43
0
0
24 Oct 2024
WAFFLE: Multi-Modal Model for Automated Front-End Development
Shanchao Liang
Nan Jiang
Shangshu Qian
Lin Tan
19
0
0
24 Oct 2024
Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation
Xiaoyu Zhang
Teng Zhou
Xinlong Zhang
Jia Wei
Yongchuan Tang
49
1
0
24 Oct 2024
Fast constrained sampling in pre-trained diffusion models
Alexandros Graikos
Nebojsa Jojic
Dimitris Samaras
DiffM
30
1
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
11
0
24 Oct 2024
ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference
Xin He
Shunkang Zhang
Yuxin Wang
Haiyan Yin
Zihao Zeng
Shaohuai Shi
Zhenheng Tang
Xiaowen Chu
Ivor Tsang
Ong Yew Soon
MoE
65
4
0
23 Oct 2024
Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation
Wenfang Yao
Chen Liu
Kejing Yin
W. K. Cheung
Jing Qin
31
1
0
23 Oct 2024
TAGE: Trustworthy Attribute Group Editing for Stable Few-shot Image Generation
Ruicheng Zhang
Guoheng Huang
Yejing Huo
Xiaochen Yuan
Zhizhen Zhou
Xuhang Chen
Guo Zhong
28
0
0
23 Oct 2024
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?
Jiahua Dong
Wenqi Liang
Hongliu Li
Duzhen Zhang
Meng Cao
Henghui Ding
Salman Khan
Fahad Shahbaz Khan
DiffM
68
9
0
23 Oct 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Hao Peng
Lingpeng Kong
AI4CE
78
17
0
23 Oct 2024
Dual-Model Defense: Safeguarding Diffusion Models from Membership Inference Attacks through Disjoint Data Splitting
Bao Q. Tran
Viet Anh Nguyen
Anh Tran
Toan M. Tran
31
0
0
22 Oct 2024
Progressive Compositionality in Text-to-Image Generative Models
Xu Han
Linghao Jin
Xiaofeng Liu
Paul Pu Liang
CoGe
106
2
0
22 Oct 2024
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras
Weili Nie
Karsten Kreis
A. Dimakis
Morteza Mardani
Nikola B. Kovachki
Arash Vahdat
DiffM
37
6
0
21 Oct 2024
Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods
Adam Phillips
Daniel Grandes Rodriguez
Miriam Sánchez-Manzano
Alan Salvadó
Manuel Garin
G. Haro
C. Ballester
34
0
0
21 Oct 2024
Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation
Anh-Vu Bui
L. Vuong
Khanh Doan
Trung Le
Paul Montague
Tamas Abraham
Dinh Q. Phung
KELM
DiffM
32
9
0
21 Oct 2024
TexPro: Text-guided PBR Texturing with Procedural Material Modeling
Ziqiang Dang
Wenqi Dong
Zesong Yang
Bangbang Yang
Liang Li
Yuewen Ma
Zhaopeng Cui
DiffM
47
1
0
21 Oct 2024
Generative AI Agents in Autonomous Machines: A Safety Perspective
Jason J. Jabbour
Vijay Janapa Reddi
AI4CE
51
4
0
20 Oct 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
DiffM
VGen
259
2
0
20 Oct 2024
FastSTI: A Fast Conditional Pseudo Numerical Diffusion Model for Spatio-temporal Traffic Data Imputation
Shaokang Cheng
Nada Osman
Shiru Qu
Lamberto Ballan
DiffM
30
0
0
20 Oct 2024
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Alan Dao
Dinh Bach Vu
Huy Hoang Ha
AuLLM
VLM
73
3
0
20 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
52
12
0
19 Oct 2024
"Confrontation or Acceptance": Understanding Novice Visual Artists' Perception towards AI-assisted Art Creation
Shuning Zhang
Shixuan Li
33
1
0
19 Oct 2024