2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,757 papers shown
Title
TurboEdit: Instant text-based image editing
Zongze Wu
Nicholas I. Kolkin
Jonathan Brandt
Richard Zhang
Eli Shechtman
DiffM
54
11
0
14 Aug 2024
3D Gaussian Editing with A Single Image
Guan Luo
Tian-Xing Xu
Ying-Tian Liu
Xiao-Xiong Fan
Fang-Lue Zhang
Song-Hai Zhang
3DGS
51
5
0
14 Aug 2024
DiffSteISR: Harnessing Diffusion Prior for Superior Real-world Stereo Image Super-Resolution
Yuanbo Zhou
Xinlin Zhang
Wei Deng
Tao Wang
Tao Tan
Qinquan Gao
Tong Tong
42
0
0
14 Aug 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
Fahad Shahbaz Khan
Hideki Koike
DiffM
44
0
0
14 Aug 2024
Controlling the World by Sleight of Hand
Sruthi Sudhakar
Ruoshi Liu
Basile Van Hoorick
Carl Vondrick
Richard Zemel
54
4
0
13 Aug 2024
DiffLoRA: Generating Personalized Low-Rank Adaptation Weights with Diffusion
Yujia Wu
Yiming Shi
Jiwei Wei
Chengwei Sun
Yuyang Zhou
Yang Yang
Heng Tao Shen
48
3
0
13 Aug 2024
DC3DO: Diffusion Classifier for 3D Objects
Nursena Koprucu
Meher Shashwat Nigam
Shicheng Xu
Biruk Abere
Gabriele Dominici
Andrew Rodriguez
Sharvaree Vadgam
Berfin Inal
Alberto Tono
DiffM
32
0
0
13 Aug 2024
EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
Ruei-Che Chang
Yuxuan Liu
Lotus Zhang
Anhong Guo
DiffM
46
2
0
13 Aug 2024
ViMo: Generating Motions from Casual Videos
Liangdong Qiu
Chengxing Yu
Yanran Li
Zhao Wang
Haibin Huang
Chongyang Ma
Di Zhang
Pengfei Wan
Xiaoguang Han
VGen
47
2
0
13 Aug 2024
An Analysis for Image-to-Image Translation and Style Transfer
Xiaoming Yu
Jie Tian
Zhenhua Hu
VLM
51
0
0
12 Aug 2024
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
61
20
0
12 Aug 2024
A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models
Taehong Moon
Moonseok Choi
Eunggu Yun
Jongmin Yoon
Gayoung Lee
Jaewoong Cho
Juho Lee
44
4
0
12 Aug 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
Hila Gonen
Terra Blevins
Alisa Liu
Luke Zettlemoyer
Noah A. Smith
36
5
0
12 Aug 2024
LaWa: Using Latent Space for In-Generation Image Watermarking
Ahmad Rezaei
Mohammad Akbari
Saeed Ranjbar Alvar
Arezou Fatemi
Yong Zhang
WIGM
54
13
0
11 Aug 2024
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Yifan Pu
Zhuofan Xia
Jiayi Guo
Dongchen Han
Qixiu Li
...
Ji Li
Yizeng Han
Shiji Song
Gao Huang
Xiu Li
69
12
0
11 Aug 2024
HateSieve: A Contrastive Learning Framework for Detecting and Segmenting Hateful Content in Multimodal Memes
Xuanyu Su
Yansong Li
Diana Inkpen
Nathalie Japkowicz
VLM
89
2
0
11 Aug 2024
Misrepresented Technological Solutions in Imagined Futures: The Origins and Dangers of AI Hype in the Research Community
Savannah Thais
46
3
0
08 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
45
0
0
08 Aug 2024
Cross-View Meets Diffusion: Aerial Image Synthesis with Geometry and Text Guidance
Ahmad Arrabi
Xiaohan Zhang
Waqas Sultani
Chong Chen
S. Wshah
DiffM
33
4
0
08 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
18
0
0
07 Aug 2024
Prompt and Prejudice
Lorenzo Berlincioni
Luca Cultrera
Federico Becattini
Marco Bertini
A. Bimbo
51
0
0
07 Aug 2024
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis
Zebin Yao
Fangxiang Feng
Ruifan Li
Xiaojie Wang
DiffM
44
1
0
07 Aug 2024
Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey
V. T. Truong
Luan Ba Dang
Long Bao Le
DiffM
MedIm
65
17
0
06 Aug 2024
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts
Ciara Rowles
Shimon Vainer
Dante De Nigris
Slava Elizarov
Konstantin Kutsy
Simon Donné
DiffM
56
9
0
06 Aug 2024
FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
Zhi Chen
Zecheng Zhao
Yadan Luo
Zi Huang
DiffM
48
4
0
06 Aug 2024
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection
Sen Nie
Zhuo Wang
Xinxin Wang
Kun He
DiffM
81
0
0
06 Aug 2024
LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba
Yunxiang Fu
Chaoqi Chen
Yizhou Yu
Mamba
84
3
0
05 Aug 2024
Fairness and Bias Mitigation in Computer Vision: A Survey
Sepehr Dehdashtian
Ruozhen He
Yi Li
Guha Balakrishnan
Nuno Vasconcelos
Vicente Ordonez
Vishnu Boddeti
49
4
0
05 Aug 2024
A Sharp Convergence Theory for The Probability Flow ODEs of Diffusion Models
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
DiffM
47
22
0
05 Aug 2024
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models
Agneet Chatterjee
Yiran Luo
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
45
5
0
05 Aug 2024
ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation
Jack Lu
Ryan Teehan
Mengye Ren
DiffM
42
3
0
05 Aug 2024
Dense Feature Interaction Network for Image Inpainting Localization
Ye Yao
Tingfeng Han
Shan Jia
Siwei Lyu
38
1
0
05 Aug 2024
PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance
Aoming Liu
Zhong Li
Zhang Chen
Nannan Li
Yinghao Xu
Bryan A. Plummer
49
4
0
04 Aug 2024
AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning
Xin Wang
Kai-xiang Chen
Xingjun Ma
Zhineng Chen
Jingjing Chen
Yu-Gang Jiang
AAML
53
4
0
04 Aug 2024
Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI
Robert Wolfe
Aayushi Dangol
Alexis Hiniker
Bill Howe
36
5
0
04 Aug 2024
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang
Zongqing Lu
LM&Ro
53
2
0
04 Aug 2024
iControl3D: An Interactive System for Controllable 3D Scene Generation
Yuxiang Yang
Yizheng Wu
Jun Cen
Juewen Peng
Jing Zhang
Ke Xian
Zhe Wang
Zhiguo Cao
Guo-Shing Lin
44
0
0
03 Aug 2024
Adaptive Planning with Generative Models under Uncertainty
Pascal Jutras-Dubé
Ruqi Zhang
Aniket Bera
41
2
0
02 Aug 2024
Conditional LoRA Parameter Generation
Aaron Mueller
Millicent Li
Koyena Pal
Wangbo Zhao
Yukun Zhou
Jiuding Sun
Yonatan Belinkov
DiffM
46
4
0
02 Aug 2024
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain
Steven A. Grosz
A. Namboodiri
Anil K. Jain
DiffM
53
2
0
02 Aug 2024
VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling
Qian Zhang
Xiangzi Dai
Ninghua Yang
Xiang An
Ziyong Feng
Xingyu Ren
VLM
CLIP
43
17
0
02 Aug 2024
FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation
Xiang Gao
Jiaying Liu
59
2
0
02 Aug 2024
Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention
Mengkang Hu
DiffM
56
8
0
01 Aug 2024
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models
Gilad Deutch
Rinon Gal
Daniel Garibi
Or Patashnik
Daniel Cohen-Or
DiffM
48
22
0
01 Aug 2024
Jailbreaking Text-to-Image Models with LLM-Based Agents
Yingkai Dong
Zheng Li
Xiangtao Meng
Ning Yu
Shanqing Guo
LLMAG
45
14
0
01 Aug 2024
Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion
Monika Zimmermann
Jacek Naruniec
Christopher Schroers
Markus Gross
Romann M. Weber
VGen
DiffM
59
4
0
01 Aug 2024
DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving
Xuemeng Yang
Licheng Wen
Yukai Ma
Jianbiao Mei
Xin Li
...
Min Dou
Botian Shi
Liang He
Yong-Jin Liu
Yu Qiao
VGen
54
18
0
01 Aug 2024
Few-shot Defect Image Generation based on Consistency Modeling
Qingfeng Shi
Jing Wei
Fei Shen
Zheng Zhang
35
2
0
01 Aug 2024
Memorization Capacity for Additive Fine-Tuning with Small ReLU Networks
Jy-yong Sohn
Dohyun Kwon
Seoyeon An
Kangwook Lee
56
0
0
01 Aug 2024
A Simple Background Augmentation Method for Object Detection with Diffusion Model
Yuhang Li
Jun Gao
Chen Chen
Yue Zhang
Jielei Zhang
DiffM
48
5
0
01 Aug 2024