2112.10741
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
20 December 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
Papers citing
"GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models"
50 / 2,604 papers shown
Title
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark
Edward Vendrow
Omiros Pantazis
Alexander Shepard
Gabriel J. Brostow
Kate E. Jones
Oisin Mac Aodha
Sara Beery
Grant Van Horn
VLM
43
3
0
04 Nov 2024
Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Yiming Wu
Wei Ji
Haoran Liang
Ronghua Liang
37
1
0
03 Nov 2024
Denoising Fisher Training For Neural Implicit Samplers
Weijian Luo
Wei Deng
38
0
0
03 Nov 2024
Infinite-Resolution Integral Noise Warping for Diffusion Models
Yitong Deng
Winnie Lin
Lingxiao Li
Dmitriy Smirnov
Ryan Burgert
Ning Yu
Vincent Dedun
Mohammad H. Taghavi
36
2
0
02 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
Masayoshi Tomizuka
Weidong Zhan
DiffM
43
2
0
02 Nov 2024
TextDestroyer: A Training- and Annotation-Free Diffusion Method for Destroying Anomal Text from Images
Mengcheng Li
Mingbao Lin
Chia-Wen Lin
Rongrong Ji
DiffM
56
0
0
01 Nov 2024
Scaling Concept With Text-Guided Diffusion Models
Chao Huang
Susan Liang
Yunlong Tang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
59
6
0
31 Oct 2024
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
Xinwang Chen
Ning Liu
Bo Li
Feifei Feng
Jian Tang
42
2
0
31 Oct 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
95
9
0
31 Oct 2024
MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts
Jie Zhu
Y. Chen
Mingyu Ding
Ping Luo
Leye Wang
Jingdong Wang
DiffM
42
4
0
30 Oct 2024
Exploiting Phonological Similarities between African Languages to achieve Speech to Speech Translation
P. Ochieng
D. Kaburu
28
0
0
30 Oct 2024
Consistency Diffusion Bridge Models
Guande He
Kaiwen Zheng
Jianfei Chen
Fan Bao
Jun-Jie Zhu
DiffM
72
3
0
30 Oct 2024
Unlocking Point Processes through Point Set Diffusion
David Lüdke
Enric Rabasseda Raventós
Marcel Kollovieh
Stephan Günnemann
DiffM
24
2
0
29 Oct 2024
Volumetric Conditioning Module to Control Pretrained Diffusion Models for 3D Medical Images
Suhyun Ahn
Wonjung Park
Jihoon Cho
Seunghyuck Park
Jinah Park
MedIm
31
0
0
29 Oct 2024
DiffSTR: Controlled Diffusion Models for Scene Text Removal
Sanhita Pathak
V. Kaushik
Brejesh Lall
DiffM
33
0
0
29 Oct 2024
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework
V. Arkhipkin
Viacheslav Vasilev
Andrei Filatov
Igor Pavlov
Julia Agafonova
...
Evelina Mironova
Anton Bukashkin
Konstantin Kulikov
Andrey Kuznetsov
Denis Dimitrov
DiffM
31
3
0
28 Oct 2024
MovieCharacter: A Tuning-Free Framework for Controllable Character Video Synthesis
Di Qiu
Zheng Chen
Rui Wang
Mingyuan Fan
Changqian Yu
Junshi Huan
Xiang Wen
VGen
51
7
0
28 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
28
4
0
28 Oct 2024
FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space
Yiyang Guo
Ruizhe Li
Mude Hui
Hanzhong Guo
Chen Zhang
Chuangjian Cai
Le Wan
Shangfei Wang
21
0
0
28 Oct 2024
Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models
Liulei Li
Wenguan Wang
Yuqing Yang
42
7
0
26 Oct 2024
Transferable Adversarial Attacks on SAM and Its Downstream Models
Song Xia
Wenhan Yang
Yi Yu
Xun Lin
Henghui Ding
Lingyu Duan
Xudong Jiang
AAML
SILM
66
6
0
26 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
38
6
0
24 Oct 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
Haonan Lin
Mengmeng Wang
Jiahao Wang
Wenbin An
Yan Chen
Yong Liu
Feng Tian
Guang Dai
Jingdong Wang
Qianying Wang
DiffM
50
8
0
24 Oct 2024
FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
Christopher T. H. Teo
Milad Abdollahzadeh
Xinda Ma
Ngai-man Cheung
DiffM
26
1
0
24 Oct 2024
Fast constrained sampling in pre-trained diffusion models
Alexandros Graikos
Nebojsa Jojic
Dimitris Samaras
DiffM
30
1
0
24 Oct 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
97
11
0
24 Oct 2024
TopoDiffusionNet: A Topology-aware Diffusion Model
Saumya Gupta
Dimitris Samaras
Chong Chen
DiffM
44
4
0
22 Oct 2024
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
Manan Suri
Puneet Mathur
Franck Dernoncourt
R. Jain
Vlad I. Morariu
Ramit Sawhney
Preslav Nakov
Dinesh Manocha
39
3
0
21 Oct 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
DiffM
VGen
238
2
0
20 Oct 2024
Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step
Mingyuan Zhou
Huangjie Zheng
Yi Gu
Zhendong Wang
Hai Huang
DiffM
56
6
0
19 Oct 2024
HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation
Bo Cheng
Yuhang Ma
Liebucha Wu
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Dawei Leng
Yuhui Yin
DiffM
35
8
0
18 Oct 2024
HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects
Oliverio Theophilus Nathanael
Jonathan Samuel Lumentut
Nicholas Hans Muliawan
Edbert Valencio Angky
Felix Indra Kurniadi
Alfi Yusrotis Zakiyyah
Jeklin Harefa
DiffM
33
0
0
18 Oct 2024
ERDDCI: Exact Reversible Diffusion via Dual-Chain Inversion for High-Quality Image Editing
Jimin Dai
Wenjie Qu
Shuo Chen
Jian Yang
Lei Luo
DiffM
28
0
0
18 Oct 2024
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Chengyue Wu
Xiaokang Chen
Z. F. Wu
Yiyang Ma
Xingchao Liu
...
Wen Liu
Zhenda Xie
Xingkai Yu
Chong Ruan
Ping Luo
AI4TS
60
78
0
17 Oct 2024
Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration
Yun-Yen Chuang
Hung-Min Hsu
Kevin Lin
Chen-Sheng Gu
Ling Zhen Li
Ray-I Chang
Hung-yi Lee
DiffM
VLM
36
0
0
17 Oct 2024
FAMSeC: A Few-shot-sample-based General AI-generated Image Detection Method
Juncong Xu
Yang Yang
Han Fang
Honggu Liu
Weiming Zhang
37
1
0
17 Oct 2024
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction
Patrick Kwon
Hanbyul Joo
31
3
0
17 Oct 2024
FaceChain-FACT: Face Adapter with Decoupled Training for Identity-preserved Personalization
Cheng Yu
Haoyu Xie
Lei Shang
Yong-Jin Liu
Jun Dan
Liefeng Bo
Baigui Sun
24
2
0
16 Oct 2024
Off-dynamics Conditional Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffM
OffRL
44
0
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
41
0
0
16 Oct 2024
DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models
Zhengyang Yu
Zhaoyuan Yang
Jing Zhang
DiffM
30
2
0
15 Oct 2024
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling
Guiyu Zhang
Huan-ang Gao
Zijian Jiang
Hao Zhao
Zhedong Zheng
EGVM
57
6
0
15 Oct 2024
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Rasoul Shafipour
David Harrison
Maxwell Horton
Jeffrey Marker
Houman Bedayat
Sachin Mehta
Mohammad Rastegari
Mahyar Najibi
Saman Naderiparizi
MQ
57
0
0
14 Oct 2024
Saliency Guided Optimization of Diffusion Latents
Xiwen Wang
Jizhe Zhou
Xuekang Zhu
Cheng Li
Mao Li
EGVM
23
0
0
14 Oct 2024
ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization
Jiawei Li
Fanrui Zhang
Jiaying Zhu
Esther Sun
Qiang Zhang
Zheng-jun Zha
MLLM
57
9
0
14 Oct 2024
MagicEraser: Erasing Any Objects via Semantics-Aware Control
Fan Li
Zixiao Zhang
Yi Huang
Jianzhuang Liu
Renjing Pei
Bin Shao
Songcen Xu
DiffM
44
7
0
14 Oct 2024
Learning to Customize Text-to-Image Diffusion In Diverse Context
Taewook Kim
Wei Chen
Qiang Qiu
DiffM
38
2
0
14 Oct 2024
TextMaster: Universal Controllable Text Edit
Aoqiang Wang
Yufei Guo
Zhenyu Yan
Wenxiang Shang
Ran Lin
Zhao Zhang
DiffM
28
2
0
13 Oct 2024
EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models
Eungbean Lee
Somi Jeong
Kwanghoon Sohn
DiffM
30
1
0
13 Oct 2024
Generating Intermediate Representations for Compositional Text-To-Image Generation
Ran Galun
Sagie Benaim
25
0
0
13 Oct 2024