Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.14828
Cited By
Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models
23 May 2024
Katherine Xu
Lingzhi Zhang
Jianbo Shi
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Good Seed Makes a Good Crop: Discovering Secret Seeds in Text-to-Image Diffusion Models"
49 / 49 papers shown
Title
Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model
Kwanyoung Kim
Sanghyun Kim
DiffM
VGen
158
0
0
23 May 2025
PiCo: Enhancing Text-Image Alignment with Improved Noise Selection and Precise Mask Control in Diffusion Models
Chang Xie
Chenyi Zhuang
Pan Gao
VLM
48
0
0
06 May 2025
Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing
Joowon Kim
Ziseok Lee
Donghyeon Cho
Sanghyun Jo
Y. Jung
Kyungsu Kim
Eunho Yang
DiffM
61
0
0
18 Apr 2025
RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism
E. Peruzzo
Dejia Xu
Xingqian Xu
Humphrey Shi
N. Sebe
DiffM
VGen
69
0
0
09 Apr 2025
ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos
Haolin Yang
Feilong Tang
Ming Hu
Yulong Li
Junjie Guo
...
Zelin Peng
Junjun He
Junjun He
Zongyuan Ge
Imran Razzak
DiffM
VGen
172
2
0
20 Mar 2025
Language-based Image Colorization: A Benchmark and Beyond
Yongqian Li
Shuai Yang
Jiaying Liu
DiffM
VLM
64
0
0
19 Mar 2025
TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
Ryugo Morita
Stanislav Frolov
Brian B. Moser
Takahiro Shirakawa
Ko Watanabe
Andreas Dengel
Jinjia Zhou
DiffM
106
0
0
23 Nov 2024
ZeroComp: Zero-shot Object Compositing from Image Intrinsics via Diffusion
Zitian Zhang
Frédéric Fortier-Chouinard
Mathieu Garon
Anand Bhattad
Jean-François Lalonde
DiffM
65
4
0
10 Oct 2024
Questionable practices in machine learning
Gavin Leech
Juan J. Vazquez
Misha Yagudin
Niclas Kupper
Laurence Aitchison
74
6
0
17 Jul 2024
Beyond Aesthetics: Cultural Competence in Text-to-Image Models
Nithish Kannen
Arif Ahmad
Marco Andreetto
Vinodkumar Prabhakaran
Utsav Prabhu
Adji Bousso Dieng
Pushpak Bhattacharyya
Shachi Dave
66
18
0
09 Jul 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
39
3
0
17 Apr 2024
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Xiefan Guo
Jinlin Liu
Miaomiao Cui
Jiankai Li
Hongyu Yang
Di Huang
59
30
0
06 Apr 2024
Uncovering the Text Embedding in Text-to-Image Diffusion Models
Huikang Yu
Hao Luo
Fan Wang
Feng Zhao
49
10
0
01 Apr 2024
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
Michael Toker
Hadas Orgad
Mor Ventura
Dana Arad
Yonatan Belinkov
DiffM
72
13
0
09 Mar 2024
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Bin Cao
Jianhao Yuan
Yexin Liu
Jian Li
Shuyang Sun
Jing Liu
Bo Zhao
DiffM
76
7
0
28 Feb 2024
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Ling Yang
Zhaochen Yu
Chenlin Meng
Minkai Xu
Stefano Ermon
Tengjiao Wang
CoGe
DiffM
68
127
0
22 Jan 2024
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Lihe Yang
Bingyi Kang
Zilong Huang
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
VLM
177
744
0
19 Jan 2024
Adversarial Diffusion Distillation
Axel Sauer
Dominik Lorenz
A. Blattmann
Robin Rombach
160
364
0
28 Nov 2023
Synthetic Shifts to Initial Seed Vector Exposes the Brittle Nature of Latent-Based Diffusion Models
Poyuan Mao
Shashank Kotyan
Tham Yik Foong
Danilo Vasconcellos Vargas
58
5
0
24 Nov 2023
Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models
Gabriele Corso
Yilun Xu
Valentin De Bortoli
Regina Barzilay
Tommi Jaakkola
DiffM
34
21
0
19 Oct 2023
Improved Baselines with Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
82
2,593
0
05 Oct 2023
PixArt-
α
α
α
: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
70
414
0
30 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
59
138
0
20 Sep 2023
TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation
P. Grimal
Hervé Le Borgne
Olivier Ferret
Julien Tourille
EGVM
57
11
0
11 Jul 2023
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
Yujun Shi
Chuhui Xue
Jun Hao Liew
Jiachun Pan
Hanshu Yan
Wenqing Zhang
Vincent Y. F. Tan
Song Bai
56
210
0
26 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
69
274
0
15 Jun 2023
Diffusion Self-Guidance for Controllable Image Generation
Dave Epstein
Allan Jabri
Ben Poole
Alexei A. Efros
Aleksander Holynski
60
249
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffM
VLM
82
44
0
01 Jun 2023
Guided Image Synthesis via Initial Image Editing in Diffusion Model
Jiafeng Mao
Xueting Wang
Kiyoharu Aizawa
DiffM
54
52
0
05 May 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion Models
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
60
116
0
20 Mar 2023
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
Qiucheng Wu
Yujian Liu
Handong Zhao
Ajinkya Kale
T. Bui
Tong Yu
Zhe Lin
Yang Zhang
Shiyu Chang
DiffM
CoGe
44
99
0
16 Dec 2022
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
Narek Tumanyan
Michal Geyer
Shai Bagon
Tali Dekel
111
660
0
22 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM
MoE
125
811
0
02 Nov 2022
Improving Deep Neural Network Random Initialization Through Neuronal Rewiring
Leonardo F. S. Scabini
B. De Baets
Odemir M. Bruno
AI4CE
38
7
0
17 Jul 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
158
1,089
0
22 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
59
360
0
02 Jun 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
276
6,768
0
13 Apr 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
256
15,081
0
20 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
178
2,315
0
02 Dec 2021
Instance-wise Occlusion and Depth Orders in Natural Scenes
Hyunmin Lee
Jaesik Park
3DV
46
26
0
29 Nov 2021
Torch.manual_seed(3407) is all you need: On the influence of random seeds in deep learning architectures for computer vision
David Picard
3DV
VLM
46
90
0
16 Sep 2021
Accounting for Variance in Machine Learning Benchmarks
Xavier Bouthillier
Pierre Delaunay
Mirko Bronzi
Assya Trofimov
Brennan Nichyporuk
...
Dmitriy Serdyuk
Tal Arbel
C. Pal
Gaël Varoquaux
Pascal Vincent
58
149
0
01 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
671
28,659
0
26 Feb 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
140
7,166
0
06 Oct 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
199
10,591
0
17 Feb 2020
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
Alina Kuznetsova
H. Rom
N. Alldrin
J. Uijlings
Ivan Krasin
...
S. Popov
Matteo Malloci
Alexander Kolesnikov
Tom Duerig
V. Ferrari
ObjD
VLM
87
1,340
0
02 Nov 2018
Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization
Ramprasaath R. Selvaraju
Michael Cogswell
Abhishek Das
Ramakrishna Vedantam
Devi Parikh
Dhruv Batra
FAtt
209
19,796
0
07 Oct 2016
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
920
99,991
0
04 Sep 2014
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
244
43,290
0
01 May 2014
1