Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.03498
Cited By
Improved Techniques for Training GANs
10 June 2016
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improved Techniques for Training GANs"
50 / 4,102 papers shown
Title
Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment
Kim Sung-Bin
Arda Senocak
Hyunwoo Ha
Tae-Hyun Oh
DiffM
219
0
0
09 Dec 2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
Ziwei Huang
Wanggui He
Quanyu Long
Yandi Wang
Haoyuan Li
...
Fangxun Shu
Long Chen
Hao Jiang
Leilei Gan
Leilei Gan
EGVM
518
4
0
05 Dec 2024
DiffuPT: Class Imbalance Mitigation for Glaucoma Detection via Diffusion Based Generation and Model Pretraining
Youssof Nawar
Nouran Soliman
Moustafa Wassel
Mohamed ElHabebe
Noha Adly
Marwan Torki
Ahmed Elmassry
Islam Ahmed
MedIm
115
0
0
04 Dec 2024
BOTracle: A framework for Discriminating Bots and Humans
Jan Kadel
August See
Ritwik Sinha
Mathias Fischer
92
0
0
03 Dec 2024
AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation
Zhihang Lin
Mingbao Lin
Wengyi Zhan
Rongrong Ji
138
0
0
03 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
135
20
0
02 Dec 2024
IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models
Khaled Abud
Sergey Lavrushkin
Alexey Kirillov
D. Vatolin
216
0
0
02 Dec 2024
BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis
Seong-Eun Hong
Soobin Lim
Juyeong Hwang
Minwook Chang
Hyeongyeop Kang
180
1
0
28 Nov 2024
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization
Rui Xie
Tianchen Zhao
Zhihang Yuan
Rui Wan
Wenxi Gao
Zhenhua Zhu
Xuefei Ning
Yu Wang
VGen
MQ
92
4
0
26 Nov 2024
Factorized Visual Tokenization and Generation
Zechen Bai
Jianxiong Gao
Ziteng Gao
Pichao Wang
Zheng Zhang
Tong He
Mike Zheng Shou
132
3
0
25 Nov 2024
Synthesising Handwritten Music with GANs: A Comprehensive Evaluation of CycleWGAN, ProGAN, and DCGAN
Elona Shatri
Kalikidhar Palavala
George Fazekas
152
0
0
25 Nov 2024
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guangtao Zhai
EGVM
250
1
0
25 Nov 2024
ExAL: An Exploration Enhanced Adversarial Learning Algorithm
A Vinil
Aneesh Sreevallabh Chivukula
Pranav Chintareddy
AAML
78
0
0
24 Nov 2024
Comparative Analysis of Diffusion Generative Models in Computational Pathology
Denisha Thakkar
Vincent Quoc-Huy Trinh
Sonal Varma
Samira Ebrahimi Kahou
Hassan Rivaz
Mahdi S. Hosseini
MedIm
121
1
0
24 Nov 2024
PanoLlama: Generating Endless and Coherent Panoramas with Next-Token-Prediction LLMs
Teng Zhou
Xiaoyu Zhang
Yongchuan Tang
MLLM
DiffM
202
1
0
24 Nov 2024
Hierarchical Cross-Attention Network for Virtual Try-On
Hao Tang
Bin Ren
Pingping Wu
N. Sebe
103
0
0
23 Nov 2024
Automatic Evaluation for Text-to-image Generation: Task-decomposed Framework, Distilled Training, and Meta-evaluation Benchmark
Rong-Cheng Tu
Zi-Ao Ma
Tian Lan
Yuehao Zhao
Heyan Huang
Xian-Ling Mao
MLLM
VLM
EGVM
178
4
0
23 Nov 2024
GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts
Junwen He
Yifan Wang
Lijun Wang
Huchuan Lu
Jun-Yan He
Chong Li
Hanyuan Chen
Jin-Peng Lan
Bin Luo
Yifeng Geng
117
1
0
18 Nov 2024
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
Jintao Zhang
Haofeng Huang
Pengle Zhang
Jia Wei
Jun-Jie Zhu
Jianfei Chen
MQ
VLM
181
2
0
17 Nov 2024
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
Shitong Shao
Zikai Zhou
Tian Ye
Lichen Bai
Zhiqiang Xu
Zeke Xie
DiffM
121
0
0
16 Nov 2024
Physics Informed Distillation for Diffusion Models
Joshua Tian Jin Tee
Kang Zhang
Hee Suk Yoon
Dhananjaya N. Gowda
Chanwoo Kim
Chang D. Yoo
DiffM
98
6
0
13 Nov 2024
World Models: The Safety Perspective
Zifan Zeng
Chongzhe Zhang
Feng Liu
Joseph Sifakis
Qunli Zhang
Shiming Liu
Peng Wang
KELM
LLMAG
78
2
0
12 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
175
0
0
12 Nov 2024
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models
Yoad Tewel
Rinon Gal
Dvir Samuel
Yuval Atzmon
Lior Wolf
Gal Chechik
VLM
118
9
0
11 Nov 2024
A Modular Conditional Diffusion Framework for Image Reconstruction
Magauiya Zhussip
Iaroslav Koshelev
Stamatis Lefkimmiatis
DiffM
52
0
0
08 Nov 2024
Improving image synthesis with diffusion-negative sampling
Alakh Desai
Nuno Vasconcelos
DiffM
42
2
0
08 Nov 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
Hao Fei
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
191
14
0
08 Nov 2024
Controlling Human Shape and Pose in Text-to-Image Diffusion Models via Domain Adaptation
Benito Buchheim
M. Reimann
Jürgen Döllner
55
0
0
07 Nov 2024
Image Understanding Makes for A Good Tokenizer for Image Generation
Luting Wang
Yang Zhao
Zijian Zhang
Jiashi Feng
Si Liu
Bingyi Kang
VLM
89
4
0
07 Nov 2024
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Yue Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
100
3
0
05 Nov 2024
Conditional Vendi Score: An Information-Theoretic Approach to Diversity Evaluation of Prompt-based Generative Models
Mohammad Jalali
Azim Ospanov
Amin Gohari
Farzan Farnia
EGVM
96
4
0
05 Nov 2024
Constant Acceleration Flow
Dogyun Park
Sojin Lee
S. Kim
Taehoon Lee
Youngjoon Hong
Hyunwoo J. Kim
99
3
0
01 Nov 2024
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Penghui Ruan
Pichao Wang
Divya Saxena
Jiannong Cao
Yuhui Shi
DiffM
VGen
102
0
0
31 Oct 2024
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Weicai Ye
Chenhao Ji
Zheng Chen
Junyao Gao
Xiaoshui Huang
Song-Hai Zhang
Wanli Ouyang
Tong He
Cairong Zhao
Guofeng Zhang
94
11
0
31 Oct 2024
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
Xinwang Chen
Ning Liu
Yinlin Zhu
Feifei Feng
Jian Tang
47
2
0
31 Oct 2024
FlowDCN: Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
Shuai Wang
Zexian Li
Tianhui Song
Xubin Li
Tiezheng Ge
Bo Zheng
Liwen Wang
105
3
0
30 Oct 2024
Embedding Watermarks in Diffusion Process for Model Intellectual Property Protection
Jijia Yang
Sen Peng
Xiaohua Jia
WIGM
101
0
0
29 Oct 2024
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin
Ruiyu Wang
Kuangyi Chen
Florian T. Pokorny
76
0
0
29 Oct 2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
408
3
0
29 Oct 2024
BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference
Changwoo Lee
Soo Min Kwon
Qing Qu
Hun-Seok Kim
90
0
0
28 Oct 2024
Reconstructing dynamics from sparse observations with no training on target system
Zheng-Meng Zhai
Jun-Yin Huang
Benjamin D. Stern
Y. Lai
78
1
0
28 Oct 2024
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
K R Prajwal
Bowen Shi
Matthew Lee
Apoorv Vyas
Andros Tjandra
...
Baishan Guo
Huiyu Wang
Triantafyllos Afouras
David Kant
Wei-Ning Hsu
80
5
0
27 Oct 2024
MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation
Jialin Luo
Yuanzhi Wang
Ziqi Gu
Yide Qiu
Shuaizhen Yao
Fuyun Wang
Chunyan Xu
Wenhua Zhang
Dan Wang
Zhen Cui
DiffM
52
2
0
26 Oct 2024
FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling
Zhengqiang Zhang
Ruihuang Li
Lei Zhang
104
3
0
24 Oct 2024
Optical Generative Models
Shiqi Chen
Yuhang Li
Hanlong Chen
Aydogan Ozcan
VLM
59
1
0
23 Oct 2024
Deep Generative Models for 3D Medical Image Synthesis
Paul Friedrich
Yannik Frisch
P. Cattin
3DV
MedIm
92
4
0
23 Oct 2024
Offline Evaluation of Set-Based Text-to-Image Generation
Negar Arabzadeh
Fernando Diaz
Junfeng He
EGVM
70
0
0
22 Oct 2024
One-Step Diffusion Distillation through Score Implicit Matching
Weijian Luo
Zemin Huang
Zhengyang Geng
J. Zico Kolter
Guo-Jun Qi
DiffM
92
21
0
22 Oct 2024
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
80
3
0
21 Oct 2024
Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models
Giannis Daras
Weili Nie
Karsten Kreis
A. Dimakis
Morteza Mardani
Nikola B. Kovachki
Arash Vahdat
DiffM
113
10
0
21 Oct 2024
Previous
1
2
3
...
5
6
7
...
81
82
83
Next