1606.03498
Cited By
Improved Techniques for Training GANs
10 June 2016
Tim Salimans
Ian Goodfellow
Wojciech Zaremba
Vicki Cheung
Alec Radford
Xi Chen
GAN
Papers citing "Improved Techniques for Training GANs"
50 / 4,102 papers shown
Title
Speaker-Independent Acoustic-to-Articulatory Inversion through Multi-Channel Attention Discriminator
Woo-Jin Chung
Hong-Goo Kang
57
2
0
25 Jun 2024
Do As I Do: Pose Guided Human Motion Copy
Sifan Wu
Zhenguang Liu
Beibei Zhang
Roger Zimmermann
Zhongjie Ba
Xiaosong Zhang
Kui Ren
77
8
0
24 Jun 2024
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
Zhiyu Tan
Xiaomeng Yang
Luozheng Qin
Mengping Yang
Cheng Zhang
Hao Li
106
8
0
24 Jun 2024
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi
Wenbo Li
Yuechen Zhang
Jingwen He
Biao Gong
Yinqiang Zheng
114
13
0
24 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
182
39
0
24 Jun 2024
Evaluation and Comparison of Emotionally Evocative Image Augmentation Methods
Jan Ignatowicz
K. Kutt
Grzegorz J. Nalepa
GAN
39
1
0
23 Jun 2024
X-ray2CTPA: Generating 3D CTPA scans from 2D X-ray conditioning
Noa Cahan
Eyal Klang
Galit Aviram
Y. Barash
Eli Konen
Raja Giryes
H. Greenspan
MedIm
75
0
0
23 Jun 2024
MetaGreen: Meta-Learning Inspired Transformer Selection for Green Semantic Communication
Shubhabrata Mukherjee
Cory Beard
Sejun Song
68
0
0
22 Jun 2024
Fingerprint Membership and Identity Inference Against Generative Adversarial Networks
Saverio Cavasin
Daniele Mari
Simone Milani
Mauro Conti
AAML
114
3
0
21 Jun 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Xuan He
Dongfu Jiang
Ge Zhang
Max Ku
Achint Soni
...
Yaswanth Narsupalli
Rongqi Fan
Zhiheng Lyu
Yuchen Lin
Wenhu Chen
EGVM
VGen
ALM
136
56
0
21 Jun 2024
Generative Topological Networks
Alona Levy-Jurgenson
Z. Yakhini
140
0
0
21 Jun 2024
Behaviour Distillation
Andrei Lupu
Chris Xiaoxuan Lu
Jarek Liesen
R. T. Lange
Jakob Foerster
DD
101
4
0
21 Jun 2024
Holistic Evaluation for Interleaved Text-and-Image Generation
Minqian Liu
Zhiyang Xu
Zihao Lin
Trevor Ashby
Joy Rimchala
Jiaxin Zhang
Lifu Huang
EGVM
112
11
0
20 Jun 2024
LayerMatch: Do Pseudo-labels Benefit All Layers?
Chaoqi Liang
Guanglei Yang
Lifeng Qiao
Zitong Huang
Hongliang Yan
Yunchao Wei
W. Zuo
90
0
0
20 Jun 2024
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation
Baiqi Li
Zhiqiu Lin
Deepak Pathak
Jiayao Li
Yixin Fei
...
Tiffany Ling
Xide Xia
Pengchuan Zhang
Graham Neubig
Deva Ramanan
EGVM
138
39
0
19 Jun 2024
Improving Text-To-Audio Models with Synthetic Captions
Zhifeng Kong
Sang-gil Lee
Deepanway Ghosal
Navonil Majumder
Ambuj Mehrish
Rafael Valle
Soujanya Poria
Bryan Catanzaro
110
13
0
18 Jun 2024
Autoregressive Image Generation without Vector Quantization
Tianhong Li
Yonglong Tian
He Li
Mingyang Deng
Kaiming He
DiffM
162
238
0
17 Jun 2024
ChildDiffusion: Unlocking the Potential of Generative AI and Controllable Augmentations for Child Facial Data using Stable Diffusion and Large Language Models
Muhammad Ali Farooq
Wang Yao
Peter Corcoran
69
1
0
17 Jun 2024
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Xiyang Wu
Tianrui Guan
Dianqi Li
Shuaiyi Huang
Xiaoyu Liu
...
Abhinav Shrivastava
Furong Huang
Jordan L. Boyd-Graber
Dinesh Manocha
HILM
LRM
VLM
MLLM
111
16
0
16 Jun 2024
IG2: Integrated Gradient on Iterative Gradient Path for Feature Attribution
Yue Zhuo
Zhiqiang Ge
57
9
0
16 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
99
2
0
15 Jun 2024
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation
Wei Chen
Lin Li
Yongqi Yang
Bin Wen
Fan Yang
Tingting Gao
Yu Wu
Long Chen
VLM
VGen
127
11
0
15 Jun 2024
ControlVAR: Exploring Controllable Visual Autoregressive Modeling
Xiang Li
Kai Qiu
Hao Chen
Jason Kuen
Zhe Lin
Rita Singh
Bhiksha Raj
DiffM
93
27
0
14 Jun 2024
Reinforced Decoder: Towards Training Recurrent Neural Networks for Time Series Forecasting
Qi Sima
Xinze Zhang
Yukun Bao
Siyue Yang
Liang Shen
AI4TS
80
1
0
14 Jun 2024
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Qihao Liu
Zhanpeng Zeng
Ju He
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
110
22
0
13 Jun 2024
Beyond the Frontier: Predicting Unseen Walls from Occupancy Grids by Learning from Floor Plans
Ludvig Ericson
Patric Jensfelt
141
7
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
145
2
0
13 Jun 2024
DiTFastAttn: Attention Compression for Diffusion Transformer Models
Zhihang Yuan
Pu Lu
Hanling Zhang
Xuefei Ning
Linfeng Zhang
Tianchen Zhao
Shengen Yan
Guohao Dai
Yu Wang
113
33
0
12 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
95
11
0
12 Jun 2024
Image and Video Tokenization with Binary Spherical Quantization
Yue Zhao
Yuanjun Xiong
Philipp Krahenbuhl
94
24
0
11 Jun 2024
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Xingyu Fu
Muyu He
Yujie Lu
William Yang Wang
Dan Roth
EGVM
LRM
105
21
0
11 Jun 2024
SAGIPS: A Scalable Asynchronous Generative Inverse Problem Solver
Daniel Lersch
Malachi Schram
Zhenyu Dai
Kishansingh Rajput
Xingfu Wu
Nobuo Sato
J. T. Childers
67
0
0
11 Jun 2024
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling
Denis Blessing
Xiaogang Jia
Johannes Esslinger
Francisco Vargas
Gerhard Neumann
135
27
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
101
2
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
134
301
0
10 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
117
15
0
10 Jun 2024
Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models
P. W. Shin
Jihyun Janice Ahn
Wenpeng Yin
Jack Sampson
Vijaykrishnan Narayanan
61
3
0
09 Jun 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Shiji Song
Yuan Yao
Gao Huang
84
17
0
08 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
Dongfu Jiang
Max Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
123
21
0
06 Jun 2024
Multistep Distillation of Diffusion Models via Moment Matching
Tim Salimans
Thomas Mensink
Jonathan Heek
Emiel Hoogeboom
DiffM
77
32
0
06 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
218
1
0
06 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Chenfanfu Jiang
Ningyu Zhang
Kai-Wei Chang
Aditya Grover
EGVM
VGen
112
45
0
05 Jun 2024
When Spiking neural networks meet temporal attention image decoding and adaptive spiking neuron
Xuerui Qiu
Zheng Luan
Zhaorui Wang
Rui-jie Zhu
107
5
0
05 Jun 2024
Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following
Qiaomu Miao
Alexandros Graikos
Jingwei Zhang
Sounak Mondal
Minh Hoai
Dimitris Samaras
149
0
0
04 Jun 2024
Analyzing the Feature Extractor Networks for Face Image Synthesis
Erdi Sarıtaş
H. K. Ekenel
CVBM
EGVM
83
1
0
04 Jun 2024
ST-DPGAN: A Privacy-preserving Framework for Spatiotemporal Data Generation
Wei Shao
Rongyi Zhu
Cai Yang
Chandra Thapa
Muhammad Ejaz Ahmed
S. Çamtepe
Rui Zhang
DuYong Kim
Hamid Menouar
Flora D. Salim
69
0
0
04 Jun 2024
Rank-based No-reference Quality Assessment for Face Swapping
Xinghui Zhou
Wenbo Zhou
Tianyi Wei
Shen Chen
Taiping Yao
Shouhong Ding
Weiming Zhang
Nenghai Yu
CVBM
55
0
0
04 Jun 2024
L-MAGIC: Language Model Assisted Generation of Images with Coherence
Zhipeng Cai
Matthias Mueller
R. Birkl
Diana Wofk
Shaoyen Tseng
JunDa Cheng
Gabriela Ben-Melech Stan
Vasudev Lal
Michael Paulitsch
DiffM
MLLM
80
6
0
03 Jun 2024
Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Xinyin Ma
Gongfan Fang
Michael Bi Mi
Xinchao Wang
114
44
0
03 Jun 2024
Segmentation-Free Guidance for Text-to-Image Diffusion Models
K. Azarian
Debasmit Das
Qiqi Hou
Fatih Porikli
VLM
79
0
0
03 Jun 2024