2309.02836
Cited By
v1
v2 (latest)
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
6 September 2023
Takashi Shibuya
Yuhta Takida
Yuki Mitsufuji
Github (204★)
Papers citing
"BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network"
22 / 22 papers shown
Title
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
Yuhta Takida
Masaaki Imaizumi
Takashi Shibuya
Chieh-Hsin Lai
Toshimitsu Uesaka
Naoki Murata
Yuki Mitsufuji
GAN
91
13
0
30 Jan 2023
BigVGAN: A Universal Neural Vocoder with Large-Scale Training
Sang-gil Lee
Wei Ping
Boris Ginsburg
Bryan Catanzaro
Sungroh Yoon
112
254
0
09 Jun 2022
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
Axel Sauer
Katja Schwarz
Andreas Geiger
270
512
0
01 Feb 2022
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Zhisheng Xiao
Karsten Kreis
Arash Vahdat
DiffM
102
560
0
15 Dec 2021
Chunked Autoregressive GAN for Conditional Waveform Synthesis
Max Morrison
Rithesh Kumar
Kundan Kumar
Prem Seetharaman
Aaron Courville
Yoshua Bengio
GAN
123
72
0
19 Oct 2021
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
179
1,952
0
12 Oct 2020
DiffWave: A Versatile Diffusion Model for Audio Synthesis
Zhifeng Kong
Wei Ping
Jiaji Huang
Kexin Zhao
Bryan Catanzaro
DiffM
BDL
169
1,468
0
21 Sep 2020
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren
Chenxu Hu
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
105
1,411
0
08 Jun 2020
WaveFlow: A Compact Flow-based Model for Raw Audio
Wei Ping
Kainan Peng
Kexin Zhao
Z. Song
87
117
0
03 Dec 2019
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Ryuichi Yamamoto
Eunwoo Song
Jae-Min Kim
62
820
0
25 Oct 2019
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen
Viet Dang
R. Clark
Yu Zhang
Ron J. Weiss
Ye Jia
Zhifeng Chen
Yonghui Wu
104
959
0
05 Apr 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
628
10,595
0
12 Dec 2018
WaveGlow: A Flow-based Generative Network for Speech Synthesis
R. Prenger
Rafael Valle
Bryan Catanzaro
155
1,036
0
31 Oct 2018
Efficient Neural Audio Synthesis
Nal Kalchbrenner
Erich Elsen
Karen Simonyan
Seb Noury
Norman Casagrande
Edward Lockhart
Florian Stimberg
Aaron van den Oord
Sander Dieleman
Koray Kavukcuoglu
94
871
0
23 Feb 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
85
2,705
0
16 Dec 2017
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Aaron van den Oord
Yazhe Li
Igor Babuschkin
Karen Simonyan
Oriol Vinyals
...
Alex Graves
Helen King
T. Walters
Dan Belov
Demis Hassabis
233
859
0
28 Nov 2017
Geometric GAN
Jae Hyun Lim
J. C. Ye
GAN
66
518
0
08 May 2017
SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual
Antonio Bonafonte
Joan Serrà
GAN
94
1,148
0
28 Mar 2017
Wasserstein GAN
Martín Arjovsky
Soumith Chintala
Léon Bottou
GAN
183
4,829
0
26 Jan 2017
Least Squares Generative Adversarial Networks
Xudong Mao
Qing Li
Haoran Xie
Raymond Y. K. Lau
Zhen Wang
Stephen Paul Smolley
GAN
343
4,580
0
13 Nov 2016
WaveNet: A Generative Model for Raw Audio
Aaron van den Oord
Sander Dieleman
Heiga Zen
Karen Simonyan
Oriol Vinyals
Alex Graves
Nal Kalchbrenner
A. Senior
Koray Kavukcuoglu
DiffM
406
7,425
0
12 Sep 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
Sebastian Nowozin
Botond Cseke
Ryota Tomioka
GAN
164
1,659
0
02 Jun 2016