Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.14227
Cited By
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models
29 December 2022
Yinghao Aaron Li
Cong Han
N. Mesgarani
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models"
18 / 18 papers shown
Title
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training
Huaizhen Tang
Xulong Zhang
Jianzong Wang
Ning Cheng
Zhen Zeng
Edward Xiao
Jing Xiao
53
20
0
08 Aug 2022
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis
Yinghao Aaron Li
Cong Han
N. Mesgarani
91
39
0
30 May 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
217
407
0
04 Dec 2021
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
Yinghao Aaron Li
A. Zare
N. Mesgarani
73
101
0
21 Jul 2021
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
Jheng-hao Lin
Yist Y. Lin
C. Chien
Hung-yi Lee
106
56
0
07 Apr 2021
PPG-based singing voice conversion with adversarial representation learning
Zhonghao Li
Benlai Tang
Xiang Yin
Yuan Wan
Linjia Xu
Chen Shen
Zejun Ma
41
37
0
28 Oct 2020
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Jungil Kong
Jaehyeon Kim
Jaekyoung Bae
166
1,931
0
12 Oct 2020
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Jaehyeon Kim
Sungwon Kim
Jungil Kong
Sungroh Yoon
81
491
0
22 May 2020
Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge
Benjamin van Niekerk
Leanne Nortje
Herman Kamper
64
116
0
19 May 2020
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data
Seung-won Park
Doo-young Kim
Myun-chul Joe
57
42
0
07 May 2020
A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses
Malik Boudiaf
Jérôme Rony
Imtiaz Masud Ziko
Eric Granger
M. Pedersoli
Pablo Piantanida
Ismail Ben Ayed
SSL
77
159
0
19 Mar 2020
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens
Rafael Valle
Jason Chun Lok Li
R. Prenger
Bryan Catanzaro
60
149
0
26 Oct 2019
AUTOVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Kaizhi Qian
Yang Zhang
Shiyu Chang
Xuesong Yang
M. Hasegawa-Johnson
70
465
0
14 May 2019
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech
Heiga Zen
Viet Dang
R. Clark
Yu Zhang
Ron J. Weiss
Ye Jia
Zhiwen Chen
Yonghui Wu
96
951
0
05 Apr 2019
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
93
1,503
0
30 Mar 2018
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
Jonathan Shen
Ruoming Pang
Ron J. Weiss
M. Schuster
Navdeep Jaitly
...
Yuxuan Wang
RJ Skerry-Ryan
Rif A. Saurous
Yannis Agiomyrgiannakis
Yonghui Wu
77
2,697
0
16 Dec 2017
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
132
2,136
0
14 Nov 2017
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization
Xun Huang
Serge J. Belongie
OOD
177
4,359
0
20 Mar 2017
1