Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.11293
Cited By
Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks
30 November 2017
Takuhiro Kaneko
Hirokazu Kameoka
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks"
41 / 41 papers shown
Title
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units
Gallil Maimon
Yossi Adi
34
13
0
19 Dec 2022
EmoFake: An Initial Dataset for Emotion Fake Audio Detection
Yan Zhao
Jiangyan Yi
J. Tao
Chenglong Wang
Xiaohui Zhang
Yongfeng Dong
24
10
0
10 Nov 2022
A Diffeomorphic Flow-based Variational Framework for Multi-speaker Emotion Conversion
Ravi Shankar
Hsi-Wei Hsieh
N. Charon
A. Venkataraman
DRL
22
2
0
09 Nov 2022
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse
Xulong Zhang
Jianzong Wang
Ning Cheng
Jing Xiao
24
1
0
25 Oct 2022
Deepfake: Definitions, Performance Metrics and Standards, Datasets and Benchmarks, and a Meta-Review
Enes ALTUNCU
V. N. Franqueira
Shujun Li
28
11
0
21 Aug 2022
Dance Style Transfer with Cross-modal Transformer
Wenjie Yin
Hang Yin
Kim Baraka
Danica Kragic
Mårten Björkman
50
23
0
19 Aug 2022
End-to-End Voice Conversion with Information Perturbation
Qicong Xie
Shan Yang
Yinjiao Lei
Linfu Xie
Dan Su
43
7
0
15 Jun 2022
Speak Like a Dog: Human to Non-human creature Voice Conversion
Kohei Suzuki
Shoki Sakamoto
T. Taniguchi
Hirokazu Kameoka
27
2
0
09 Jun 2022
Noise-robust voice conversion with domain adversarial training
Hongqiang Du
Lei Xie
Haizhou Li
19
11
0
26 Jan 2022
Improving Code-switching Language Modeling with Artificially Generated Texts using Cycle-consistent Adversarial Networks
Chia-Yu Li
Ngoc Thang Vu
17
12
0
12 Dec 2021
CycleGAN with Dual Adversarial Loss for Bone-Conducted Speech Enhancement
Qing Pan
Teng Gao
Jian Zhou
Hua-bin Wang
L. Tao
H. Kwan
31
3
0
02 Nov 2021
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
Shijun Wang
Dimche Kostadinov
Damian Borth
29
11
0
27 Oct 2021
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs
J. Nistal
Stefan Lattner
G. Richard
26
8
0
03 Aug 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
Helen Meng
DRL
22
136
0
18 Jun 2021
Emotional Voice Conversion: Theory, Databases and ESD
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
33
168
0
31 May 2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training
Kun Zhou
Berrak Sisman
Haizhou Li
23
27
0
31 Mar 2021
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
38
57
0
25 Feb 2021
Optimizing voice conversion network with cycle consistency loss of speaker identity
Hongqiang Du
Xiaohai Tian
Lei Xie
Haizhou Li
21
17
0
17 Nov 2020
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech
Kun Zhou
Berrak Sisman
Haizhou Li
DRL
34
40
0
03 Nov 2020
AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization
Yen-Hao Chen
Da-Yi Wu
Tsung-Han Wu
Hung-yi Lee
34
107
0
31 Oct 2020
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Takuhiro Kaneko
Hirokazu Kameoka
Kou Tanaka
Nobukatsu Hojo
26
78
0
22 Oct 2020
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
33
1
0
20 Aug 2020
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
Berrak Sisman
Junichi Yamagishi
Simon King
Haizhou Li
BDL
41
318
0
09 Aug 2020
Multi-speaker Emotion Conversion via Latent Variable Regularization and a Chained Encoder-Decoder-Predictor Network
Ravi Shankar
Hsi-Wei Hsieh
N. Charon
A. Venkataraman
40
11
0
25 Jul 2020
Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations
Janek Ebbers
Michael Kuhlmann
Tobias Cord-Landwehr
Reinhold Haeb-Umbach
DRL
CoGe
SSL
31
4
0
26 May 2020
Towards Automatic Face-to-Face Translation
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
42
172
0
01 Mar 2020
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data
Kun Zhou
Berrak Sisman
Haizhou Li
27
66
0
01 Feb 2020
Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Hao Luo
Hsin-Te Hwang
Chen-Chou Lo
Yu-Huai Peng
Yu Tsao
Hsin-Min Wang
DRL
17
42
0
22 Jan 2020
A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications
Jie Gui
Zhenan Sun
Yonggang Wen
Dacheng Tao
Jieping Ye
EGVM
33
821
0
20 Jan 2020
Adversarially Trained Autoencoders for Parallel-Data-Free Voice Conversion
Orhan Ocal
Oguz H. Elibol
Gokce Keskin
Cory Stephenson
Anil Thomas
Kannan Ramchandran
26
10
0
09 May 2019
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion
Wen-Chin Huang
Yi-Chiao Wu
Chen-Chou Lo
Patrick Lumban Tobing
Tomoki Hayashi
Kazuhiro Kobayashi
T. Toda
Yu Tsao
H. Wang
DRL
19
13
0
02 May 2019
Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)
John Gideon
M. McInnis
E. Provost
13
107
0
28 Mar 2019
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang
Qiyang Li
Cem Anil
Xuchan Bao
Sageev Oore
Roger C. Grosse
30
97
0
22 Nov 2018
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms
Kou Tanaka
Hirokazu Kameoka
Takuhiro Kaneko
Nobukatsu Hojo
17
111
0
09 Nov 2018
Symbolic Music Genre Transfer with CycleGAN
Gino Brunner
Yuyi Wang
Roger Wattenhofer
Sumu Zhao
GAN
30
72
0
20 Sep 2018
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
DRL
16
59
0
13 Aug 2018
Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences
Cheng-chieh Yeh
Po-Chun Hsu
Ju-Chieh Chou
Hung-yi Lee
Lin-Shan Lee
33
23
0
09 Aug 2018
StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
Hirokazu Kameoka
Takuhiro Kaneko
Kou Tanaka
Nobukatsu Hojo
34
370
0
06 Jun 2018
A Universal Music Translation Network
Noam Mor
Lior Wolf
Adam Polyak
Yaniv Taigman
19
110
0
21 May 2018
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
Ehsan Hosseini-Asl
Yingbo Zhou
Caiming Xiong
R. Socher
16
54
0
27 Mar 2018
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network
Wenzhe Shi
Jose Caballero
Ferenc Huszár
J. Totz
Andrew P. Aitken
Rob Bishop
Daniel Rueckert
Zehan Wang
SupR
231
5,176
0
16 Sep 2016
1