Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.11567
Cited By
AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines
22 October 2020
Yao Shi
Hui Bu
Xin Xu
Shaojing Zhang
Ming Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AISHELL-3: A Multi-speaker Mandarin TTS Corpus and the Baselines"
22 / 122 papers shown
Title
Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Zhu Li
Yuqing Zhang
Mengxi Nie
Ming Yan
Mengnan He
Ruixiong Zhang
Caixia Gong
13
3
0
15 Nov 2021
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Yihui Fu
Yun Liu
Jingdong Li
Dawei Luo
Shubo Lv
Yukai Jv
Lei Xie
27
49
0
11 Nov 2021
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Shengyuan Xu
Wenxiao Zhao
Jing Guo
22
12
0
01 Nov 2021
VoiceFixer: Toward General Speech Restoration with Neural Vocoder
Haohe Liu
Qiuqiang Kong
Qiao Tian
Yan Zhao
DeLiang Wang
Chuanzeng Huang
Yuxuan Wang
28
57
0
28 Sep 2021
Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning
Rui Li
dong Pu
Minnie Huang
Bill Huang
50
14
0
23 Sep 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
32
26
0
19 Sep 2021
Automatic recognition of suprasegmentals in speech
Jiahong Yuan
Neville Ryant
Xingyu Cai
Kenneth Ward Church
M. Liberman
23
9
0
02 Aug 2021
Sequence-to-Sequence Voice Reconstruction for Silent Speech in a Tonal Language
Huiyan Li
Haohong Lin
You Wang
Hengyang Wang
Ming Zhang
Han Gao
Qing Ai
Zhiyuan Luo
Guang Li
31
11
0
31 Jul 2021
Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection
Lin Zhang
Xin Wang
Erica Cooper
Junichi Yamagishi
12
28
0
29 Jul 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic Model
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
13
0
0
23 Jul 2021
Msdtron: a high-capability multi-speaker speech synthesis system for diverse data using characteristic information
Qinghua Wu
Quanbo Shen
Jian Luan
YuJun Wang
38
3
0
07 Jul 2021
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion
Daxin Tan
Liqun Deng
Y. Yeung
Xin Jiang
Xiao Chen
Tan Lee
29
37
0
04 Jul 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
18
352
0
29 Jun 2021
SRIB-LEAP submission to Far-field Multi-Channel Speech Enhancement Challenge for Video Conferencing
R. Raj
Rohit Kumar
M. Jayesh
Anurenjan Purushothaman
Sriram Ganapathy
Basha Shaik
16
2
0
24 Jun 2021
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
Chenye Cui
Yi Ren
Jinglin Liu
Feiyang Chen
Rongjie Huang
Ming Lei
Zhou Zhao
24
35
0
17 Jun 2021
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss
Yaogen Yang
Haozhe Zhang
Xiaoyi Qin
Shanshan Liang
Huahua Cui
Mingyang Xu
Ming Li
53
4
0
22 Apr 2021
Review of end-to-end speech synthesis technology based on deep learning
Zhaoxi Mu
Xinyu Yang
Yizhuo Dong
AuLLM
ALM
26
24
0
20 Apr 2021
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset
Saida Mussakhojayeva
Aigerim Janaliyeva
A. Mirzakhmetov
Yerbolat Khassanov
H. A. Varol
9
14
0
17 Apr 2021
End-to-End Mandarin Tone Classification with Short Term Context Information
Jiyang Tang
Ming Li
41
7
0
12 Apr 2021
Half-Truth: A Partially Fake Audio Detection Dataset
Jiangyan Yi
Ye Bai
J. Tao
Haoxin Ma
Zhengkun Tian
Chenglong Wang
Tao Wang
Ruibo Fu
16
82
0
08 Apr 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
39
12
0
02 Apr 2021
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Z. Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
207
820
0
12 Jun 2018
Previous
1
2
3