ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.00354
  4. Cited By
JSUT corpus: free large-scale Japanese speech corpus for end-to-end
  speech synthesis

JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis

28 October 2017
Ryosuke Sonobe
Shinnosuke Takamichi
Hiroshi Saruwatari
    3DV
ArXiv (abs)PDFHTML

Papers citing "JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis"

23 / 73 papers shown
Title
vTTS: visual-text to speech
vTTS: visual-text to speech
Yoshifumi Nakano
Takaaki Saeki
Shinnosuke Takamichi
Katsuhito Sudoh
Hiroshi Saruwatari
61
4
0
28 Mar 2022
Polyphone disambiguation and accent prediction using pre-trained
  language models in Japanese TTS front-end
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end
Rem Hida
Masaki Hamada
Chie Kamada
E. Tsunoo
Toshiyuki Sekiya
Toshiyuki Kumakura
34
7
0
24 Jan 2022
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
WaveFake: A Data Set to Facilitate Audio Deepfake Detection
Joel Frank
Lea Schonherr
DiffM
204
131
0
04 Nov 2021
RefineGAN: Universally Generating Waveform Better than Ground Truth with
  Highly Accurate Pitch and Intensity Responses
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses
Shengyuan Xu
Wenxiao Zhao
Jing Guo
63
12
0
01 Nov 2021
ESPnet2-TTS: Extending the Edge of TTS Research
ESPnet2-TTS: Extending the Edge of TTS Research
Tomoki Hayashi
Ryuichi Yamamoto
Takenori Yoshimura
Peter Wu
Jiatong Shi
Takaaki Saeki
Yooncheol Ju
Yusuke Yasuda
Shinnosuke Takamichi
Shinji Watanabe
VLM
85
63
0
15 Oct 2021
Decoupling Speaker-Independent Emotions for Voice Conversion Via
  Source-Filter Networks
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks
Zhaojie Luo
Shoufeng Lin
Rui Liu
Jun Baba
Yuichiro Yoshikawa
H. Ishiguro
39
9
0
04 Oct 2021
End to End Bangla Speech Synthesis
End to End Bangla Speech Synthesis
Prithwiraj Bhattacharjee
Rajan Saha Raju
Arif Ahmad
M. S. Rahman
31
2
0
01 Aug 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
133
359
0
29 Jun 2021
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on
  Neural TTS Model and Phonetic Posteriorgram
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram
Shengkui Zhao
Hao Wang
Trung Hieu Nguyen
B. Ma
33
20
0
03 Feb 2021
Simultaneous Speech-to-Speech Translation System with Neural Incremental
  ASR, MT, and TTS
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS
Katsuhito Sudoh
Takatomo Kano
Sashi Novitasari
Tomoya Yanagita
S. Sakti
Satoshi Nakamura
36
13
0
10 Nov 2020
JSSS: free Japanese speech corpus for summarization and simplification
JSSS: free Japanese speech corpus for summarization and simplification
Shinnosuke Takamichi
Mamoru Komachi
Naoko Tanji
Hiroshi Saruwatari
22
1
0
05 Oct 2020
Accent Estimation of Japanese Words from Their Surfaces and
  Romanizations for Building Large Vocabulary Accent Dictionaries
Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent Dictionaries
Hideyuki Tachibana
Yotaro Katayama
29
5
0
21 Sep 2020
Unsupervised Learning For Sequence-to-sequence Text-to-speech For
  Low-resource Languages
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages
Haitong Zhang
Yue Lin
53
30
0
11 Aug 2020
DiscreTalk: Text-to-Speech as a Machine Translation Problem
DiscreTalk: Text-to-Speech as a Machine Translation Problem
Tomoki Hayashi
Shinji Watanabe
70
32
0
12 May 2020
Utterance-level Sequential Modeling For Deep Gaussian Process Based
  Speech Synthesis Using Simple Recurrent Unit
Utterance-level Sequential Modeling For Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit
Tomoki Koriyama
Hiroshi Saruwatari
BDL
64
5
0
22 Apr 2020
Lifter Training and Sub-band Modeling for Computationally Efficient and
  High-Quality Voice Conversion Using Spectral Differentials
Lifter Training and Sub-band Modeling for Computationally Efficient and High-Quality Voice Conversion Using Spectral Differentials
Takaaki Saeki
Yuki Saito
Shinnosuke Takamichi
Hiroshi Saruwatari
17
4
0
17 Feb 2020
Phase reconstruction based on recurrent phase unwrapping with deep
  neural networks
Phase reconstruction based on recurrent phase unwrapping with deep neural networks
Yoshiki Masuyama
Kohei Yatabe
Yuma Koizumi
Yasuhiro Oikawa
Noboru Harada
57
22
0
14 Feb 2020
A Dataset for measuring reading levels in India at scale
A Dataset for measuring reading levels in India at scale
Dolly Agarwal
J. Gupchup
Nishant Baghel
23
1
0
27 Nov 2019
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source
  End-to-End Text-to-Speech Toolkit
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
Tomoki Hayashi
Ryuichi Yamamoto
Katsuki Inoue
Takenori Yoshimura
Shinji Watanabe
Tomoki Toda
K. Takeda
Yu Zhang
Xu Tan
VLM
93
205
0
24 Oct 2019
A Comparative Study on Transformer vs RNN in Speech Applications
A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
94
722
0
13 Sep 2019
JVS corpus: free Japanese multi-speaker voice corpus
JVS corpus: free Japanese multi-speaker voice corpus
Shinnosuke Takamichi
Kentaro Mitsui
Yuki Saito
Tomoki Koriyama
Naoko Tanji
Hiroshi Saruwatari
69
72
0
17 Aug 2019
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
Kyubyong Park
Thomas Mulc
70
101
0
27 Mar 2019
Phase reconstruction from amplitude spectrograms based on
  von-Mises-distribution deep neural network
Phase reconstruction from amplitude spectrograms based on von-Mises-distribution deep neural network
Shinnosuke Takamichi
Yuki Saito
Norihiro Takamune
Daichi Kitamura
Hiroshi Saruwatari
40
44
0
10 Jul 2018
Previous
12