ByT5 model for massively multilingual grapheme-to-phoneme conversion

6 April 2022

Papers citing "ByT5 model for massively multilingual grapheme-to-phoneme conversion"

Cross-Lingual IPA Contrastive Learning for Zero-Shot NER Jimin Sohn David R. Mortensen 49 0 0 10 Mar 2025
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Shri Kiran Srinivasan Mohammed Irfan Kurpath Sahal Shaji Mullappilly Jean Lahoud Fahad A Khan Rao Muhammad Anwer Salman Khan Hisham Cholakkal AuLLM 165 0 0 06 Mar 2025
PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model Davor Lauc 74 0 0 12 Dec 2024
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer Davor Lauc Attapol Rutherford Weerin Wongwarawipatr 70 0 0 05 Dec 2024
Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models Dongrui Han Mingyu Cui Jiawen Kang Xixin Wu Xunying Liu Helen Meng 32 1 0 12 Nov 2024
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset Farhan Samir Emily P. Ahn Shreya Prakash Márton Soskuthy Vered Shwartz Jian Zhu 26 0 0 05 Oct 2024
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning Siqi Sun Korin Richmond 40 0 0 15 Sep 2024
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study Mahta Fetrat Qharabagh Zahra Dehghanian Hamid R. Rabiee 33 2 0 13 Sep 2024
Exploring the Benefits of Tokenization of Discrete Acoustic Units Avihu Dekel Raul Fernandez 46 2 0 08 Jun 2024
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech Abhinav Garg Jiyeon Kim Sushil Khyalia Chanwoo Kim Dhananjaya N. Gowda 25 2 0 19 Jan 2024
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language Jian Zhu Changbing Yang Farhan Samir Jahurul Islam 32 4 0 14 Nov 2023
SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation Matthias Lindemann Alexander Koller Ivan Titov AI4CE 19 1 0 01 Oct 2023
Speak While You Think: Streaming Speech Synthesis During Text Generation Avihu Dekel Slava Shechtman Raul Fernandez David Haws Zvi Kons R. Hoory 21 8 0 20 Sep 2023
Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction Eunseop Yoon Hee Suk Yoon Dhananjaya N. Gowda Soohwan Eom Daehyeok Kim John Harvill Heting Gao M. Hasegawa-Johnson Chanwoo Kim Chang D. Yoo 32 1 0 16 Aug 2023
Multilingual context-based pronunciation learning for Text-to-Speech Giulia Comini M. Ribeiro Fan Yang Heereen Shim Jaime Lorenzo-Trueba 52 7 0 31 Jul 2023
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings M. Ribeiro Giulia Comini Jaime Lorenzo-Trueba 36 4 0 31 Jul 2023
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech L. T. Nguyen Thinh-Le-Gia Pham Dat Quoc Nguyen 26 13 0 31 May 2023
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners Jocelyn Huang Evelina Bakhturina Oktai Tatanov 16 0 0 28 Feb 2023
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models Jonas Belouadi Steffen Eger 54 24 0 20 Dec 2022
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion Kaisheng Yao Geoffrey Zweig 45 163 0 31 May 2015