arXiv: 2204.03067
ByT5 model for massively multilingual grapheme-to-phoneme conversion
6 April 2022
Jian Zhu
Cong Zhang
David Jurgens
Papers citing "ByT5 model for massively multilingual grapheme-to-phoneme conversion"
20 / 20 papers shown
Title
Cross-Lingual IPA Contrastive Learning for Zero-Shot NER
Jimin Sohn
David R. Mortensen
49
0
0
10 Mar 2025
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Shri Kiran Srinivasan
Mohammed Irfan Kurpath
Sahal Shaji Mullappilly
Jean Lahoud
Fahad A Khan
Rao Muhammad Anwer
Salman Khan
Hisham Cholakkal
AuLLM
165
0
0
06 Mar 2025
PolyIPA -- Multilingual Phoneme-to-Grapheme Conversion Model
Davor Lauc
74
0
0
12 Dec 2024
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer
Davor Lauc
Attapol Rutherford
Weerin Wongwarawipatr
70
0
0
05 Dec 2024
Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models
Dongrui Han
Mingyu Cui
Jiawen Kang
Xixin Wu
Xunying Liu
Helen Meng
32
1
0
12 Nov 2024
Efficiently Identifying Low-Quality Language Subsets in Multilingual Datasets: A Case Study on a Large-Scale Multilingual Audio Dataset
Farhan Samir
Emily P. Ahn
Shreya Prakash
Márton Soskuthy
Vered Shwartz
Jian Zhu
26
0
0
05 Oct 2024
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning
Siqi Sun
Korin Richmond
40
0
0
15 Sep 2024
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study
Mahta Fetrat Qharabagh
Zahra Dehghanian
Hamid R. Rabiee
33
2
0
13 Sep 2024
Exploring the Benefits of Tokenization of Discrete Acoustic Units
Avihu Dekel
Raul Fernandez
46
2
0
08 Jun 2024
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
Abhinav Garg
Jiyeon Kim
Sushil Khyalia
Chanwoo Kim
Dhananjaya N. Gowda
25
2
0
19 Jan 2024
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language
Jian Zhu
Changbing Yang
Farhan Samir
Jahurul Islam
32
4
0
14 Nov 2023
SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation
Matthias Lindemann
Alexander Koller
Ivan Titov
AI4CE
19
1
0
01 Oct 2023
Speak While You Think: Streaming Speech Synthesis During Text Generation
Avihu Dekel
Slava Shechtman
Raul Fernandez
David Haws
Zvi Kons
R. Hoory
21
8
0
20 Sep 2023
Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction
Eunseop Yoon
Hee Suk Yoon
Dhananjaya N. Gowda
Soohwan Eom
Daehyeok Kim
John Harvill
Heting Gao
M. Hasegawa-Johnson
Chanwoo Kim
Chang D. Yoo
32
1
0
16 Aug 2023
Multilingual context-based pronunciation learning for Text-to-Speech
Giulia Comini
M. Ribeiro
Fan Yang
Heereen Shim
Jaime Lorenzo-Trueba
52
7
0
31 Jul 2023
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
M. Ribeiro
Giulia Comini
Jaime Lorenzo-Trueba
36
4
0
31 Jul 2023
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech
L. T. Nguyen
Thinh-Le-Gia Pham
Dat Quoc Nguyen
26
13
0
31 May 2023
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners
Jocelyn Huang
Evelina Bakhturina
Oktai Tatanov
16
0
0
28 Feb 2023
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
54
24
0
20 Dec 2022
Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion
Kaisheng Yao
Geoffrey Zweig
45
163
0
31 May 2015