Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.01541
Cited By
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model
5 December 2017
Yue Liu
Tara N. Sainath
K. Sim
M. Bacchiani
Eugene Weinstein
Patrick Nguyen
Zhiwen Chen
Yan-Qing Wu
Kanishka Rao
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model"
28 / 28 papers shown
Title
State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition
Aref Farhadipour
Homayoon Beigi
Volker Dellwo
H. Veisi
Mamba
24
0
0
20 Jun 2025
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in Sorani Kurdish
Sana Isam
Hossein Hassani
30
0
0
29 Mar 2024
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
117
29
0
13 Sep 2023
Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Xuefei Wang
Yanhua Long
Yijie Li
Haoran Wei
62
4
0
20 Jun 2023
Learning Cross-lingual Visual Speech Representations
Andreas Zinonos
A. Haliassos
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
48
8
0
14 Mar 2023
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information
Fenglin Ding
Genshun Wan
Pengcheng Li
Jia Pan
Cong Liu
SSL
83
1
0
07 Dec 2022
SQuId: Measuring Speech Naturalness in Many Languages
Thibault Sellam
Ankur Bapna
Joshua Camp
Diana Mackinnon
Ankur P. Parikh
Jason Riesa
83
18
0
12 Oct 2022
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Sanghyun Yoo
Inchul Song
Yoshua Bengio
70
28
0
06 May 2022
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition
Xun Gong
Y. Qian
Houjun Huang
Yanmin Qian
81
46
0
21 Apr 2022
Transducer-based language embedding for spoken language identification
Peng Shen
Xugang Lu
Hisashi Kawai
82
6
0
08 Apr 2022
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
170
378
0
02 Nov 2021
A Configurable Multilingual Model is All You Need to Recognize All Languages
Long Zhou
Jinyu Li
Eric Sun
Shujie Liu
136
42
0
13 Jul 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Yue Liu
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Wenjie Huang
Min Ma
Junwen Bai
CLL
139
77
0
30 Apr 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Jay Mahadeokar
Yangyang Shi
Yuan Shangguan
Chunyang Wu
Alex Xiao
Hang Su
Duc Le
Ozlem Kalinli
Christian Fuegen
M. Seltzer
55
3
0
06 Apr 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
76
38
0
27 Jan 2021
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Hu Hu
Xuesong Yang
Zeynab Raeesy
Jinxi Guo
Gokce Keskin
Harish Arsikere
Ariya Rastrow
A. Stolcke
Roland Maas
64
30
0
14 Dec 2020
Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition
Genta Indra Winata
Guangsen Wang
Caiming Xiong
Guosheng Lin
VLM
62
50
0
03 Dec 2020
Cascaded encoders for unifying streaming and non-streaming ASR
A. Narayanan
Tara N. Sainath
Ruoming Pang
Jiahui Yu
Chung-Cheng Chiu
Rohit Prabhavalkar
Ehsan Variani
Trevor Strohman
AuLLM
128
86
0
27 Oct 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondˇrej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
122
78
0
14 Aug 2020
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
87
143
0
06 Jul 2020
Language-agnostic Multilingual Modeling
A. Datta
Bhuvana Ramabhadran
Jesse Emond
Anjuli Kannan
Brian Roark
62
34
0
20 Apr 2020
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
91
99
0
22 Oct 2019
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model
Anjuli Kannan
A. Datta
Tara N. Sainath
Eugene Weinstein
Bhuvana Ramabhadran
Yonghui Wu
Ankur Bapna
Zhiwen Chen
Seungjin Lee
AuLLM
71
174
0
11 Sep 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
43
17
0
02 Aug 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhiwen Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
107
211
0
21 Feb 2019
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Sanjeev Khudanpur
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
91
31
0
10 Dec 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Yue Liu
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
AuLLM
79
131
0
22 Nov 2018
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models
Tara N. Sainath
Rohit Prabhavalkar
Shankar Kumar
Seungjin Lee
Anjuli Kannan
...
Patrick Nguyen
Yue Liu
Yonghui Wu
Zhiwen Chen
Chung-Cheng Chiu
71
54
0
05 Dec 2017
1