Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model

5 December 2017
Yue Liu
Tara N. Sainath
K. Sim
M. Bacchiani
Eugene Weinstein
Patrick Nguyen
Zhiwen Chen
Yan-Qing Wu
Kanishka Rao
arXiv:1712.01541

Papers citing "Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model"

28 / 28 papers shown
State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition
Aref Farhadipour
Homayoon Beigi
Volker Dellwo
H. Veisi
Mamba
24
0
0
20 Jun 2025
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in Sorani Kurdish
Sana Isam
Hossein Hassani
30
0
0
29 Mar 2024
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
117
29
0
13 Sep 2023
Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Xuefei Wang
Yanhua Long
Yijie Li
Haoran Wei
62
4
0
20 Jun 2023
Learning Cross-lingual Visual Speech Representations
Andreas Zinonos
A. Haliassos
Pingchuan Ma
Stavros Petridis
Maja Pantic
SSL
48
8
0
14 Mar 2023
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information
Fenglin Ding
Genshun Wan
Pengcheng Li
Jia Pan
Cong Liu
SSL
83
1
0
07 Dec 2022
SQuId: Measuring Speech Naturalness in Many Languages
Thibault Sellam
Ankur Bapna
Joshua Camp
Diana Mackinnon
Ankur P. Parikh
Jason Riesa
83
18
0
12 Oct 2022
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Sanghyun Yoo
Inchul Song
Yoshua Bengio
70
28
0
06 May 2022
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition
Xun Gong
Y. Qian
Houjun Huang
Yanmin Qian
81
46
0
21 Apr 2022
Transducer-based language embedding for spoken language identification
Peng Shen
Xugang Lu
Hisashi Kawai
82
6
0
08 Apr 2022
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
170
378
0
02 Nov 2021
A Configurable Multilingual Model is All You Need to Recognize All Languages
Long Zhou
Jinyu Li
Eric Sun
Shujie Liu
136
42
0
13 Jul 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Yue Liu
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Wenjie Huang
Min Ma
Junwen Bai
CLL
139
77
0
30 Apr 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios
Jay Mahadeokar
Yangyang Shi
Yuan Shangguan
Chunyang Wu
Alex Xiao
Hang Su
Duc Le
Ozlem Kalinli
Christian Fuegen
M. Seltzer
55
3
0
06 Apr 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
76
38
0
27 Jan 2021
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling
Hu Hu
Xuesong Yang
Zeynab Raeesy
Jinxi Guo
Gokce Keskin
Harish Arsikere
Ariya Rastrow
A. Stolcke
Roland Maas
64
30
0
14 Dec 2020
Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition
Genta Indra Winata
Guangsen Wang
Caiming Xiong
Guosheng Lin
VLM
62
50
0
03 Dec 2020
Cascaded encoders for unifying streaming and non-streaming ASR
A. Narayanan
Tara N. Sainath
Ruoming Pang
Jiahui Yu
Chung-Cheng Chiu
Rohit Prabhavalkar
Ehsan Variani
Trevor Strohman
AuLLM
128
86
0
27 Oct 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondřej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
122
78
0
14 Aug 2020
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap
Anuroop Sriram
Paden Tomasello
Awni Y. Hannun
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
87
143
0
06 Jul 2020
Language-agnostic Multilingual Modeling
A. Datta
Bhuvana Ramabhadran
Jesse Emond
Anjuli Kannan
Brian Roark
62
34
0
20 Apr 2020
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
91
99
0
22 Oct 2019
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model
Anjuli Kannan
A. Datta
Tara N. Sainath
Eugene Weinstein
Bhuvana Ramabhadran
Yonghui Wu
Ankur Bapna
Zhiwen Chen
Seungjin Lee
AuLLM
71
174
0
11 Sep 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
43
17
0
02 Aug 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhiwen Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
107
211
0
21 Feb 2019
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Sanjeev Khudanpur
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
91
31
0
10 Dec 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Yue Liu
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
AuLLM
79
131
0
22 Nov 2018
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models
Tara N. Sainath
Rohit Prabhavalkar
Shankar Kumar
Seungjin Lee
Anjuli Kannan
...
Patrick Nguyen
Yue Liu
Yonghui Wu
Zhiwen Chen
Chung-Cheng Chiu
71
54
0
05 Dec 2017