ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.01541
  4. Cited By
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence
  Model

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model

5 December 2017
Bo-wen Li
Tara N. Sainath
K. Sim
M. Bacchiani
Eugene Weinstein
Patrick Nguyen
Z. Chen
Yan-Qing Wu
Kanishka Rao
ArXivPDFHTML

Papers citing "Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model"

21 / 21 papers shown
Title
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR
Zheshu Song
Jianheng Zhuo
Yifan Yang
Ziyang Ma
Shixiong Zhang
Xie Chen
36
9
0
07 Jun 2024
Low-resource speech recognition and dialect identification of Irish in a
  multi-task framework
Low-resource speech recognition and dialect identification of Irish in a multi-task framework
Liam Lonergan
Mengjie Qian
Neasa Ní Chiaráin
Christer Gobl
A. N. Chasaide
43
2
0
02 May 2024
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in
  Sorani Kurdish
Where Are You From? Let Me Guess! Subdialect Recognition of Speeches in Sorani Kurdish
Sana Isam
Hossein Hassani
25
0
0
29 Mar 2024
Can Whisper perform speech-based in-context learning?
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
24
24
0
13 Sep 2023
Towards dialect-inclusive recognition in a low-resource language: are
  balanced corpora the answer?
Towards dialect-inclusive recognition in a low-resource language: are balanced corpora the answer?
Liam Lonergan
Mengjie Qian
Neasa Ní Chiaráin
Christer Gobl
A. N. Chasaide
21
6
0
14 Jul 2023
Multi-pass Training and Cross-information Fusion for Low-resource
  End-to-end Accented Speech Recognition
Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition
Xuefei Wang
Yanhua Long
Yijie Li
Haoran Wei
27
4
0
20 Jun 2023
Improved Self-Supervised Multilingual Speech Representation Learning
  Combined with Auxiliary Language Information
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information
Fenglin Ding
Genshun Wan
Pengcheng Li
Jia-Yu Pan
Cong Liu
SSL
25
1
0
07 Dec 2022
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture,
  and Generalization Capabilities
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities
Andros Tjandra
Nayan Singhal
David C. Zhang
Ozlem Kalinli
Abdel-rahman Mohamed
Duc Le
M. Seltzer
32
12
0
10 Nov 2022
Distilling a Pretrained Language Model to a Multilingual ASR Model
Distilling a Pretrained Language Model to a Multilingual ASR Model
Kwanghee Choi
Hyung-Min Park
VLM
19
10
0
25 Jun 2022
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech
  Recognition
A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Sanghyun Yoo
Inchul Song
Yoshua Bengio
22
28
0
06 May 2022
Transducer-based language embedding for spoken language identification
Transducer-based language embedding for spoken language identification
Peng Shen
Xugang Lu
Hisashi Kawai
50
6
0
08 Apr 2022
Multilingual Speech Recognition using Knowledge Transfer across Learning
  Processes
Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Rimita Lahiri
K. Kumatani
Eric Sun
Yao Qian
55
6
0
15 Oct 2021
Integrating Categorical Features in End-to-End ASR
Integrating Categorical Features in End-to-End ASR
Rongqing Huang
18
1
0
06 Oct 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Scaling End-to-End Models for Large-Scale Multilingual ASR
Bo-wen Li
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Yifan Jiang
Min Ma
Junwen Bai
CLL
28
76
0
30 Apr 2021
Cascaded encoders for unifying streaming and non-streaming ASR
Cascaded encoders for unifying streaming and non-streaming ASR
A. Narayanan
Tara N. Sainath
Ruoming Pang
Jiahui Yu
Chung-Cheng Chiu
Rohit Prabhavalkar
Ehsan Variani
Trevor Strohman
AuLLM
6
85
0
27 Oct 2020
Language-agnostic Multilingual Modeling
Language-agnostic Multilingual Modeling
A. Datta
Bhuvana Ramabhadran
Jesse Emond
Anjuli Kannan
Brian Roark
18
35
0
20 Apr 2020
A Streaming On-Device End-to-End Model Surpassing Server-Side
  Conventional Model Quality and Latency
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
Tara N. Sainath
Yanzhang He
Bo-wen Li
A. Narayanan
Ruoming Pang
...
Trevor Strohman
Mirkó Visontai
Yonghui Wu
Yu Zhang
Ding Zhao
20
215
0
28 Mar 2020
Improving Transformer-based Speech Recognition Using Unsupervised
  Pre-training
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
24
99
0
22 Oct 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence
  Modeling
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Z. Chen
M. Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
27
209
0
21 Feb 2019
Pretraining by Backtranslation for End-to-end ASR in Low-Resource
  Settings
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Matthew Wiesner
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
15
32
0
10 Dec 2018
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica
  in End-to-End Models
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models
Tara N. Sainath
Rohit Prabhavalkar
Shankar Kumar
Seungjin Lee
Anjuli Kannan
...
Patrick Nguyen
Bo-wen Li
Yonghui Wu
Z. Chen
Chung-Cheng Chiu
11
53
0
05 Dec 2017
1