Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.03945
Cited By
Towards Building ASR Systems for the Next Billion Users
6 November 2021
Tahir Javed
Sumanth Doddapaneni
A. Raman
Kaushal Bhogale
Gowtham Ramesh
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards Building ASR Systems for the Next Billion Users"
22 / 22 papers shown
Title
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving
Bhavani Shankar
P. Jyothi
Pushpak Bhattacharyya
40
1
0
16 Jun 2024
Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens
Nay San
Georgios Paraskevopoulos
Aryaman Arora
Xiluo He
Prabhjot Kaur
Oliver Adams
Dan Jurafsky
31
7
0
03 Feb 2024
Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration
Piyush Singh Pasi
Karthikeya Battepati
P. Jyothi
Ganesh Ramakrishnan
T. Mahapatra
Manoj Singh
51
0
0
10 Oct 2023
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR
Tobi Olatunji
Tejumade Afonja
Aditya Yadavalli
Chris C. Emezue
Sahib Singh
...
Joanne I. Osuchukwu
Salomey Osei
A. Tonja
Naome A. Etori
Clinton Mbataku
25
15
0
30 Sep 2023
Multimodal Modeling For Spoken Language Identification
Shikhar Bharadwaj
Min Ma
Shikhar Vashishth
Ankur Bapna
Sriram Ganapathy
...
Yu Zhang
D. Esch
Sandy Ritchie
Partha P. Talukdar
Jason Riesa
30
0
0
19 Sep 2023
"We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations
N. Abhishek
P. Bhattacharyya
14
2
0
06 Aug 2023
MASR: Multi-label Aware Speech Representation
Anjali Raj
Shikhar Bharadwaj
Sriram Ganapathy
Min Ma
Shikhar Vashishth
SSL
11
0
0
20 Jul 2023
Label Aware Speech Representation Learning For Language Identification
Shikhar Vashishth
Shikhar Bharadwaj
Sriram Ganapathy
Ankur Bapna
Min Ma
Wei Han
Vera Axelrod
Partha P. Talukdar
SSL
17
4
0
07 Jun 2023
AfriNames: Most ASR models "butcher" African Names
Tobi Olatunji
Tejumade Afonja
Bonaventure F. P. Dossou
A. Tonja
Chris C. Emezue
Amina Mardiyyah Rufai
Sahib Singh
19
5
0
01 Jun 2023
Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Kaushal Bhogale
Sairam Sundaresan
A. Raman
Tahir Javed
Mitesh M. Khapra
Pratyush Kumar
VLM
25
10
0
24 May 2023
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
77
298
0
22 May 2023
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
Mingda Chen
Paul-Ambroise Duquenne
Pierre Yves Andrews
Justine T. Kao
Alexandre Mourachko
Holger Schwenk
Marta R. Costa-jussá
14
17
0
16 Dec 2022
Towards Building Text-To-Speech Systems for the Next Billion Users
Gokul Karthik Kumar
V. PraveenS.
Pratyush Kumar
Mitesh M. Khapra
Karthik Nandakumar
36
18
0
17 Nov 2022
Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
Travis M. Bartley
Fei Jia
Krishna C. Puvvada
Samuel Kriman
Boris Ginsburg
SSL
21
6
0
09 Nov 2022
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
33
16
0
05 Oct 2022
An Automatic Speech Recognition System for Bengali Language based on Wav2Vec2 and Transfer Learning
Tushar Talukder Showrav
17
6
0
16 Sep 2022
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Kaushal Bhogale
A. Raman
Tahir Javed
Sumanth Doddapaneni
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
17
22
0
26 Aug 2022
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages
Tahir Javed
Kaushal Bhogale
A. Raman
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
ELM
25
20
0
24 Aug 2022
WavFT: Acoustic model finetuning with labelled and unlabelled data
Utkarsh Chauhan
Vikas Joshi
Rupeshkumar Mehta
9
0
0
01 Apr 2022
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition
Ashish Seth
L. D. Prasad
Sreyan Ghosh
S. Umesh
30
3
0
31 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition
Hemant Yadav
Sunayana Sitaram
17
35
0
25 Feb 2022
CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Anirudh Gupta
Harveen Singh Chadha
Priyanshi Shah
Neeraj Chimmwal
Ankur Dhuriya
Rishabh Gaur
Vivek Raghavan
31
37
0
15 Jul 2021
1