Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1912.06670
Cited By
Common Voice: A Massively-Multilingual Speech Corpus
13 December 2019
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Common Voice: A Massively-Multilingual Speech Corpus"
50 / 310 papers shown
Title
The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation
Mutian He
Philip N. Garner
44
4
0
16 May 2023
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking
Fazle Rakib
Souhardya Saha Dip
Samiul Alam
Nazia Tasnim
Md. Istiak Hossain Shihab
...
Farig Sadeque
Sayma Sultana Chowdhury
Tahsin Reasat
Asif Sushmit
Ahmed Imtiaz Humayun
21
6
0
15 May 2023
Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models
Takanori Ashihara
Takafumi Moriya
Kohei Matsuura
Tomohiro Tanaka
33
3
0
09 May 2023
Spaiche: Extending State-of-the-Art ASR Models to Swiss German Dialects
Clément Sicard
Kajetan Pyszkowski
Victor Gillioz
26
7
0
20 Apr 2023
A Survey of Corpora for Germanic Low-Resource Languages and Dialects
Verena Blaschke
Hinrich Schütze
Barbara Plank
27
13
0
19 Apr 2023
Prak: An automatic phonetic alignment tool for Czech
V. Hanzl
Adléta Hanzlová
27
0
0
17 Apr 2023
Prediction-Oriented Bayesian Active Learning
Freddie Bickford-Smith
Andreas Kirsch
Sebastian Farquhar
Y. Gal
Adam Foster
Tom Rainforth
42
29
0
17 Apr 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Hainan Xu
Fei Jia
Somshubra Majumdar
Hengguan Huang
Shinji Watanabe
Boris Ginsburg
27
19
0
13 Apr 2023
Multilingual Word Error Rate Estimation: e-WER3
Shammur A. Chowdhury
Ahmed M. Ali
24
7
0
02 Apr 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu
Egor Lakomkin
Konstantinos Vougioukas
Pingchuan Ma
Honglie Chen
...
Niko Moritz
J. Kolár
Stavros Petridis
M. Pantic
Christian Fuegen
52
19
0
30 Mar 2023
Demystifying Misconceptions in Social Bots Research
S. Cresci
Kai-Cheng Yang
A. Spognardi
Roberto Di Pietro
Maurizio Tesconi
M. Petrocchi
36
17
0
30 Mar 2023
Bilex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation
Alex Jones
Isaac Caswell
Ishan Saxena
Orhan Firat
23
9
0
27 Mar 2023
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages
Chris C. Emezue
Sanchit Gandhi
Lewis Tunstall
Abubakar Abid
Josh Meyer
...
Douwe Kiela
Yacine Jernite
Julien Chaumond
Merve Noyan
Omar Sanseviero
33
2
0
22 Mar 2023
Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network
Zeyu Ren
Nurmemet Yolwas
Huiru Wang
Wushour Slamu
21
0
0
22 Mar 2023
Right the docs: Characterising voice dataset documentation practices used in machine learning
Kathy Reid
Elizabeth T. Williams
22
2
0
19 Mar 2023
Improving Accented Speech Recognition with Multi-Domain Training
Lucas Maison
Yannick Esteve
26
7
0
14 Mar 2023
audb -- Sharing and Versioning of Audio and Annotation Data in Python
H. Wierstorf
Johannes Wagner
F. Eyben
Felix Burkhardt
Björn W. Schuller
30
1
0
01 Mar 2023
Explanations for Automatic Speech Recognition
Xiao-lan Wu
P. Bell
A. Rajan
14
6
0
27 Feb 2023
Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
16
0
0
26 Feb 2023
Measuring Equality in Machine Learning Security Defenses: A Case Study in Speech Recognition
Luke E. Richards
Edward Raff
Cynthia Matuszek
AAML
16
2
0
17 Feb 2023
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Vinícius Ribeiro
Yiteng Huang
Yuan Shangguan
Zhaojun Yang
Liting Wan
Ming Sun
24
1
0
17 Feb 2023
Stabilising and accelerating light gated recurrent units for automatic speech recognition
Adel Moumen
Titouan Parcollet
28
3
0
16 Feb 2023
Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Karol Nowakowski
M. Ptaszynski
Kyoko Murasaki
Jagna Nieuwazny
23
23
0
18 Jan 2023
2nd Swiss German Speech to Standard German Text Shared Task at SwissText 2022
Michel Plüss
Yanick Schraner
Christian Scheller
Manfred Vogel
34
2
0
17 Jan 2023
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek
Georgios Paraskevopoulos
Theodoros Kouzelis
Georgios Rouvalis
Athanasios Katsamanis
Vassilis Katsouros
Alexandros Potamianos
VLM
30
7
0
31 Dec 2022
Voice conversion with limited data and limitless data augmentations
Olga Slizovskaia
Jordi Janer
Pritish Chandna
Oscar Mayor
35
1
0
27 Dec 2022
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Wei-Ning Hsu
Tal Remez
Bowen Shi
Jacob Donley
Yossi Adi
DiffM
27
12
0
21 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
41
6
0
19 Dec 2022
WACO: Word-Aligned Contrastive Learning for Speech Translation
Siqi Ouyang
Rong Ye
Lei Li
32
25
0
19 Dec 2022
Jointly Learning Visual and Auditory Speech Representations from Raw Data
A. Haliassos
Pingchuan Ma
Rodrigo Mira
Stavros Petridis
M. Pantic
SSL
45
49
0
12 Dec 2022
DDSupport: Language Learning Support System that Displays Differences and Distances from Model Speech
Kazuki Kawamura
Jun Rekimoto
33
0
0
08 Dec 2022
Unsupervised Fine-Tuning Data Selection for ASR Using Self-Supervised Speech Models
Reem Gody
David Harwath
20
3
0
03 Dec 2022
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
Spandan Dey
Md. Sahidullah
G. Saha
33
20
0
30 Nov 2022
Towards continually learning new languages
Ngoc-Quan Pham
Jan Niehues
A. Waibel
CLL
11
1
0
21 Nov 2022
Phonemic Adversarial Attack against Audio Recognition in Real World
Jiakai Wang
Zhendong Chen
Zixin Yin
Qinghong Yang
Xianglong Liu
AAML
40
3
0
19 Nov 2022
Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness
C. Hazirbas
Yejin Bang
Tiezheng Yu
Parisa Assar
Bilal Porgali
...
Jacqueline Pan
Emily McReynolds
Miranda Bogen
Pascale Fung
Cristian Canton Ferrer
35
8
0
10 Nov 2022
Speech separation with large-scale self-supervised learning
Zhuo Chen
Naoyuki Kanda
Jian Wu
Yu-Huan Wu
Xiaofei Wang
Takuya Yoshioka
Jinyu Li
S. Sivasankaran
Sefik Emre Eskimez
19
14
0
09 Nov 2022
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations
Paul-Ambroise Duquenne
Hongyu Gong
Ning Dong
Jingfei Du
Ann Lee
Vedanuj Goswani
Changhan Wang
J. Pino
Benoît Sagot
Holger Schwenk
45
34
0
08 Nov 2022
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
Juan Pablo Zuluaga
Karel Veselý
Igor Szöke
Alexander Blatt
P. Motlícek
...
Claudia Cevenini
Pavel Kolcárek
Allan Tart
J. Černocký
Dietrich Klakow
34
23
0
08 Nov 2022
Going In Style: Audio Backdoors Through Stylistic Transformations
Stefanos Koffas
Luca Pajola
S. Picek
Mauro Conti
31
23
0
06 Nov 2022
Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Christian Heider Nielsen
Zheng-Hua Tan
AAML
19
1
0
03 Nov 2022
Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system
Li Li
Dongxing Xu
Haoran Wei
Yanhua Long
21
2
0
03 Nov 2022
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition
Chao-Han Huck Yang
Bo-wen Li
Yu Zhang
Nanxin Chen
Tara N. Sainath
Sabato Marco Siniscalchi
Chin-Hui Lee
27
6
0
02 Nov 2022
Avoid Overthinking in Self-Supervised Models for Speech Recognition
Dan Berrebbi
Brian Yan
Shinji Watanabe
LRM
26
4
0
01 Nov 2022
Metric Learning for User-defined Keyword Spotting
Jaemin Jung
You-kyong. Kim
Jihwan Park
Youshin Lim
Byeong-Yeol Kim
Youngjoon Jang
Joon Son Chung
40
9
0
01 Nov 2022
SG-VAD: Stochastic Gates Based Speech Activity Detection
Jonathan Svirsky
Ofir Lindenbaum
39
4
0
28 Oct 2022
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition
Steven Vander Eeckt
Hugo Van hamme
CLL
MoMe
64
14
0
27 Oct 2022
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation
F. López
Jordi Luque
14
6
0
27 Oct 2022
Training Autoregressive Speech Recognition Models with Limited in-domain Supervision
Chak-Fai Li
Francis Keith
William Hartmann
M. Snover
19
0
0
27 Oct 2022
There is more than one kind of robustness: Fooling Whisper with adversarial examples
R. Olivier
Bhiksha Raj
AAML
40
12
0
26 Oct 2022
Previous
1
2
3
4
5
6
7
Next