Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.09444
Cited By
Letter-Based Speech Recognition with Gated ConvNets
22 December 2017
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Letter-Based Speech Recognition with Gated ConvNets"
12 / 12 papers shown
Title
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting
Saket Dingliwal
Monica Sunkara
S. Bodapati
S. Ronanki
Jeffrey J. Farris
Katrin Kirchhoff
33
0
0
18 Oct 2022
Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Guillermo Cámbara
Jordi Luque
Mireia Farrús
24
0
0
21 Dec 2021
Towards Building ASR Systems for the Next Billion Users
Tahir Javed
Sumanth Doddapaneni
A. Raman
Kaushal Bhogale
Gowtham Ramesh
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
44
54
0
06 Nov 2021
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection
Fei Jia
Somshubra Majumdar
Boris Ginsburg
11
48
0
26 Oct 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
21
114
0
20 Feb 2020
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
22
0
19 Aug 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
27
125
0
27 May 2019
A Fully Differentiable Beam Search Decoder
R. Collobert
Awni Y. Hannun
Gabriel Synnaeve
17
40
0
16 Feb 2019
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
10
156
0
18 Dec 2018
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Yossi Adi
Neil Zeghidour
R. Collobert
Nicolas Usunier
Vitaliy Liptchinsky
Gabriel Synnaeve
29
39
0
09 Dec 2018
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
27
687
0
06 Sep 2018
CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation
Caroline Etienne
Guillaume Fidanza
Andrei Petrovskii
Laurence Devillers
B. Schmauch
19
99
0
15 Feb 2018
1