ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.09444
  4. Cited By
Letter-Based Speech Recognition with Gated ConvNets

Letter-Based Speech Recognition with Gated ConvNets

22 December 2017
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
ArXivPDFHTML

Papers citing "Letter-Based Speech Recognition with Gated ConvNets"

12 / 12 papers shown
Title
Towards Personalization of CTC Speech Recognition Models with Contextual
  Adapters and Adaptive Boosting
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting
Saket Dingliwal
Monica Sunkara
S. Bodapati
S. Ronanki
Jeffrey J. Farris
Katrin Kirchhoff
33
0
0
18 Oct 2022
Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Guillermo Cámbara
Jordi Luque
Mireia Farrús
24
0
0
21 Dec 2021
Towards Building ASR Systems for the Next Billion Users
Towards Building ASR Systems for the Next Billion Users
Tahir Javed
Sumanth Doddapaneni
A. Raman
Kaushal Bhogale
Gowtham Ramesh
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
44
54
0
06 Nov 2021
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network
  for Voice Activity Detection
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection
Fei Jia
Somshubra Majumdar
Boris Ginsburg
11
48
0
26 Oct 2020
Imputer: Sequence Modelling via Imputation and Dynamic Programming
Imputer: Sequence Modelling via Imputation and Dynamic Programming
William Chan
Chitwan Saharia
Geoffrey E. Hinton
Mohammad Norouzi
Navdeep Jaitly
BDL
AI4TS
21
114
0
20 Feb 2020
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
22
0
19 Aug 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
27
125
0
27 May 2019
A Fully Differentiable Beam Search Decoder
A Fully Differentiable Beam Search Decoder
R. Collobert
Awni Y. Hannun
Gabriel Synnaeve
17
40
0
16 Feb 2019
wav2letter++: The Fastest Open-source Speech Recognition System
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
10
156
0
18 Dec 2018
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial
  and Multi-task Learning in Speech Recognition
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Yossi Adi
Neil Zeghidour
R. Collobert
Nicolas Usunier
Vitaliy Liptchinsky
Gabriel Synnaeve
29
39
0
09 Dec 2018
Deep Audio-Visual Speech Recognition
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
27
687
0
06 Sep 2018
CNN+LSTM Architecture for Speech Emotion Recognition with Data
  Augmentation
CNN+LSTM Architecture for Speech Emotion Recognition with Data Augmentation
Caroline Etienne
Guillaume Fidanza
Andrei Petrovskii
Laurence Devillers
B. Schmauch
19
99
0
15 Feb 2018
1