ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.5567
  4. Cited By
Deep Speech: Scaling up end-to-end speech recognition

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
ArXivPDFHTML

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 750 papers shown
Title
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural
  Network: A Survey
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey
Xiaoyu Zhang
Chao Chen
Yi Xie
Xiaofeng Chen
Jun Zhang
Yang Xiang
FedML
22
7
0
13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
24
3
0
13 May 2021
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in
  Commodity DRAM
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in Commodity DRAM
Sourjya Roy
M. Ali
A. Raghunathan
14
19
0
08 May 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech
  and Background Noise Effect
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect
Binbin Xu
Chongyang Tao
Z. Feng
Youssef Raqui
Sylvie Ranwez
11
12
0
07 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient
  Distributed Artificial Intelligence
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
28
86
0
04 May 2021
On the limit of English conversational speech recognition
On the limit of English conversational speech recognition
Zoltán Tüske
G. Saon
Brian Kingsbury
22
50
0
03 May 2021
RotLSTM: Rotating Memories in Recurrent Neural Networks
RotLSTM: Rotating Memories in Recurrent Neural Networks
Vlad Velici
Adam Prugel-Bennett
RALM
VLM
17
1
0
01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental
  Comparison
Adversarial Example Detection for DNN Models: A Review and Experimental Comparison
Ahmed Aldahdooh
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
11
122
0
01 May 2021
End-to-End Speech Recognition from Federated Acoustic Models
End-to-End Speech Recognition from Federated Acoustic Models
Yan Gao
Titouan Parcollet
Salah Zaiem
Javier Fernandez-Marques
Pedro Porto Buarque de Gusmão
Daniel J. Beutel
Nicholas D. Lane
28
43
0
29 Apr 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Ali Ramezani-Kebrya
Fartash Faghri
Ilya Markov
V. Aksenov
Dan Alistarh
Daniel M. Roy
MQ
65
30
0
28 Apr 2021
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head
Qianyun Wang
Zhenfeng Fan
Shi-hong Xia
3DH
71
18
0
25 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing
Quantization of Deep Neural Networks for Accurate Edge Computing
Wentao Chen
Hailong Qiu
Zhuang Jian
Chutong Zhang
Yu Hu
Qing Lu
Tianchen Wang
Yiyu Shi
Meiping Huang
Xiaowe Xu
52
21
0
25 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Janne Pylkkönen
Antti Ukkonen
Juho Kilpikoski
Samu Tamminen
Hannes Heikinheimo
18
27
0
22 Apr 2021
Best Practices for Noise-Based Augmentation to Improve the Performance
  of Deployable Speech-Based Emotion Recognition Systems
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems
Mimansa Jaiswal
E. Provost
26
0
0
18 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality
  Disentanglement
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
39
194
0
16 Apr 2021
A Method to Reveal Speaker Identity in Distributed ASR Training, and How
  to Counter It
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It
Trung D. Q. Dang
Om Thakkar
Swaroop Indra Ramaswamy
Rajiv Mathews
Peter Chin
Franccoise Beaufays
FedML
30
10
0
15 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
22
12
0
11 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by
  Applying Fast-Skip Regularization
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
Zhengkun Tian
Jiangyan Yi
Ye Bai
J. Tao
Shuai Zhang
Zhengqi Wen
28
16
0
07 Apr 2021
Visual Alignment Constraint for Continuous Sign Language Recognition
Visual Alignment Constraint for Continuous Sign Language Recognition
Yuecong Min
Aiming Hao
Xiujuan Chai
Xilin Chen
SLR
28
129
0
06 Apr 2021
Intent Recognition and Unsupervised Slot Identification for Low
  Resourced Spoken Dialog Systems
Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems
Akshat Gupta
Olivia Deng
Akruti Kushwaha
Saloni Mittal
William Zeng
Sai Krishna Rallabandi
A. Black
16
7
0
03 Apr 2021
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity
  and Model Smoothness
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity and Model Smoothness
Zhuolin Yang
Linyi Li
Xiaojun Xu
Shiliang Zuo
Qiang Chen
Benjamin I. P. Rubinstein
Pan Zhou
Ce Zhang
Bo-wen Li
AAML
18
53
0
01 Apr 2021
Comparison of different convolutional neural network activation
  functions and methods for building ensembles
Comparison of different convolutional neural network activation functions and methods for building ensembles
L. Nanni
Gianluca Maguolo
S. Brahnam
M. Paci
16
8
0
29 Mar 2021
Are all outliers alike? On Understanding the Diversity of Outliers for
  Detecting OODs
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs
R. Kaur
Susmit Jha
Anirban Roy
O. Sokolsky
Insup Lee
11
13
0
23 Mar 2021
Federated Quantum Machine Learning
Federated Quantum Machine Learning
Samuel Yen-Chi Chen
Shinjae Yoo
FedML
AI4CE
19
115
0
22 Mar 2021
Digital Peter: Dataset, Competition and Handwriting Recognition Methods
Digital Peter: Dataset, Competition and Handwriting Recognition Methods
M. Potanin
Denis Dimitrov
Alex Shonenkov
Vladimir Bataev
Denis Karachev
Maxim Novopoltsev
21
9
0
16 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
26
12
0
13 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion
  Recognition
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
38
55
0
10 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey
  and Research Challenges
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
33
199
0
08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial Examples
WaveGuard: Understanding and Mitigating Audio Adversarial Examples
Shehzeen Samarah Hussain
Paarth Neekhara
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
30
71
0
04 Mar 2021
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale
  Black-Box Optimization
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization
HanQin Cai
Y. Lou
Daniel McKenzie
W. Yin
27
40
0
21 Feb 2021
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation
Elizabeth Fons
Paula Dawson
Xiao-Jun Zeng
J. Keane
Alexandros Iosifidis
AI4TS
23
23
0
16 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural
  Networks for Automatic Speech Recognition
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Priyabrata Karmakar
S. Teng
Guojun Lu
27
25
0
14 Feb 2021
Double-descent curves in neural networks: a new perspective using
  Gaussian processes
Double-descent curves in neural networks: a new perspective using Gaussian processes
Ouns El Harzli
Bernardo Cuenca Grau
Guillermo Valle Pérez
A. Louis
20
6
0
14 Feb 2021
Learning Speech-driven 3D Conversational Gestures from Video
Learning Speech-driven 3D Conversational Gestures from Video
I. Habibie
Weipeng Xu
Dushyant Mehta
Lingjie Liu
Hans-Peter Seidel
Gerard Pons-Moll
Mohamed A. Elgharib
Christian Theobalt
SLR
CVBM
3DH
40
107
0
13 Feb 2021
Dompteur: Taming Audio Adversarial Examples
Dompteur: Taming Audio Adversarial Examples
Thorsten Eisenhofer
Lea Schonherr
Joel Frank
Lars Speckemeier
D. Kolossa
Thorsten Holz
AAML
33
24
0
10 Feb 2021
BembaSpeech: A Speech Recognition Corpus for the Bemba Language
BembaSpeech: A Speech Recognition Corpus for the Bemba Language
Claytone Sikasote
Antonios Anastasopoulos
9
21
0
09 Feb 2021
Classification of Handwritten Names of Cities and Handwritten Text
  Recognition using Various Deep Learning Models
Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models
D. Nurseitov
K. Bostanbekov
Maksat Kanatov
Anel N. Alimova
Abdelrahman Abdallah
Galymzhan Abdimanap
29
33
0
09 Feb 2021
Effects of Layer Freezing on Transferring a Speech Recognition System to
  Under-resourced Languages
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages
Onno Eberhard
Torsten Zesch
11
3
0
08 Feb 2021
A bandit approach to curriculum generation for automatic speech
  recognition
A bandit approach to curriculum generation for automatic speech recognition
Anastasia Kuznetsova
Anurag Kumar
Francis M. Tyers
11
1
0
06 Feb 2021
Audio Adversarial Examples: Attacks Using Vocal Masks
Audio Adversarial Examples: Attacks Using Vocal Masks
Kai Yuan Tay
Lynnette Hui Xian Ng
Wei Han Chua
Lucerne Loke
Danqi Ye
Melissa Chua
AAML
18
0
0
04 Feb 2021
Effects of Number of Filters of Convolutional Layers on Speech
  Recognition Model Accuracy
Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
James Mou
Jun Li
11
3
0
03 Feb 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for
  Text-Dependent Speaker Verification
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification
A. K. Sarkar
Md. Sahidullah
Zheng-Hua Tan
7
0
0
03 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech
  Recognition
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
Naoyuki Kanda
Yashesh Gaur
S. Parthasarathy
Eric Sun
Liang Lu
Xie Chen
Jinyu Li
Jiawei Liu
AuLLM
41
52
0
02 Feb 2021
An Efficient Statistical-based Gradient Compression Technique for
  Distributed Training Systems
An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems
A. Abdelmoniem
Ahmed Elzanaty
Mohamed-Slim Alouini
Marco Canini
51
75
0
26 Jan 2021
Evaluating Models of Robust Word Recognition with Serial Reproduction
Evaluating Models of Robust Word Recognition with Serial Reproduction
Stephan C. Meylan
Sathvik Nair
Thomas L. Griffiths
22
4
0
24 Jan 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
Stable Recovery of Entangled Weights: Towards Robust Identification of
  Deep Neural Networks from Minimal Samples
Stable Recovery of Entangled Weights: Towards Robust Identification of Deep Neural Networks from Minimal Samples
Christian Fiedler
M. Fornasier
T. Klock
Michael Rauchensteiner
OOD
22
12
0
18 Jan 2021
Black-box Adversarial Attacks on Monocular Depth Estimation Using
  Evolutionary Multi-objective Optimization
Black-box Adversarial Attacks on Monocular Depth Estimation Using Evolutionary Multi-objective Optimization
Renya Daimo
S. Ono
Takahiro Suzuki
AAML
MDE
6
4
0
29 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech
  Recognition
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Minglun Han
Linhao Dong
Shiyu Zhou
Bo Xu
13
21
0
17 Dec 2020
HeadGAN: One-shot Neural Head Synthesis and Editing
HeadGAN: One-shot Neural Head Synthesis and Editing
M. Doukas
S. Zafeiriou
V. Sharmanska
CVBM
3DH
27
125
0
15 Dec 2020
Previous
123...678...131415
Next