ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.5567
  4. Cited By
Deep Speech: Scaling up end-to-end speech recognition

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
ArXivPDFHTML

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 751 papers shown
Title
NTP : A Neural Network Topology Profiler
NTP : A Neural Network Topology Profiler
Raghavendra Bhat
Pravin Chandran
Juby Jose
Viswanath Dibbur
Prakash Sirra Ajith
24
2
0
22 May 2019
Acoustic-to-Word Models with Conversational Context Information
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
22
7
0
21 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems
Universal Adversarial Perturbations for Speech Recognition Systems
Paarth Neekhara
Shehzeen Samarah Hussain
Prakhar Pandey
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
20
113
0
09 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Capture, Learning, and Synthesis of 3D Speaking Styles
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM
3DH
53
338
0
08 May 2019
Transparent pronunciation scoring using articulatorily weighted phoneme
  edit distance
Transparent pronunciation scoring using articulatorily weighted phoneme edit distance
Reima Karhila
Anna-Riikka Smolander
Sari Ylinen
M. Kurimo
14
13
0
07 May 2019
Ensemble Distribution Distillation
Ensemble Distribution Distillation
A. Malinin
Bruno Mlodozeniec
Mark Gales
UQCV
27
231
0
30 Apr 2019
Unsupervised Data Augmentation for Consistency Training
Unsupervised Data Augmentation for Consistency Training
Qizhe Xie
Zihang Dai
Eduard H. Hovy
Minh-Thang Luong
Quoc V. Le
61
2,290
0
29 Apr 2019
Transformers with convolutional context for ASR
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
11
168
0
26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
8
3,412
0
18 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and
  Knowledge Distillation
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Adversarial Audio: A New Information Hiding Method and Backdoor for
  DNN-based Speech Recognition Models
Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models
Yehao Kong
Jiliang Zhang
11
26
0
08 Apr 2019
Measuring scheduling efficiency of RNNs for NLP applications
Measuring scheduling efficiency of RNNs for NLP applications
Urmish Thakker
Ganesh S. Dasika
Jesse G. Beu
Matthew Mattina
27
13
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
RAPID: Early Classification of Explosive Transients using Deep Learning
RAPID: Early Classification of Explosive Transients using Deep Learning
D. Muthukrishna
G. Narayan
K. Mandel
R. Biswas
R. Hložek
26
106
0
29 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings
Local Aggregation for Unsupervised Learning of Visual Embeddings
Chengxu Zhuang
Alex Zhai
Daniel L. K. Yamins
SSL
44
444
0
29 Mar 2019
Grammatical Error Correction and Style Transfer via Zero-shot
  Monolingual Translation
Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation
Elizaveta Korotkova
Agnes Luhtaru
Maksym Del
Krista Liin
Daiga Deksne
Mark Fishel
22
11
0
27 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End
  Speech Recognition
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
22
15
0
27 Mar 2019
Practical Hidden Voice Attacks against Speech and Speaker Recognition
  Systems
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems
H. Abdullah
Washington Garcia
Christian Peeters
Patrick Traynor
Kevin R. B. Butler
Joseph N. Wilson
AAML
17
165
0
18 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
22
14
0
12 Mar 2019
Source codes in human communication
Source codes in human communication
Michael Ramscar
6
11
0
08 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition
  from YouTube Videos
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
18
19
0
01 Mar 2019
Incorporating End-to-End Speech Recognition Models for Sentiment
  Analysis
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
25
21
0
28 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
Justice Amoh
K. Odame
26
17
0
13 Feb 2019
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning
  Applications
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications
Peifeng Yu
Mosharaf Chowdhury
10
72
0
12 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet
  Execution-Efficient LSTM
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
36
10
0
30 Jan 2019
Weighted-Sampling Audio Adversarial Example Attack
Weighted-Sampling Audio Adversarial Example Attack
Xiaolei Liu
Xiaosong Zhang
Kun Wan
Qingxin Zhu
Yufei Ding
DiffM
AAML
36
36
0
26 Jan 2019
SirenAttack: Generating Adversarial Audio for End-to-End Acoustic Systems
Tianyu Du
S. Ji
Jinfeng Li
Qinchen Gu
Ting Wang
R. Beyah
AAML
8
127
0
23 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in
  Speech Recognition
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
19
117
0
22 Jan 2019
Robust Watermarking of Neural Network with Exponential Weighting
Robust Watermarking of Neural Network with Exponential Weighting
Ryota Namba
Jun Sakuma
AAML
20
137
0
18 Jan 2019
Prototypical Metric Transfer Learning for Continuous Speech Keyword
  Spotting With Limited Training Data
Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data
Harshita Seth
Pulkit Kumar
Muktabh Mayank Srivastava
8
12
0
12 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
13
26
0
31 Dec 2018
Stanza: Layer Separation for Distributed Training in Deep Learning
Stanza: Layer Separation for Distributed Training in Deep Learning
Xiaorui Wu
Hongao Xu
Bo Li
Y. Xiong
MoE
20
9
0
27 Dec 2018
A Multiversion Programming Inspired Approach to Detecting Audio
  Adversarial Examples
A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples
Qiang Zeng
Jianhai Su
Chenglong Fu
Golam Kayas
Lannan Luo
AAML
27
46
0
26 Dec 2018
wav2letter++: The Fastest Open-source Speech Recognition System
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
20
156
0
18 Dec 2018
DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems
DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems
Xiaoning Du
Xiaofei Xie
Yi Li
Lei Ma
Jianjun Zhao
Yang Liu
24
38
0
13 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource
  Settings
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Matthew Wiesner
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
21
32
0
10 Dec 2018
Prior Networks for Detection of Adversarial Attacks
Prior Networks for Detection of Adversarial Attacks
A. Malinin
Mark Gales
AAML
22
5
0
06 Dec 2018
Layer Flexible Adaptive Computational Time
Layer Flexible Adaptive Computational Time
Lida Zhang
Abdolghani Ebrahimi
Diego Klabjan
AI4CE
36
1
0
06 Dec 2018
Overcoming Catastrophic Forgetting by Soft Parameter Pruning
Overcoming Catastrophic Forgetting by Soft Parameter Pruning
Jian-wei Peng
Jiang Hao
Zhuo Li
Enqiang Guo
X. Wan
Min Deng
Qing Zhu
Haifeng Li
CLL
20
4
0
04 Dec 2018
Effects of Loss Functions And Target Representations on Adversarial
  Robustness
Effects of Loss Functions And Target Representations on Adversarial Robustness
Sean Saito
S. Roy
AAML
11
7
0
01 Dec 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for
  Speech Recognition
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition
Jan Kremer
Lasse Borgholt
Lars Maaløe
34
6
0
28 Nov 2018
Adversarial Machine Learning And Speech Emotion Recognition: Utilizing
  Generative Adversarial Networks For Robustness
Adversarial Machine Learning And Speech Emotion Recognition: Utilizing Generative Adversarial Networks For Robustness
S. Latif
R. Rana
Junaid Qadir
GAN
AAML
24
42
0
28 Nov 2018
Improved Frequency Modulation Features for Multichannel Distant Speech
  Recognition
Improved Frequency Modulation Features for Multichannel Distant Speech Recognition
I. Rodomagoulakis
Petros Maragos
11
7
0
23 Nov 2018
Strong mixed-integer programming formulations for trained neural
  networks
Strong mixed-integer programming formulations for trained neural networks
Ross Anderson
Joey Huchette
Christian Tjandraatmadja
J. Vielma
19
251
0
20 Nov 2018
Protecting Voice Controlled Systems Using Sound Source Identification
  Based on Acoustic Cues
Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues
Yuan Gong
C. Poellabauer
AAML
11
27
0
16 Nov 2018
Streaming End-to-end Speech Recognition For Mobile Devices
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
42
624
0
15 Nov 2018
Automatic Grammar Augmentation for Robust Voice Command Recognition
Automatic Grammar Augmentation for Robust Voice Command Recognition
Yang Yang
Anusha Lalitha
Jinwon Lee
Chris Lott
21
3
0
14 Nov 2018
RNNFast: An Accelerator for Recurrent Neural Networks Using Domain Wall
  Memory
RNNFast: An Accelerator for Recurrent Neural Networks Using Domain Wall Memory
Mohammad Hossein Samavatian
Anys Bacha
Li Zhou
R. Teodorescu
22
7
0
07 Nov 2018
Adversarial Black-Box Attacks on Automatic Speech Recognition Systems
  using Multi-Objective Evolutionary Optimization
Adversarial Black-Box Attacks on Automatic Speech Recognition Systems using Multi-Objective Evolutionary Optimization
Shreya Khare
Rahul Aralikatte
Senthil Mani
AAML
11
14
0
04 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech
  Augmentation
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Jason Chun Lok Li
R. Gadde
Boris Ginsburg
Vitaly Lavrukhin
8
54
0
02 Nov 2018
Previous
123...101112...141516
Next