Deep Speech: Scaling up end-to-end speech recognition


17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 750 papers shown
Stage-based Hyper-parameter Optimization for Deep Learning
Ahnjae Shin
Dongjin Shin
Sungwoo Cho
Do Yoon Kim
Eunji Jeong
Gyeong-In Yu
Byung-Gon Chun
11
4
0
24 Nov 2019
Universal adversarial examples in speech command classification
Jon Vadillo
Roberto Santana
AAML
34
29
0
22 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
Amirata Ghorbani
Vivek Natarajan
David Coz
Yuan Liu
GAN
MedIm
21
98
0
20 Nov 2019
Generate (non-software) Bugs to Fool Classifiers
Hiromu Yakura
Youhei Akimoto
Jun Sakuma
AAML
25
10
0
20 Nov 2019
A novel method for identifying the deep neural network model with the Serial Number
Xiangrui Xu
Yaqin Li
Cao Yuan
AAML
16
8
0
19 Nov 2019
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models
Siddharth Dalmia
Abdel-rahman Mohamed
M. Lewis
Florian Metze
Luke Zettlemoyer
16
10
0
09 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
Guangke Chen
Sen Chen
Lingling Fan
Xiaoning Du
Zhe Zhao
Fu Song
Yang Liu
AAML
19
193
0
03 Nov 2019
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
Bhavya Ghai
Buvana Ramanan
Klaus Mueller
11
1
0
29 Oct 2019
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation
T. Nguyen
S. Stueker
Jan Niehues
A. Waibel
11
98
0
29 Oct 2019
Meta Learning for End-to-End Low-Resource Speech Recognition
Jui-Yang Hsu
Yuan-Jui Chen
Hung-yi Lee
27
103
0
26 Oct 2019
Recognizing long-form speech using streaming end-to-end models
A. Narayanan
Rohit Prabhavalkar
Chung-Cheng Chiu
David Rybach
Tara N. Sainath
Trevor Strohman
29
129
0
24 Oct 2019
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks
Sherif Abdulatif
Karim Armanious
Karim Guirguis
Jayasankar T. Sajeev
Bin Yang
GAN
6
0
0
21 Oct 2019
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
22
10
0
18 Oct 2019
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems
H. Abdullah
Muhammad Sajidur Rahman
Washington Garcia
Logan Blue
Kevin Warren
Anurag Swarnim Yadav
T. Shrimpton
Patrick Traynor
AAML
25
88
0
11 Oct 2019
Animating Face using Disentangled Audio Representations
Gaurav Mittal
Baoyuan Wang
CVBM
18
39
0
02 Oct 2019
Addressing Failure Prediction by Learning Model Confidence
Charles Corbière
Nicolas Thome
Avner Bar-Hen
Matthieu Cord
P. Pérez
33
282
0
01 Oct 2019
RandAugment: Practical automated data augmentation with a reduced search space
E. D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
MQ
96
3,416
0
30 Sep 2019
A Comparison of Hybrid and End-to-End Models for Syllable Recognition
Sebastian P. Bayerl
Korbinian Riedhammer
12
2
0
19 Sep 2019
Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
Han Xu
Yao Ma
Haochen Liu
Debayan Deb
Hui Liu
Jiliang Tang
Anil K. Jain
AAML
33
668
0
17 Sep 2019
Preech: A System for Privacy-Preserving Speech Transcription
Shimaa Ahmed
Amrita Roy Chowdhury
Kassem Fawaz
P. Ramanathan
51
46
0
09 Sep 2019
A Quantum Search Decoder for Natural Language Processing
Johannes Bausch
Sathyawageeswar Subramanian
Stephen Piddock
20
14
0
09 Sep 2019
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units
Yujeong Choi
Minsoo Rhu
6
127
0
06 Sep 2019
Harnessing the Power of Deep Learning Methods in Healthcare: Neonatal Pain Assessment from Crying Sound
Md Sirajus Salekin
Ghada Zamzami
Rahul Paul
Dmitry Goldgof
R. Kasturi
T. Ho
Yu Sun
16
7
0
05 Sep 2019
Brain2Char: A Deep Architecture for Decoding Text from Brain Recordings
Pengfei Sun
Gopala K. Anumanchipalli
E. Chang
11
56
0
03 Sep 2019
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning
Joel Hestness
Newsha Ardalani
G. Diamos
13
66
0
03 Sep 2019
Metric Learning for Adversarial Robustness
Chengzhi Mao
Ziyuan Zhong
Junfeng Yang
Carl Vondrick
Baishakhi Ray
OOD
21
183
0
03 Sep 2019
Smaller Models, Better Generalization
Mayank Sharma
Suraj Tripathi
Abhimanyu Dubey
Jayadeva Jayadeva
Sai Guruju
Nihal Goalla
15
1
0
29 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Pavel Denisov
Ngoc Thang Vu
17
27
0
13 Aug 2019
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
25
51
0
08 Aug 2019
Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Lea Schonherr
Thorsten Eisenhofer
Steffen Zeiler
Thorsten Holz
D. Kolossa
AAML
54
63
0
05 Aug 2019
Machine Learning at the Network Edge: A Survey
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
38
378
0
31 Jul 2019
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement
Alzahra Badi
Sangwook Park
D. Han
Hanseok Ko
16
6
0
26 Jul 2019
A system of different layers of abstraction for artificial intelligence
Alexander Serb
T. Prodromakis
AI4CE
19
6
0
22 Jul 2019
A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing
Alexandrou Serb
I. Kobyzev
Jiaqi Wang
T. Prodromakis
4
3
0
12 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural text-to-speech
V. Klimkov
S. Ronanki
Jonas Rohnke
Thomas Drugman
AI4TS
14
82
0
04 Jul 2019
Towards Interpretable Deep Extreme Multi-label Learning
Yihuang Kang
I-Ling Cheng
W. Mao
Bowen Kuo
Pei-Ju Lee
11
0
0
03 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
12
182
0
02 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Suyoun Kim
Siddharth Dalmia
Florian Metze
15
23
0
27 Jun 2019
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias
Ryo Nakashima
Ryo Ozaki
T. Taniguchi
21
6
0
21 Jun 2019
On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks
Masoumeh Shafieinejad
Jiaqi Wang
Nils Lukas
Xinda Li
Florian Kerschbaum
AAML
25
8
0
18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
10
54
0
18 Jun 2019
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Aaron Nicolson
K. Paliwal
11
12
0
18 Jun 2019
Perceptual Based Adversarial Audio Attacks
Joseph Szurley
J. Zico Kolter
AAML
24
25
0
14 Jun 2019
Selfie: Self-supervised Pretraining for Image Embedding
Trieu H. Trinh
Minh-Thang Luong
Quoc V. Le
SSL
11
111
0
07 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
44
290
0
06 Jun 2019
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness
A. Malinin
Mark Gales
UQCV
AAML
27
172
0
31 May 2019
Speaker Anonymization Using X-vector and Neural Waveform Models
Fuming Fang
Xin Wang
Junichi Yamagishi
Isao Echizen
Massimiliano Todisco
Nicholas W. D. Evans
J. Bonastre
21
134
0
30 May 2019
Mixed Precision Training With 8-bit Floating Point
Naveen Mellempudi
Sudarshan Srinivasan
Dipankar Das
Bharat Kaul
MQ
18
68
0
29 May 2019
Local Label Propagation for Large-Scale Semi-Supervised Learning
Chengxu Zhuang
Xuehao Ding
Divyanshu Murli
Daniel L. K. Yamins
SSL
30
11
0
28 May 2019
NTP : A Neural Network Topology Profiler
Raghavendra Bhat
Pravin Chandran
Juby Jose
Viswanath Dibbur
Prakash Sirra Ajith
19
2
0
22 May 2019