Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"


Stage-based Hyper-parameter Optimization for Deep Learning Ahnjae Shin Dongjin Shin Sungwoo Cho Do Yoon Kim Eunji Jeong Gyeong-In Yu Byung-Gon Chun 11 4 0 24 Nov 2019
Universal adversarial examples in speech command classification Jon Vadillo Roberto Santana AAML 34 29 0 22 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology Amirata Ghorbani Vivek Natarajan David Coz Yuan Liu GAN MedIm 21 98 0 20 Nov 2019
Generate (non-software) Bugs to Fool Classifiers Hiromu Yakura Youhei Akimoto Jun Sakuma AAML 25 10 0 20 Nov 2019
A novel method for identifying the deep neural network model with the Serial Number Xiangrui Xu Yaqin Li Cao Yuan AAML 16 8 0 19 Nov 2019
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models Siddharth Dalmia Abdel-rahman Mohamed M. Lewis Florian Metze Luke Zettlemoyer 16 10 0 09 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems Guangke Chen Sen Chen Lingling Fan Xiaoning Du Zhe Zhao Fu Song Yang Liu AAML 19 193 0 03 Nov 2019
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems? Bhavya Ghai Buvana Ramanan Klaus Mueller 11 1 0 29 Oct 2019
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation T. Nguyen S. Stueker Jan Niehues A. Waibel 11 98 0 29 Oct 2019
Meta Learning for End-to-End Low-Resource Speech Recognition Jui-Yang Hsu Yuan-Jui Chen Hung-yi Lee 27 103 0 26 Oct 2019
Recognizing long-form speech using streaming end-to-end models A. Narayanan Rohit Prabhavalkar Chung-Cheng Chiu David Rybach Tara N. Sainath Trevor Strohman 29 129 0 24 Oct 2019
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks Sherif Abdulatif Karim Armanious Karim Guirguis Jayasankar T. Sajeev Bin Yang GAN 6 0 0 21 Oct 2019
End-to-End Speech Recognition: A review for the French Language Florian Boyer Jean-Luc Rouas AI4TS 22 10 0 18 Oct 2019
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems H. Abdullah Muhammad Sajidur Rahman Washington Garcia Logan Blue Kevin Warren Anurag Swarnim Yadav T. Shrimpton Patrick Traynor AAML 25 88 0 11 Oct 2019
Animating Face using Disentangled Audio Representations Gaurav Mittal Baoyuan Wang CVBM 18 39 0 02 Oct 2019
Addressing Failure Prediction by Learning Model Confidence Charles Corbière Nicolas Thome Avner Bar-Hen Matthieu Cord P. Pérez 33 282 0 01 Oct 2019
RandAugment: Practical automated data augmentation with a reduced search space E. D. Cubuk Barret Zoph Jonathon Shlens Quoc V. Le MQ 96 3,416 0 30 Sep 2019
A Comparison of Hybrid and End-to-End Models for Syllable Recognition Sebastian P. Bayerl Korbinian Riedhammer 12 2 0 19 Sep 2019
Adversarial Attacks and Defenses in Images, Graphs and Text: A Review Han Xu Yao Ma Haochen Liu Debayan Deb Hui Liu Jiliang Tang Anil K. Jain AAML 33 668 0 17 Sep 2019
Preech: A System for Privacy-Preserving Speech Transcription Shimaa Ahmed Amrita Roy Chowdhury Kassem Fawaz P. Ramanathan 51 46 0 09 Sep 2019
A Quantum Search Decoder for Natural Language Processing Johannes Bausch Sathyawageeswar Subramanian Stephen Piddock 20 14 0 09 Sep 2019
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units Yujeong Choi Minsoo Rhu 6 127 0 06 Sep 2019
Harnessing the Power of Deep Learning Methods in Healthcare: Neonatal Pain Assessment from Crying Sound Md Sirajus Salekin Ghada Zamzami Rahul Paul Dmitry Goldgof R. Kasturi T. Ho Yu Sun 16 7 0 05 Sep 2019
Brain2Char: A Deep Architecture for Decoding Text from Brain Recordings Pengfei Sun Gopala K. Anumanchipalli E. Chang 11 56 0 03 Sep 2019
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning Joel Hestness Newsha Ardalani G. Diamos 13 66 0 03 Sep 2019
Metric Learning for Adversarial Robustness Chengzhi Mao Ziyuan Zhong Junfeng Yang Carl Vondrick Baishakhi Ray OOD 21 183 0 03 Sep 2019
Smaller Models, Better Generalization Mayank Sharma Suraj Tripathi Abhimanyu Dubey Jayadeva Jayadeva Sai Guruju Nihal Goalla 15 1 0 29 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning Pavel Denisov Ngoc Thang Vu 17 27 0 13 Aug 2019
Universal Adversarial Audio Perturbations Sajjad Abdoli L. G. Hafemann Jérôme Rony Ismail Ben Ayed P. Cardinal Alessandro Lameiras Koerich AAML 25 51 0 08 Aug 2019
Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems Lea Schonherr Thorsten Eisenhofer Steffen Zeiler Thorsten Holz D. Kolossa AAML 54 63 0 05 Aug 2019
Machine Learning at the Network Edge: A Survey M. G. Sarwar Murshed Chris Murphy Daqing Hou Nazar Khan Ganesh Ananthanarayanan Faraz Hussain 38 378 0 31 Jul 2019
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement Alzahra Badi Sangwook Park D. Han Hanseok Ko 16 6 0 26 Jul 2019
A system of different layers of abstraction for artificial intelligence Alexander Serb T. Prodromakis AI4CE 19 6 0 22 Jul 2019
A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing Alexandrou Serb I. Kobyzev Jiaqi Wang T. Prodromakis 4 3 0 12 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural text-to-speech V. Klimkov S. Ronanki Jonas Rohnke Thomas Drugman AI4TS 14 82 0 04 Jul 2019
Towards Interpretable Deep Extreme Multi-label Learning Yihuang Kang I-Ling Cheng W. Mao Bowen Kuo Pei-Ju Lee 11 0 0 03 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling Kshiteej S. Mahajan Arjun Balasubramanian Arjun Singhvi Shivaram Venkataraman Aditya Akella Amar Phanishayee Shuchi Chawla 12 182 0 02 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion Suyoun Kim Siddharth Dalmia Florian Metze 15 23 0 27 Jun 2019
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias Ryo Nakashima Ryo Ozaki T. Taniguchi 21 6 0 21 Jun 2019
On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks Masoumeh Shafieinejad Jiaqi Wang Nils Lukas Xinda Li Florian Kerschbaum AAML 25 8 0 18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability Antoine Caubrière N. Tomashenko Antoine Laurent Emmanuel Morin Nathalie Camelin Yannick Esteve 10 54 0 18 Jun 2019
Deep Xi as a Front-End for Robust Automatic Speech Recognition Aaron Nicolson K. Paliwal 11 12 0 18 Jun 2019
Perceptual Based Adversarial Audio Attacks Joseph Szurley J. Zico Kolter AAML 24 25 0 14 Jun 2019
Selfie: Self-supervised Pretraining for Image Embedding Trieu H. Trinh Minh-Thang Luong Quoc V. Le SSL 11 111 0 07 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation Udit Gupta Carole-Jean Wu Xiaodong Wang Maxim Naumov Brandon Reagen ... Andrey Malevich Dheevatsa Mudigere M. Smelyanskiy Liang Xiong Xuan Zhang GNN 44 290 0 06 Jun 2019
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness A. Malinin Mark Gales UQCV AAML 27 172 0 31 May 2019
Speaker Anonymization Using X-vector and Neural Waveform Models Fuming Fang Xin Wang Junichi Yamagishi Isao Echizen Massimiliano Todisco Nicholas W. D. Evans J. Bonastre 21 134 0 30 May 2019
Mixed Precision Training With 8-bit Floating Point Naveen Mellempudi Sudarshan Srinivasan Dipankar Das Bharat Kaul MQ 18 68 0 29 May 2019
Local Label Propagation for Large-Scale Semi-Supervised Learning Chengxu Zhuang Xuehao Ding Divyanshu Murli Daniel L. K. Yamins SSL 30 11 0 28 May 2019
NTP : A Neural Network Topology Profiler Raghavendra Bhat Pravin Chandran Juby Jose Viswanath Dibbur Prakash Sirra Ajith 19 2 0 22 May 2019