Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 751 papers shown

NTP : A Neural Network Topology Profiler Raghavendra Bhat Pravin Chandran Juby Jose Viswanath Dibbur Prakash Sirra Ajith 24 2 0 22 May 2019
Acoustic-to-Word Models with Conversational Context Information Suyoun Kim Florian Metze 22 7 0 21 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems Paarth Neekhara Shehzeen Samarah Hussain Prakhar Pandey Shlomo Dubnov Julian McAuley F. Koushanfar AAML 20 113 0 09 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles Daniel Cudeiro Timo Bolkart Cassidy Laidlaw Anurag Ranjan Michael J. Black CVBM 3DH 53 338 0 08 May 2019
Transparent pronunciation scoring using articulatorily weighted phoneme edit distance Reima Karhila Anna-Riikka Smolander Sari Ylinen M. Kurimo 14 13 0 07 May 2019
Ensemble Distribution Distillation A. Malinin Bruno Mlodozeniec Mark Gales UQCV 27 231 0 30 Apr 2019
Unsupervised Data Augmentation for Consistency Training Qizhe Xie Zihang Dai Eduard H. Hovy Minh-Thang Luong Quoc V. Le 61 2,290 0 29 Apr 2019
Transformers with convolutional context for ASR Abdel-rahman Mohamed Dmytro Okhonko Luke Zettlemoyer 11 168 0 26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition Daniel S. Park William Chan Yu Zhang Chung-Cheng Chiu Barret Zoph E. D. Cubuk Quoc V. Le VLM 8 3,412 0 18 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation Gakuto Kurata Kartik Audhkhasi 16 46 0 17 Apr 2019
Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models Yehao Kong Jiliang Zhang 11 26 0 08 Apr 2019
Measuring scheduling efficiency of RNNs for NLP applications Urmish Thakker Ganesh S. Dasika Jesse G. Beu Matthew Mattina 27 13 0 05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions Awni Y. Hannun Ann Lee Qiantong Xu R. Collobert 28 95 0 04 Apr 2019
RAPID: Early Classification of Explosive Transients using Deep Learning D. Muthukrishna G. Narayan K. Mandel R. Biswas R. Hložek 26 106 0 29 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings Chengxu Zhuang Alex Zhai Daniel L. K. Yamins SSL 44 444 0 29 Mar 2019
Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation Elizaveta Korotkova Agnes Luhtaru Maksym Del Krista Liin Daiga Deksne Mark Fishel 22 11 0 27 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition Shiliang Zhang Ming Lei Zhijie Yan 22 15 0 27 Mar 2019
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems H. Abdullah Washington Garcia Christian Peeters Patrick Traynor Kevin R. B. Butler Joseph N. Wilson AAML 17 165 0 18 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model Yangyang Shi M. Hwang X. Lei AI4TS 22 14 0 12 Mar 2019
Source codes in human communication Michael Ramscar 6 11 0 08 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos Egor Lakomkin S. Magg C. Weber S. Wermter 18 19 0 01 Mar 2019
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis Egor Lakomkin M. Zamani C. Weber S. Magg S. Wermter 25 21 0 28 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting Justice Amoh K. Odame 26 17 0 13 Feb 2019
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications Peifeng Yu Mosharaf Chowdhury 10 72 0 12 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM Hongxu Yin Guoyang Chen Yingmin Li Shuai Che Weifeng Zhang N. Jha 36 10 0 30 Jan 2019
Weighted-Sampling Audio Adversarial Example Attack Xiaolei Liu Xiaosong Zhang Kun Wan Qingxin Zhu Yufei Ding DiffM AAML 36 36 0 26 Jan 2019
SirenAttack: Generating Adversarial Audio for End-to-End Acoustic Systems Tianyu Du S. Ji Jinfeng Li Qinchen Gu Ting Wang R. Beyah AAML 8 127 0 23 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition Julian Salazar Katrin Kirchhoff Zhiheng Huang AI4TS 19 117 0 22 Jan 2019
Robust Watermarking of Neural Network with Exponential Weighting Ryota Namba Jun Sakuma AAML 20 137 0 18 Jan 2019
Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data Harshita Seth Pulkit Kumar Muktabh Mayank Srivastava 8 12 0 12 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units Amit Das Jinyu Li Guoli Ye Rui Zhao Jiawei Liu 13 26 0 31 Dec 2018
Stanza: Layer Separation for Distributed Training in Deep Learning Xiaorui Wu Hongao Xu Bo Li Y. Xiong MoE 20 9 0 27 Dec 2018
A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples Qiang Zeng Jianhai Su Chenglong Fu Golam Kayas Lannan Luo AAML 27 46 0 26 Dec 2018
wav2letter++: The Fastest Open-source Speech Recognition System Vineel Pratap Awni Y. Hannun Qiantong Xu Jeff Cai Jacob Kahn Gabriel Synnaeve Vitaliy Liptchinsky R. Collobert VLM 20 156 0 18 Dec 2018
DeepCruiser: Automated Guided Testing for Stateful Deep Learning Systems Xiaoning Du Xiaofei Xie Yi Li Lei Ma Jianjun Zhao Yang Liu 24 38 0 13 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings Matthew Wiesner Adithya Renduchintala Shinji Watanabe Shuoyang Ding Najim Dehak Sanjeev Khudanpur 21 32 0 10 Dec 2018
Prior Networks for Detection of Adversarial Attacks A. Malinin Mark Gales AAML 22 5 0 06 Dec 2018
Layer Flexible Adaptive Computational Time Lida Zhang Abdolghani Ebrahimi Diego Klabjan AI4CE 36 1 0 06 Dec 2018
Overcoming Catastrophic Forgetting by Soft Parameter Pruning Jian-wei Peng Jiang Hao Zhuo Li Enqiang Guo X. Wan Min Deng Qing Zhu Haifeng Li CLL 20 4 0 04 Dec 2018
Effects of Loss Functions And Target Representations on Adversarial Robustness Sean Saito S. Roy AAML 11 7 0 01 Dec 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition Jan Kremer Lasse Borgholt Lars Maaløe 34 6 0 28 Nov 2018
Adversarial Machine Learning And Speech Emotion Recognition: Utilizing Generative Adversarial Networks For Robustness S. Latif R. Rana Junaid Qadir GAN AAML 24 42 0 28 Nov 2018
Improved Frequency Modulation Features for Multichannel Distant Speech Recognition I. Rodomagoulakis Petros Maragos 11 7 0 23 Nov 2018
Strong mixed-integer programming formulations for trained neural networks Ross Anderson Joey Huchette Christian Tjandraatmadja J. Vielma 19 251 0 20 Nov 2018
Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues Yuan Gong C. Poellabauer AAML 11 27 0 16 Nov 2018
Streaming End-to-end Speech Recognition For Mobile Devices Yanzhang He Tara N. Sainath Rohit Prabhavalkar Ian McGraw R. Álvarez ... K. Sim Tom Bagby Shuo-yiin Chang Kanishka Rao A. Gruenstein 42 624 0 15 Nov 2018
Automatic Grammar Augmentation for Robust Voice Command Recognition Yang Yang Anusha Lalitha Jinwon Lee Chris Lott 21 3 0 14 Nov 2018
RNNFast: An Accelerator for Recurrent Neural Networks Using Domain Wall Memory Mohammad Hossein Samavatian Anys Bacha Li Zhou R. Teodorescu 22 7 0 07 Nov 2018
Adversarial Black-Box Attacks on Automatic Speech Recognition Systems using Multi-Objective Evolutionary Optimization Shreya Khare Rahul Aralikatte Senthil Mani AAML 11 14 0 04 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation Jason Chun Lok Li R. Gadde Boris Ginsburg Vitaly Lavrukhin 8 54 0 02 Nov 2018