Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 750 papers shown

Title
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey Xiaoyu Zhang Chao Chen Yi Xie Xiaofeng Chen Jun Zhang Yang Xiang FedML 22 7 0 13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition Khin Me Me Chit Laet Laet Lin 24 3 0 13 May 2021
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in Commodity DRAM Sourjya Roy M. Ali A. Raghunathan 14 19 0 08 May 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect Binbin Xu Chongyang Tao Z. Feng Youssef Raqui Sylvie Ranwez 11 12 0 07 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence Emna Baccour N. Mhaisen A. Abdellatif A. Erbad Amr M. Mohamed Mounir Hamdi Mohsen Guizani 28 86 0 04 May 2021
On the limit of English conversational speech recognition Zoltán Tüske G. Saon Brian Kingsbury 22 50 0 03 May 2021
RotLSTM: Rotating Memories in Recurrent Neural Networks Vlad Velici Adam Prugel-Bennett RALM VLM 17 1 0 01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental Comparison Ahmed Aldahdooh W. Hamidouche Sid Ahmed Fezza Olivier Déforges AAML 11 122 0 01 May 2021
End-to-End Speech Recognition from Federated Acoustic Models Yan Gao Titouan Parcollet Salah Zaiem Javier Fernandez-Marques Pedro Porto Buarque de Gusmão Daniel J. Beutel Nicholas D. Lane 28 43 0 29 Apr 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization Ali Ramezani-Kebrya Fartash Faghri Ilya Markov V. Aksenov Dan Alistarh Daniel M. Roy MQ 65 30 0 28 Apr 2021
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head Qianyun Wang Zhenfeng Fan Shi-hong Xia 3DH 71 18 0 25 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing Wentao Chen Hailong Qiu Zhuang Jian Chutong Zhang Yu Hu Qing Lu Tianchen Wang Yiyu Shi Meiping Huang Xiaowe Xu 52 21 0 25 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network Janne Pylkkönen Antti Ukkonen Juho Kilpikoski Samu Tamminen Hannes Heikinheimo 18 27 0 22 Apr 2021
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems Mimansa Jaiswal E. Provost 26 0 0 18 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement Alexander Richard Michael Zollhoefer Yandong Wen Fernando de la Torre Yaser Sheikh CVBM 39 194 0 16 Apr 2021
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It Trung D. Q. Dang Om Thakkar Swaroop Indra Ramaswamy Rajiv Mathews Peter Chin Franccoise Beaufays FedML 30 10 0 15 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets Evelina Bakhturina Vitaly Lavrukhin Boris Ginsburg 22 12 0 11 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization Zhengkun Tian Jiangyan Yi Ye Bai J. Tao Shuai Zhang Zhengqi Wen 28 16 0 07 Apr 2021
Visual Alignment Constraint for Continuous Sign Language Recognition Yuecong Min Aiming Hao Xiujuan Chai Xilin Chen SLR 28 129 0 06 Apr 2021
Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems Akshat Gupta Olivia Deng Akruti Kushwaha Saloni Mittal William Zeng Sai Krishna Rallabandi A. Black 16 7 0 03 Apr 2021
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity and Model Smoothness Zhuolin Yang Linyi Li Xiaojun Xu Shiliang Zuo Qiang Chen Benjamin I. P. Rubinstein Pan Zhou Ce Zhang Bo-wen Li AAML 18 53 0 01 Apr 2021
Comparison of different convolutional neural network activation functions and methods for building ensembles L. Nanni Gianluca Maguolo S. Brahnam M. Paci 16 8 0 29 Mar 2021
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs R. Kaur Susmit Jha Anirban Roy O. Sokolsky Insup Lee 11 13 0 23 Mar 2021
Federated Quantum Machine Learning Samuel Yen-Chi Chen Shinjae Yoo FedML AI4CE 19 115 0 22 Mar 2021
Digital Peter: Dataset, Competition and Handwriting Recognition Methods M. Potanin Denis Dimitrov Alex Shonenkov Vladimir Bataev Denis Karachev Maxim Novopoltsev 21 9 0 16 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo Bonaventure F. P. Dossou Chris C. Emezue 26 12 0 13 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition Maurice Gerczuk Shahin Amiriparian Sandra Ottl Björn Schuller 38 55 0 10 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges Yoshitomo Matsubara Marco Levorato Francesco Restuccia 33 199 0 08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial Examples Shehzeen Samarah Hussain Paarth Neekhara Shlomo Dubnov Julian McAuley F. Koushanfar AAML 30 71 0 04 Mar 2021
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization HanQin Cai Y. Lou Daniel McKenzie W. Yin 27 40 0 21 Feb 2021
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation Elizabeth Fons Paula Dawson Xiao-Jun Zeng J. Keane Alexandros Iosifidis AI4TS 23 23 0 16 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition Priyabrata Karmakar S. Teng Guojun Lu 27 25 0 14 Feb 2021
Double-descent curves in neural networks: a new perspective using Gaussian processes Ouns El Harzli Bernardo Cuenca Grau Guillermo Valle Pérez A. Louis 20 6 0 14 Feb 2021
Learning Speech-driven 3D Conversational Gestures from Video I. Habibie Weipeng Xu Dushyant Mehta Lingjie Liu Hans-Peter Seidel Gerard Pons-Moll Mohamed A. Elgharib Christian Theobalt SLR CVBM 3DH 40 107 0 13 Feb 2021
Dompteur: Taming Audio Adversarial Examples Thorsten Eisenhofer Lea Schonherr Joel Frank Lars Speckemeier D. Kolossa Thorsten Holz AAML 33 24 0 10 Feb 2021
BembaSpeech: A Speech Recognition Corpus for the Bemba Language Claytone Sikasote Antonios Anastasopoulos 9 21 0 09 Feb 2021
Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models D. Nurseitov K. Bostanbekov Maksat Kanatov Anel N. Alimova Abdelrahman Abdallah Galymzhan Abdimanap 29 33 0 09 Feb 2021
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages Onno Eberhard Torsten Zesch 11 3 0 08 Feb 2021
A bandit approach to curriculum generation for automatic speech recognition Anastasia Kuznetsova Anurag Kumar Francis M. Tyers 11 1 0 06 Feb 2021
Audio Adversarial Examples: Attacks Using Vocal Masks Kai Yuan Tay Lynnette Hui Xian Ng Wei Han Chua Lucerne Loke Danqi Ye Melissa Chua AAML 18 0 0 04 Feb 2021
Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy James Mou Jun Li 11 3 0 03 Feb 2021
Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification A. K. Sarkar Md. Sahidullah Zheng-Hua Tan 7 0 0 03 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition Zhong Meng Naoyuki Kanda Yashesh Gaur S. Parthasarathy Eric Sun Liang Lu Xie Chen Jinyu Li Jiawei Liu AuLLM 41 52 0 02 Feb 2021
An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems A. Abdelmoniem Ahmed Elzanaty Mohamed-Slim Alouini Marco Canini 51 75 0 26 Jan 2021
Evaluating Models of Robust Word Recognition with Serial Reproduction Stephan C. Meylan Sathvik Nair Thomas L. Griffiths 22 4 0 24 Jan 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey Tailin Liang C. Glossner Lei Wang Shaobo Shi Xiaotong Zhang MQ 150 675 0 24 Jan 2021
Stable Recovery of Entangled Weights: Towards Robust Identification of Deep Neural Networks from Minimal Samples Christian Fiedler M. Fornasier T. Klock Michael Rauchensteiner OOD 22 12 0 18 Jan 2021
Black-box Adversarial Attacks on Monocular Depth Estimation Using Evolutionary Multi-objective Optimization Renya Daimo S. Ono Takahiro Suzuki AAML MDE 6 4 0 29 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition Minglun Han Linhao Dong Shiyu Zhou Bo Xu 13 21 0 17 Dec 2020
HeadGAN: One-shot Neural Head Synthesis and Editing M. Doukas S. Zafeiriou V. Sharmanska CVBM 3DH 27 125 0 15 Dec 2020