Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 750 papers shown

Title
Text-To-Speech Data Augmentation for Low Resource Speech Recognition Rodolfo Zevallos 19 4 0 01 Apr 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax Jaesong Lee Lukas Lee Shinji Watanabe 25 8 0 31 Mar 2022
An Empirical Study of Language Model Integration for Transducer based Speech Recognition Huahuan Zheng Keyu An Zhijian Ou Chen Huang Ke Ding Guanglu Wan 27 5 0 31 Mar 2022
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages? Priyanshi Shah Harveen Singh Chadha Anirudh Gupta Ankur Dhuriya Neeraj Chhimwal Rishabh Gaur Vivek Raghavan 31 1 0 30 Mar 2022
Improving Speech Recognition for Indic Languages using Language Model Ankur Dhuriya Harveen Singh Chadha Anirudh Gupta Priyanshi Shah Neeraj Chhimwal Rishabh Gaur Vivek Raghavan 17 2 0 30 Mar 2022
4-bit Conformer with Native Quantization Aware Training for Speech Recognition Shaojin Ding Phoenix Meadowlark Yanzhang He Lukasz Lew Shivani Agrawal Oleg Rybakov MQ 31 32 0 29 Mar 2022
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data Chen Chen Nana Hou Yuchen Hu Shashank Shirol Chng Eng Siong NoLa 14 43 0 29 Mar 2022
WaveFuzz: A Clean-Label Poisoning Attack to Protect Your Voice Yunjie Ge Qianqian Wang Jingfeng Zhang Juntao Zhou Yunzhu Zhang Chao Shen AAML 20 6 0 25 Mar 2022
Learning by non-interfering feedback chemical signaling in physical networks Vidyesh Rao Anisetti B. Scellier J. M. Schwarz 11 17 0 22 Mar 2022
Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition Marie Biolková Bac Nguyen AAML 33 2 0 18 Mar 2022
Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness Tejas Gokhale Swaroop Mishra Man Luo Bhavdeep Singh Sachdeva Chitta Baral 52 29 0 15 Mar 2022
aaeCAPTCHA: The Design and Implementation of Audio Adversarial CAPTCHA Md. Imran Hossen X. Hei 31 4 0 05 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition Hemant Yadav Sunayana Sitaram 24 35 0 25 Feb 2022
Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey Ngoc Dung Huynh Mohamed Reda Bouadjenek Imran Razzak Kevin Lee Chetan Arora Ali Hassani A. Zaslavsky AAML 29 6 0 22 Feb 2022
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments Mario Esparza 24 0 0 21 Feb 2022
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines Alexander Isenko R. Mayer Jeffrey Jedele Hans-Arno Jacobsen 19 23 0 17 Feb 2022
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition Chao-Han Huck Yang Zeeshan Ahmed Yile Gu Joseph Szurley Roger Ren Linda Liu A. Stolcke I. Bulyko AAML 21 3 0 17 Feb 2022
Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference G. Paulin Francesco Conti Lukas Cavigelli Luca Benini 24 8 0 14 Feb 2022
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy H.C.M. Turner Giulio Lovisotto Simon Eberz Ivan Martinovic 13 1 0 13 Feb 2022
FAAG: Fast Adversarial Audio Generation through Interactive Attack Optimisation Yuantian Miao Chao Chen Lei Pan Jun Zhang Yang Xiang AAML 20 2 0 11 Feb 2022
Convergence of a New Learning Algorithm Feng Lin 3DV 16 0 0 08 Feb 2022
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian P. Mihajlik A. Balog T. E. Gráczi A. Kohári Balázs Tarján K. Mády 25 8 0 01 Feb 2022
Visualizing Automatic Speech Recognition -- Means for a Better Understanding? Karla Markert Romain Parracone Mykhailo Kulakov Philip Sperl Ching-yu Kao Konstantin Böttinger 19 8 0 01 Feb 2022
Language Dependencies in Adversarial Attacks on Speech Recognition Systems Karla Markert Donika Mirdita Konstantin Böttinger AAML SILM 19 4 0 01 Feb 2022
Unicorn: Reasoning about Configurable System Performance through the lens of Causality Md Shahriar Iqbal R. Krishna Mohammad Ali Javidian Baishakhi Ray Pooyan Jamshidi LRM 26 28 0 20 Jan 2022
iDECODe: In-distribution Equivariance for Conformal Out-of-distribution Detection R. Kaur Susmit Jha Anirban Roy Sangdon Park Yan Sun O. Sokolsky Insup Lee OODD 19 45 0 07 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward Ruben Cartuyvels Graham Spinks Marie-Francine Moens OCL 33 20 0 04 Jan 2022
Multi-Dialect Arabic Speech Recognition Abbas Raza Ali 14 15 0 25 Dec 2021
Parameter identifiability of a deep feedforward ReLU neural network Joachim Bona-Pellissier François Bachoc François Malgouyres 41 15 0 24 Dec 2021
Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion S. Agarwal Liwen Hu Evonne Ng Trevor Darrell Hao Li Anna Rohrbach AAML 31 19 0 21 Dec 2021
ImportantAug: a data augmentation agent for speech V. Trinh Hassan Salami Kavaki Michael I. Mandel 27 10 0 14 Dec 2021
Real-Time Neural Voice Camouflage Mia Chiquier Chengzhi Mao Carl Vondrick 27 6 0 14 Dec 2021
Detecting Audio Adversarial Examples with Logit Noising N. Park Sangwoo Ji Jong Kim AAML 30 5 0 13 Dec 2021
Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications Yongqiang Tian Wuqi Zhang Ming Wen Shing-Chi Cheung Chengnian Sun Shiqing Ma Yu Jiang 29 7 0 06 Dec 2021
Joint Audio-Text Model for Expressive Speech-Driven 3D Facial Animation Yingruo Fan Zhaojiang Lin Jun Saito Wenping Wang Taku Komura 31 21 0 04 Dec 2021
Catch Me If You Can: Blackbox Adversarial Attacks on Automatic Speech Recognition using Frequency Masking Xiao-lan Wu A. Rajan AAML 19 4 0 03 Dec 2021
Transformer-S2A: Robust and Efficient Speech-to-Animation Liyang Chen Zhiyong Wu Jun Ling Runnan Li Xu Tan Sheng Zhao 29 18 0 18 Nov 2021
A Survey on Adversarial Attacks for Malware Analysis Kshitiz Aryal Maanak Gupta Mahmoud Abdelsalam AAML 34 49 0 16 Nov 2021
Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception Joel Dapello J. Feather Hang Le Tiago Marques David D. Cox Josh H. McDermott J. DiCarlo SueYeon Chung AAML OOD 19 25 0 12 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition Jinyu Li VLM 35 363 0 02 Nov 2021
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition Evangelos Kazakos Jaesung Huh Arsha Nagrani Andrew Zisserman Dima Damen EgoV 50 45 0 01 Nov 2021
Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face Synthesis Haozhe Wu Jia Jia Haoyu Wang Yishun Dou Chao Duan Qingshan Deng CVBM 11 73 0 30 Oct 2021
TorchAudio: Building Blocks for Audio and Speech Processing Yao-Yuan Yang Moto Hira Zhaoheng Ni Anjali Chourdia Artyom Astafurov ... Sean Narenthiran Shinji Watanabe Soumith Chintala Vincent Quenneville-Bélair Yangyang Shi 31 165 0 28 Oct 2021
Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs H. Abdullah Muhammad Sajidur Rahman Christian Peeters Cassidy Gibson Washington Garcia Vincent Bindschaedler T. Shrimpton Patrick Traynor AAML 19 9 0 25 Oct 2021
Deep Neural Networks on EEG Signals to Predict Auditory Attention Score Using Gramian Angular Difference Field Mahak Kothari Shreyansh Joshi Adarsh Nandanwar Aadetya Jaiswal V. Baths 15 1 0 24 Oct 2021
Asynchronous Decentralized Distributed Training of Acoustic Models Xiaodong Cui Wei Zhang Abdullah Kayi Mingrui Liu Ulrich Finkler Brian Kingsbury G. Saon David S. Kung 32 3 0 21 Oct 2021
Activation Landscapes as a Topological Summary of Neural Network Performance Matthew Wheeler Jose J. Bouza Peter Bubenik 34 19 0 19 Oct 2021
Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition Haozhe Chen Weiming Zhang Kunlin Liu Kejiang Chen Han Fang Nenghai Yu 19 4 0 19 Oct 2021
Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information Baolin Zheng Peipei Jiang Qian Wang Qi Li Chao Shen Cong Wang Yunjie Ge Qingyang Teng Shenyi Zhang AAML 18 69 0 19 Oct 2021
Intent Classification Using Pre-trained Language Agnostic Embeddings For Low Resource Languages Hemant Yadav Akshat Gupta Sai Krishna Rallabandi A. Black R. Shah 11 0 0 18 Oct 2021