Achieving Human Parity in Conversational Speech Recognition

17 October 2016

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 67 papers shown

Title
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling Michael McGuire 58 0 0 10 Mar 2025
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Manish Dhakal Arman Chhetri Aman Kumar Gupta Prabin B. Lamichhane S. Pandey S. Shakya AI4TS 35 10 0 25 Jun 2024
Tag and correct: high precision post-editing approach to correction of speech recognition errors Tomasz Ziętkiewicz 31 0 0 11 Jun 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models A. Ogawa Naohiro Tawara Marc Delcroix S. Araki 35 3 0 20 Dec 2023
Beating Backdoor Attack at Its Own Game Min Liu Alberto L. Sangiovanni-Vincentelli Xiangyu Yue AAML 65 11 0 28 Jul 2023
Multilingual Word Error Rate Estimation: e-WER3 Shammur A. Chowdhury Ahmed M. Ali 24 7 0 02 Apr 2023
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing Xianxian Zeng Yanjun Zheng 22 2 0 20 Mar 2023
Factual Consistency Oriented Speech Recognition Naoyuki Kanda Takuya Yoshioka Yang Liu 43 0 0 24 Feb 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition Colin S. Lea Zifang Huang Lauren Tooley Jaya Narain Dianna Yee P. Georgiou Dung Tien Tran Jeffrey P. Bigham Leah Findlater 32 31 0 17 Feb 2023
A Survey of Robust Adversarial Training in Pattern Recognition: Fundamental, Theory, and Methodologies Zhuang Qian Kaizhu Huang Qiufeng Wang Xu-Yao Zhang OOD AAML ObjD 49 72 0 26 Mar 2022
DeepSketch: A New Machine Learning-Based Reference Search Technique for Post-Deduplication Delta Compression Jisung Park Jeoggyun Kim Yeseong Kim Sungjin Lee O. Mutlu 13 23 0 17 Feb 2022
Robust Self-Supervised Audio-Visual Speech Recognition Bowen Shi Wei-Ning Hsu Abdel-rahman Mohamed 36 90 0 05 Jan 2022
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories Adnan Siraj Rakin Md Hafizul Islam Chowdhuryy Fan Yao Deliang Fan AAML MIACV 42 110 0 08 Nov 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition Shih-Hsuan Chiu Tien-Hong Lo Fu-An Chao Berlin Chen BDL 33 10 0 13 Jun 2021
On Feature Decorrelation in Self-Supervised Learning Tianyu Hua Wenxiao Wang Zihui Xue Sucheng Ren Yue Wang Hang Zhao SSL OOD 133 187 0 02 May 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation G. Sun C. Zhang P. Woodland 76 32 0 12 Feb 2021
Dompteur: Taming Audio Adversarial Examples Thorsten Eisenhofer Lea Schonherr Joel Frank Lars Speckemeier D. Kolossa Thorsten Holz AAML 36 24 0 10 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning Tae Jin Park Naoyuki Kanda Dimitrios Dimitriadis Kyu Jeong Han Shinji Watanabe Shrikanth Narayanan VLM 274 326 0 24 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data Chengyi Wang Yu-Huan Wu Yao Qian K. Kumatani Shujie Liu Furu Wei Michael Zeng Xuedong Huang OT SSL 38 112 0 19 Jan 2021
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA Adnan Siraj Rakin Yukui Luo Xiaolin Xu Deliang Fan AAML 25 49 0 05 Nov 2020
Review: Deep Learning in Electron Microscopy Jeffrey M. Ede 34 79 0 17 Sep 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 39 9 0 14 Jun 2020
Large scale weakly and semi-supervised learning for low-resource video ASR Kritika Singh Vimal Manohar Alex Xiao Sergey Edunov Ross B. Girshick Vitaliy Liptchinsky Christian Fuegen Yatharth Saraf Geoffrey Zweig Abdel-rahman Mohamed 31 9 0 16 May 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips Fan Yao Adnan Siraj Rakin Deliang Fan AAML 18 154 0 30 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks Théodore Bluche Maël Primet Thibault Gisselbrecht ObjD MQ 26 24 0 25 Feb 2020
A simple way to make neural networks robust against diverse image corruptions E. Rusak Lukas Schott Roland S. Zimmermann Julian Bitterwolf Oliver Bringmann Matthias Bethge Wieland Brendel 21 64 0 16 Jan 2020
Predicting detection filters for small footprint open-vocabulary keyword spotting Théodore Bluche Thibault Gisselbrecht ObjD 18 19 0 16 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited Data Xinyun Chen Wenxiao Wang Chris Bender Yiming Ding R. Jia Bo-wen Li D. Song AAML 27 106 0 17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention Ching-Feng Yeh Jay Mahadeokar Kaustubh Kalgaonkar Yongqiang Wang Duc Le Mahaveer Jain Kjell Schubert Christian Fuegen M. Seltzer 27 147 0 28 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition Shahram Ghorbani S. Khorram John H. L. Hansen 29 18 0 01 Oct 2019
Survey on Deep Neural Networks in Speech and Vision Systems M. Alam Manar D. Samad Lasitha Vidyaratne Alexander M. Glandon Khan M. Iftekharuddin 3DV VLM AI4TS 34 205 0 16 Aug 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR Naoyuki Kanda Christoph Boeddeker Jens Heitkaemper Yusuke Fujita Shota Horiguchi Kenji Nagamatsu Reinhold Häb-Umbach 23 61 0 29 May 2019
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech Joshua Y. Kim Chunfeng Liu R. Calvo K. McCabe Silas C. R. Taylor Björn W. Schuller Kaihang Wu 26 38 0 29 Apr 2019
Natural Language Interactions in Autonomous Vehicles: Intent Detection and Slot Filling from Passenger Utterances Eda Okur Shachi H. Kumar Saurav Sahay Asli Arslan Esme L. Nachman 13 19 0 23 Apr 2019
Disfluencies and Human Speech Transcription Errors Vicky Zayats Trang Tran Richard A. Wright Courtney Mansfield Mari Ostendorf 26 37 0 08 Apr 2019
Neural network gradient-based learning of black-box function interfaces Alon Jacovi Guy Hadash Einat Kermany Boaz Carmeli Ofer Lavi George Kour Jonathan Berant 18 13 0 13 Jan 2019
Feature Extraction for Temporal Signal Recognition: An Overview Imad Rida 22 12 0 03 Dec 2018
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks Jing Shi Jiaming Xu Yiqun Yao Bo Xu 33 24 0 15 Nov 2018
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition Xinpei Zhou Jiwei Li Xi Zhou 25 3 0 29 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks Takuya Yoshioka Hakan Erdogan Zhuo Chen Xiong Xiao F. Alleva BDL 30 81 0 08 Oct 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm Shuxin Zheng Qi Meng Huishuai Zhang Wei-neng Chen Nenghai Yu Tie-Yan Liu 24 23 0 19 Sep 2018
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition Pavel Denisov Ngoc Thang Vu Marc Ferras 8 18 0 30 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition Chun-Fu Chen Quanfu Fan Neil Rohit Mallinar Tom Sercu Rogerio Feris 20 96 0 10 Jul 2018
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces A. Coucke Alaa Saade Adrien Ball Théodore Bluche A. Caulier ... Thibault Gisselbrecht F. Caltagirone Thibaut Lavril Maël Primet Joseph Dureau SyDa 70 812 0 25 May 2018
Estimate and Replace: A Novel Approach to Integrating Deep Neural Networks with Existing Applications Guy Hadash Einat Kermany Boaz Carmeli Ofer Lavi George Kour Alon Jacovi AI4TS 17 42 0 24 Apr 2018
Low-Precision Floating-Point Schemes for Neural Network Training Marc Ortiz A. Cristal Eduard Ayguadé Marc Casas MQ 30 22 0 14 Apr 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines Jon Barker Shinji Watanabe Emmanuel Vincent J. Trmal 20 678 0 28 Mar 2018
Deep-FSMN for Large Vocabulary Continuous Speech Recognition Shiliang Zhang Ming Lei Zhijie Yan Lirong Dai 21 108 0 04 Mar 2018
Sequence-based Multi-lingual Low Resource Speech Recognition Siddharth Dalmia Ramon Sanabria Florian Metze A. Black 29 94 0 21 Feb 2018
Learning Combinations of Activation Functions Franco Manessi A. Rozza AI4CE 26 54 0 29 Jan 2018