arXiv:1610.05256
Achieving Human Parity in Conversational Speech Recognition
17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
Papers citing
"Achieving Human Parity in Conversational Speech Recognition"
50 / 67 papers shown
Title
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling
Michael McGuire
58
0
0
10 Mar 2025
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
Manish Dhakal
Arman Chhetri
Aman Kumar Gupta
Prabin B. Lamichhane
S. Pandey
S. Shakya
AI4TS
35
10
0
25 Jun 2024
Tag and correct: high precision post-editing approach to correction of speech recognition errors
Tomasz Ziętkiewicz
31
0
0
11 Jun 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
35
3
0
20 Dec 2023
Beating Backdoor Attack at Its Own Game
Min Liu
Alberto L. Sangiovanni-Vincentelli
Xiangyu Yue
AAML
65
11
0
28 Jul 2023
Multilingual Word Error Rate Estimation: e-WER3
Shammur A. Chowdhury
Ahmed M. Ali
24
7
0
02 Apr 2023
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing
Xianxian Zeng
Yanjun Zheng
22
2
0
20 Mar 2023
Factual Consistency Oriented Speech Recognition
Naoyuki Kanda
Takuya Yoshioka
Yang Liu
43
0
0
24 Feb 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition
Colin S. Lea
Zifang Huang
Lauren Tooley
Jaya Narain
Dianna Yee
P. Georgiou
Dung Tien Tran
Jeffrey P. Bigham
Leah Findlater
32
31
0
17 Feb 2023
A Survey of Robust Adversarial Training in Pattern Recognition: Fundamental, Theory, and Methodologies
Zhuang Qian
Kaizhu Huang
Qiufeng Wang
Xu-Yao Zhang
OOD
AAML
ObjD
49
72
0
26 Mar 2022
DeepSketch: A New Machine Learning-Based Reference Search Technique for Post-Deduplication Delta Compression
Jisung Park
Jeoggyun Kim
Yeseong Kim
Sungjin Lee
O. Mutlu
13
23
0
17 Feb 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
36
90
0
05 Jan 2022
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories
Adnan Siraj Rakin
Md Hafizul Islam Chowdhuryy
Fan Yao
Deliang Fan
AAML
MIACV
42
110
0
08 Nov 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
33
10
0
13 Jun 2021
On Feature Decorrelation in Self-Supervised Learning
Tianyu Hua
Wenxiao Wang
Zihui Xue
Sucheng Ren
Yue Wang
Hang Zhao
SSL
OOD
133
187
0
02 May 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation
G. Sun
C. Zhang
P. Woodland
76
32
0
12 Feb 2021
Dompteur: Taming Audio Adversarial Examples
Thorsten Eisenhofer
Lea Schonherr
Joel Frank
Lars Speckemeier
D. Kolossa
Thorsten Holz
AAML
36
24
0
10 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
274
326
0
24 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
38
112
0
19 Jan 2021
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA
Adnan Siraj Rakin
Yukui Luo
Xiaolin Xu
Deliang Fan
AAML
25
49
0
05 Nov 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
34
79
0
17 Sep 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
39
9
0
14 Jun 2020
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
31
9
0
16 May 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips
Fan Yao
Adnan Siraj Rakin
Deliang Fan
AAML
18
154
0
30 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
Théodore Bluche
Maël Primet
Thibault Gisselbrecht
ObjD
MQ
26
24
0
25 Feb 2020
A simple way to make neural networks robust against diverse image corruptions
E. Rusak
Lukas Schott
Roland S. Zimmermann
Julian Bitterwolf
Oliver Bringmann
Matthias Bethge
Wieland Brendel
21
64
0
16 Jan 2020
Predicting detection filters for small footprint open-vocabulary keyword spotting
Théodore Bluche
Thibault Gisselbrecht
ObjD
18
19
0
16 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited Data
Xinyun Chen
Wenxiao Wang
Chris Bender
Yiming Ding
R. Jia
Bo-wen Li
D. Song
AAML
27
106
0
17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Ching-Feng Yeh
Jay Mahadeokar
Kaustubh Kalgaonkar
Yongqiang Wang
Duc Le
Mahaveer Jain
Kjell Schubert
Christian Fuegen
M. Seltzer
27
147
0
28 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Shahram Ghorbani
S. Khorram
John H. L. Hansen
29
18
0
01 Oct 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
34
205
0
16 Aug 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
Naoyuki Kanda
Christoph Boeddeker
Jens Heitkaemper
Yusuke Fujita
Shota Horiguchi
Kenji Nagamatsu
Reinhold Häb-Umbach
23
61
0
29 May 2019
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Joshua Y. Kim
Chunfeng Liu
R. Calvo
K. McCabe
Silas C. R. Taylor
Björn W. Schuller
Kaihang Wu
26
38
0
29 Apr 2019
Natural Language Interactions in Autonomous Vehicles: Intent Detection and Slot Filling from Passenger Utterances
Eda Okur
Shachi H. Kumar
Saurav Sahay
Asli Arslan Esme
L. Nachman
13
19
0
23 Apr 2019
Disfluencies and Human Speech Transcription Errors
Vicky Zayats
Trang Tran
Richard A. Wright
Courtney Mansfield
Mari Ostendorf
26
37
0
08 Apr 2019
Neural network gradient-based learning of black-box function interfaces
Alon Jacovi
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Jonathan Berant
18
13
0
13 Jan 2019
Feature Extraction for Temporal Signal Recognition: An Overview
Imad Rida
22
12
0
03 Dec 2018
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Jing Shi
Jiaming Xu
Yiqun Yao
Bo Xu
33
24
0
15 Nov 2018
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Xinpei Zhou
Jiwei Li
Xi Zhou
25
3
0
29 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
30
81
0
08 Oct 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
24
23
0
19 Sep 2018
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition
Pavel Denisov
Ngoc Thang Vu
Marc Ferras
8
18
0
30 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
20
96
0
10 Jul 2018
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces
A. Coucke
Alaa Saade
Adrien Ball
Théodore Bluche
A. Caulier
...
Thibault Gisselbrecht
F. Caltagirone
Thibaut Lavril
Maël Primet
Joseph Dureau
SyDa
70
812
0
25 May 2018
Estimate and Replace: A Novel Approach to Integrating Deep Neural Networks with Existing Applications
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Alon Jacovi
AI4TS
17
42
0
24 Apr 2018
Low-Precision Floating-Point Schemes for Neural Network Training
Marc Ortiz
A. Cristal
Eduard Ayguadé
Marc Casas
MQ
30
22
0
14 Apr 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker
Shinji Watanabe
Emmanuel Vincent
J. Trmal
20
678
0
28 Mar 2018
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
Lirong Dai
21
108
0
04 Mar 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
Siddharth Dalmia
Ramon Sanabria
Florian Metze
A. Black
29
94
0
21 Feb 2018
Learning Combinations of Activation Functions
Franco Manessi
A. Rozza
AI4CE
26
54
0
29 Jan 2018