ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.02136
  4. Cited By
English Conversational Telephone Speech Recognition by Humans and
  Machines

English Conversational Telephone Speech Recognition by Humans and Machines

6 March 2017
G. Saon
Gakuto Kurata
Tom Sercu
Kartik Audhkhasi
Samuel Thomas
Dimitrios Dimitriadis
Xiaodong Cui
Bhuvana Ramabhadran
M. Picheny
L. Lim
Bergul Roomi
Phil Hall
ArXivPDFHTML

Papers citing "English Conversational Telephone Speech Recognition by Humans and Machines"

41 / 41 papers shown
Title
Lattice Rescoring Based on Large Ensemble of Complementary Neural
  Language Models
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
27
3
0
20 Dec 2023
Soft Random Sampling: A Theoretical and Empirical Analysis
Soft Random Sampling: A Theoretical and Empirical Analysis
Xiaodong Cui
Ashish R. Mittal
Songtao Lu
Wei Zhang
G. Saon
Brian Kingsbury
36
1
0
21 Nov 2023
Multilingual Word Error Rate Estimation: e-WER3
Multilingual Word Error Rate Estimation: e-WER3
Shammur A. Chowdhury
Ahmed M. Ali
16
7
0
02 Apr 2023
Enhancing and Adversarial: Improve ASR with Speaker Labels
Enhancing and Adversarial: Improve ASR with Speaker Labels
Wei Zhou
Haotian Wu
Jingjing Xu
Mohammad Zeineldeen
Christoph Luscher
Ralf Schluter
Hermann Ney
21
8
0
11 Nov 2022
Effect and Analysis of Large-scale Language Model Rescoring on
  Competitive ASR Systems
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Takuma Udagawa
Masayuki Suzuki
Gakuto Kurata
N. Itoh
G. Saon
34
23
0
01 Apr 2022
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech
  Representations
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations
L. D. Prasad
Sreyan Ghosh
S. Umesh
19
12
0
31 Mar 2022
Language Adaptive Cross-lingual Speech Representation Learning with
  Sparse Sharing Sub-networks
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Yizhou Lu
Mingkun Huang
Xinghua Qu
Pengfei Wei
Zejun Ma
21
19
0
09 Mar 2022
Adversarial Attacks on Speech Recognition Systems for Mission-Critical
  Applications: A Survey
Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Imran Razzak
Kevin Lee
Chetan Arora
Ali Hassani
A. Zaslavsky
AAML
23
6
0
22 Feb 2022
Investigation of Data Augmentation Techniques for Disordered Speech
  Recognition
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Mengzhe Geng
Xurong Xie
Shansong Liu
Jianwei Yu
Shoukang Hu
Xunying Liu
H. Meng
8
55
0
14 Jan 2022
Data augmentation through multivariate scenario forecasting in Data
  Centers using Generative Adversarial Networks
Data augmentation through multivariate scenario forecasting in Data Centers using Generative Adversarial Networks
J. Pérez
Patricia Arroba
Jose M. Moya
24
14
0
12 Jan 2022
4-bit Quantization of LSTM-based Speech Recognition Models
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
23
21
0
27 Aug 2021
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Reducing Exposure Bias in Training Recurrent Neural Network Transducers
Xiaodong Cui
Brian Kingsbury
G. Saon
David Haws
Zoltán Tüske
11
5
0
24 Aug 2021
Voice2Series: Reprogramming Acoustic Models for Time Series
  Classification
Voice2Series: Reprogramming Acoustic Models for Time Series Classification
Chao-Han Huck Yang
Yun-Yun Tsai
Pin-Yu Chen
AI4TS
23
122
0
17 Jun 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional
  Networks for Conversational Speech Recognition
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
27
10
0
13 Jun 2021
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of
  Transcribed Audio
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
Guoguo Chen
Shuzhou Chai
Guan-Bo Wang
Jiayu Du
Weiqiang Zhang
...
Xuchen Yao
Yongqing Wang
Yujun Wang
Zhao You
Zhiyong Yan
34
348
0
13 Jun 2021
On the limit of English conversational speech recognition
On the limit of English conversational speech recognition
Zoltán Tüske
G. Saon
Brian Kingsbury
19
50
0
03 May 2021
Dataset Condensation with Gradient Matching
Dataset Condensation with Gradient Matching
Bo-Lu Zhao
Konda Reddy Mopuri
Hakan Bilen
DD
30
472
0
10 Jun 2020
Learning not to Discriminate: Task Agnostic Learning for Improving
  Monolingual and Code-switched Speech Recognition
Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Gurunath Reddy Madhumani
Sanket Shah
Basil Abraham
Vikas Joshi
Sunayana Sitaram
10
7
0
09 Jun 2020
AccentDB: A Database of Non-Native English Accents to Assist Neural
  Speech Recognition
AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition
Afroz Ahamad
Ankit Anand
Pranesh Bhargava
16
22
0
16 May 2020
Large scale weakly and semi-supervised learning for low-resource video
  ASR
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
23
9
0
16 May 2020
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for
  Unsegmented Recordings
CHiME-6 Challenge:Tackling Multispeaker Speech Recognition for Unsegmented Recordings
Shinji Watanabe
Michael I. Mandel
Jon Barker
Emmanuel Vincent
Ashish Arora
...
Emmanuel Vincent
Shota Horiguchi
Naoyuki Kanda
Takuya Yoshioka
Neville Ryant
17
295
0
20 Apr 2020
Improving noise robust automatic speech recognition with single-channel
  time-domain enhancement network
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network
K. Kinoshita
Tsubasa Ochiai
Marc Delcroix
Tomohiro Nakatani
13
97
0
09 Mar 2020
Distributed Training of Deep Neural Network Acoustic Models for
  Automatic Speech Recognition
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
Xiaodong Cui
Wei Zhang
Ulrich Finkler
G. Saon
M. Picheny
David S. Kung
22
19
0
24 Feb 2020
Single headed attention based sequence-to-sequence model for
  state-of-the-art results on Switchboard
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard
Zoltán Tüske
G. Saon
Kartik Audhkhasi
Brian Kingsbury
BDL
14
68
0
20 Jan 2020
Domain Expansion in DNN-based Acoustic Models for Robust Speech
  Recognition
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Shahram Ghorbani
S. Khorram
John H. L. Hansen
21
18
0
01 Oct 2019
Weighted delay-and-sum beamforming guided by visual tracking for
  human-robot interaction
Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction
José Novoa
R. Mahú
Alejandro Díaz
J. Wuth
R. Stern
N. B. Yoma
13
8
0
17 Jun 2019
Listening while Speaking and Visualizing: Improving ASR through
  Multimodal Chain
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain
Johanes Effendi
Andros Tjandra
S. Sakti
Satoshi Nakamura
14
3
0
03 Jun 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn
  University Joint Investigation for Dinner Party ASR
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
Naoyuki Kanda
Christoph Boeddeker
Jens Heitkaemper
Yusuke Fujita
Shota Horiguchi
Kenji Nagamatsu
Reinhold Häb-Umbach
13
61
0
29 May 2019
Acoustic-to-Word Models with Conversational Context Information
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
14
7
0
21 May 2019
Encrypted Speech Recognition using Deep Polynomial Networks
Encrypted Speech Recognition using Deep Polynomial Networks
Shi-Xiong Zhang
Y. Gong
Dong Yu
16
25
0
11 May 2019
A Comparison of Online Automatic Speech Recognition Systems and the
  Nonverbal Responses to Unintelligible Speech
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Joshua Y. Kim
Chunfeng Liu
R. Calvo
K. McCabe
Silas C. R. Taylor
Björn W. Schuller
Kaihang Wu
15
38
0
29 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and
  Knowledge Distillation
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
14
46
0
17 Apr 2019
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial
  and Multi-task Learning in Speech Recognition
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Yossi Adi
Neil Zeghidour
R. Collobert
Nicolas Usunier
Vitaliy Liptchinsky
Gabriel Synnaeve
17
39
0
09 Dec 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
17
96
0
10 Jul 2018
RETURNN as a Generic Flexible Neural Toolkit with Application to
  Translation and Speech Recognition
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition
Albert Zeyer
Tamer Alkhouli
Hermann Ney
29
90
0
14 May 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset,
  task and baselines
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker
Shinji Watanabe
Emmanuel Vincent
J. Trmal
14
678
0
28 Mar 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
Sequence-based Multi-lingual Low Resource Speech Recognition
Siddharth Dalmia
Ramon Sanabria
Florian Metze
A. Black
18
94
0
21 Feb 2018
The CAPIO 2017 Conversational Speech Recognition System
The CAPIO 2017 Conversational Speech Recognition System
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
15
72
0
29 Dec 2017
Language Modeling with Highway LSTM
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
13
38
0
19 Sep 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
34
229
0
24 Jul 2017
Multi-talker Speech Separation with Utterance-level Permutation
  Invariant Training of Deep Recurrent Neural Networks
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
Morten Kolbaek
Dong Yu
Z. Tan
Jesper Jensen
8
721
0
18 Mar 2017
1