ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10697
  4. Cited By
Correction of Automatic Speech Recognition with Transformer
  Sequence-to-sequence Model

Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model

23 October 2019
Oleksii Hrinchuk
Mariya Popova
Boris Ginsburg
    VLM
ArXivPDFHTML

Papers citing "Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model"

49 / 49 papers shown
Title
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems
Amin Robatian
Mohammad Hajipour
Mohammad Reza Peyghan
Fatemeh Rajabi
Sajjad Amini
Shahrokh Ghaemmaghami
Iman Gholampour
41
0
0
18 Jan 2025
ASR Error Correction using Large Language Models
ASR Error Correction using Large Language Models
Rao Ma
Mengjie Qian
Mark J. F. Gales
Kate Knill
KELM
46
1
0
14 Sep 2024
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking
Jihyun Lee
Solee Im
Wonjun Lee
Gary Geunbae Lee
33
0
0
10 Sep 2024
Towards interfacing large language models with ASR systems using
  confidence measures and prompting
Towards interfacing large language models with ASR systems using confidence measures and prompting
Maryam Naderi
Xingrui Yang
Weihan Wang
Sevada Hovsepyan
Weichen Dai
KELM
29
1
0
31 Jul 2024
Robust ASR Error Correction with Conservative Data Filtering
Robust ASR Error Correction with Conservative Data Filtering
Takuma Udagawa
Masayuki Suzuki
Masayasu Muraoka
Gakuto Kurata
51
0
0
18 Jul 2024
Transformer-based Model for ASR N-Best Rescoring and Rewriting
Transformer-based Model for ASR N-Best Rescoring and Rewriting
Iwen E. Kang
Christophe Van Gysel
Man-Hung Siu
34
2
0
12 Jun 2024
Denoising LM: Pushing the Limits of Error Correction Models for Speech
  Recognition
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
AuLLM
48
2
0
24 May 2024
Semantically Corrected Amharic Automatic Speech Recognition
Semantically Corrected Amharic Automatic Speech Recognition
Samuael Adnew
Paul Pu Liang
28
0
0
20 Apr 2024
Automatic Speech Recognition using Advanced Deep Learning Approaches: A
  survey
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey
Hamza Kheddar
Mustapha Hemis
Yassine Himeur
OffRL
38
59
0
02 Mar 2024
Toward Practical Automatic Speech Recognition and Post-Processing: a
  Call for Explainable Error Benchmark Guideline
Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline
Seonmin Koo
Chanjun Park
Jinsung Kim
Jaehyung Seo
Sugyeong Eo
Hyeonseok Moon
Heu-Jeoung Lim
33
4
0
26 Jan 2024
Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition
  and Phoneme to Grapheme Translation
Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation
Wonjun Lee
Gary Geunbae Lee
Yunsu Kim
31
0
0
06 Dec 2023
Optimized Tokenization for Transcribed Error Correction
Optimized Tokenization for Transcribed Error Correction
Tomer Wullach
Shlomo E. Chazan
24
0
0
16 Oct 2023
DiaCorrect: Error Correction Back-end For Speaker Diarization
DiaCorrect: Error Correction Back-end For Speaker Diarization
Jiangyu Han
Federico Landini
Johan Rohdin
Mireia Díez
Lukás Burget
Yuhang Cao
Heng Lu
J. Černocký
39
3
0
15 Sep 2023
Boosting Chinese ASR Error Correction with Dynamic Error Scaling
  Mechanism
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism
Jiaxin Fan
Yong Zhang
Hanzhang Li
Jianzong Wang
Zhitao Li
Ouyang Sheng
Ning Cheng
Jing Xiao
14
0
0
07 Aug 2023
Enhancing conversational quality in language learning chatbots: An
  evaluation of GPT4 for ASR error correction
Enhancing conversational quality in language learning chatbots: An evaluation of GPT4 for ASR error correction
Long Mai
Julie Carson-Berndsen
11
4
0
19 Jul 2023
Can Generative Large Language Models Perform ASR Error Correction?
Can Generative Large Language Models Perform ASR Error Correction?
Rao Ma
Mengjie Qian
Potsawee Manakul
Mark J. F. Gales
Kate Knill
AuLLM
KELM
19
49
0
09 Jul 2023
SpellMapper: A non-autoregressive neural spellchecker for ASR
  customization with candidate retrieval based on n-gram mappings
SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings
Alexandra Antonova
Evelina Bakhturina
Boris Ginsburg
KELM
12
6
0
04 Jun 2023
Adapting an Unadaptable ASR System
Adapting an Unadaptable ASR System
Rao Ma
Mengjie Qian
Mark J. F. Gales
Kate Knill
28
3
0
01 Jun 2023
A Survey of Safety and Trustworthiness of Large Language Models through
  the Lens of Verification and Validation
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
39
82
0
19 May 2023
OLISIA: a Cascade System for Spoken Dialogue State Tracking
OLISIA: a Cascade System for Spoken Dialogue State Tracking
Léo Jacqmin
Lucas Druart
Yannick Esteve
Benoit Favre
L. Rojas-Barahona
Valentin Vielzeuf
20
3
0
20 Apr 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Junaid Qadir
42
47
0
21 Mar 2023
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses
  and Constrained Decoding Space
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space
Rao Ma
Mark J. F. Gales
Kate Knill
Mengjie Qian
11
32
0
01 Mar 2023
Partitioned Gradient Matching-based Data Subset Selection for
  Compute-Efficient Robust ASR Training
Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training
Ashish R. Mittal
D. Sivasubramanian
Rishabh K. Iyer
P. Jyothi
Ganesh Ramakrishnan
19
3
0
30 Oct 2022
Unsupervised domain adaptation for speech recognition with unsupervised
  error correction
Unsupervised domain adaptation for speech recognition with unsupervised error correction
Long Mai
Julie Carson-Berndsen
30
8
0
24 Sep 2022
Non-autoregressive Error Correction for CTC-based ASR with
  Phone-conditioned Masked LM
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Hayato Futami
H. Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
KELM
45
12
0
08 Sep 2022
Improving Deliberation by Text-Only and Semi-Supervised Training
Improving Deliberation by Text-Only and Semi-Supervised Training
Ke Hu
Tara N. Sainath
Yanzhang He
Rohit Prabhavalkar
Trevor Strohman
S. Mavandadi
Weiran Wang
26
12
0
29 Jun 2022
On Comparison of Encoders for Attention based End to End Speech
  Recognition in Standalone and Rescoring Mode
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Raviraj Joshi
Subodh Kumar
28
2
0
26 Jun 2022
Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture
Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture
Karan Singla
S. Jalalvand
Yeon-Jun Kim
Ryan Price
Daniel Pressel
S. Bangalore
10
2
0
29 Mar 2022
TIGGER: Scalable Generative Modelling for Temporal Interaction Graphs
TIGGER: Scalable Generative Modelling for Temporal Interaction Graphs
Shubham Gupta
S. Manchanda
Srikanta J. Bedathur
Sayan Ranu
19
18
0
07 Mar 2022
Towards Contextual Spelling Correction for Customization of End-to-end
  Speech Recognition Systems
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems
Xiaoqiang Wang
Yanqing Liu
Jinyu Li
Veljko Miljanic
Sheng Zhao
H. Khalil
KELM
11
18
0
02 Mar 2022
Romanian Speech Recognition Experiments from the ROBIN Project
Romanian Speech Recognition Experiments from the ROBIN Project
Andrei-Marius Avram
Vasile Puaics
Dan Tufics
11
4
0
23 Nov 2021
Oracle Teacher: Leveraging Target Information for Better Knowledge
  Distillation of CTC Models
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models
J. Yoon
H. Kim
Hyeon Seung Lee
Sunghwan Ahn
N. Kim
28
1
0
05 Nov 2021
Remember the context! ASR slot error correction through memorization
Remember the context! ASR slot error correction through memorization
Dhanush Bekal
Ashish Shenoy
Monica Sunkara
S. Bodapati
Katrin Kirchhoff
KELM
23
12
0
10 Sep 2021
Improving Distinction between ASR Errors and Speech Disfluencies with
  Feature Space Interpolation
Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Seongmin Park
D. Shin
Sangyoun Paik
Subong Choi
Alena Kazakova
Jihwa Lee
25
1
0
04 Aug 2021
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks
  using Switching Tokens
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens
Mana Ihori
Naoki Makishima
Tomohiro Tanaka
Akihiko Takashima
Shota Orihashi
Ryo Masumura
9
3
0
23 Jun 2021
Mondegreen: A Post-Processing Solution to Speech Recognition Error
  Correction for Voice Search Queries
Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries
Sukhdeep S. Sodhi
E. Chio
Ambarish Jash
Santiago Ontañón
Ajit Apte
...
Tameen Khan
Amol Wankhede
M. Alzantot
Allen Wu
Tushar Chandra
17
9
0
20 May 2021
SPGISpeech: 5,000 hours of transcribed financial audio for fully
  formatted end-to-end speech recognition
SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition
Patrick K. O’Neill
Vitaly Lavrukhin
Somshubra Majumdar
Vahid Noroozi
Yuekai Zhang
...
Keenan Freyberg
Michael D. Shulman
Boris Ginsburg
Shinji Watanabe
Georg Kucsko
AI4TS
18
59
0
05 Apr 2021
BART based semantic correction for Mandarin automatic speech recognition
  system
BART based semantic correction for Mandarin automatic speech recognition system
Yun Zhao
Xuerui Yang
Jinchao Wang
Yongyu Gao
Chao Yan
Yuanfu Zhou
VLM
11
28
0
26 Mar 2021
Generating Human Readable Transcript for Automatic Speech Recognition
  with Pre-trained Language Model
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Junwei Liao
Yu Shi
Ming Gong
Linjun Shou
Sefik Emre Eskimez
Liyang Lu
Hong Qu
Michael Zeng
17
9
0
22 Feb 2021
Neural Inverse Text Normalization
Neural Inverse Text Normalization
Monica Sunkara
Chaitanya P. Shivade
S. Bodapati
Katrin Kirchhoff
41
31
0
12 Feb 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
11
37
0
27 Jan 2021
Warped Language Models for Noise Robust Language Understanding
Warped Language Models for Noise Robust Language Understanding
Mahdi Namazifar
Gökhan Tür
Dilek Z. Hakkani-Tür
9
7
0
03 Nov 2020
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
Anuj Diwan
P. Jyothi
6
5
0
19 Oct 2020
Large-scale Transfer Learning for Low-resource Spoken Language
  Understanding
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
X. Jia
Jianzong Wang
Zhiyong Zhang
Ning Cheng
Jing Xiao
11
17
0
13 Aug 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with
  Customization Capability
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Jinyu Li
Rui Zhao
Zhong Meng
Yanqing Liu
Wenning Wei
...
V. Mazalov
Zhenghao Wang
Lei He
Sheng Zhao
Jiawei Liu
18
107
0
30 Jul 2020
Improving Readability for Automatic Speech Recognition Transcription
Improving Readability for Automatic Speech Recognition Transcription
Junwei Liao
Sefik Emre Eskimez
Liyang Lu
Yu Shi
Ming Gong
Linjun Shou
Hong Qu
Michael Zeng
27
55
0
09 Apr 2020
Stacked DeBERT: All Attention in Incomplete Data for Text Classification
Stacked DeBERT: All Attention in Incomplete Data for Text Classification
Gwenaelle Cunha Sergio
Minho Lee
19
30
0
01 Jan 2020
NeMo: a toolkit for building AI applications using Neural Modules
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
202
292
0
14 Sep 2019
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,743
0
26 Sep 2016
1