1807.10857
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
27 July 2018
Shubham Toshniwal
Anjuli Kannan
Chung-Cheng Chiu
Yonghui Wu
Tara N. Sainath
Karen Livescu
Papers citing "A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition"
50 / 60 papers shown
Title
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
Moreno La Quatra
Valerio Mario Salerno
Yu Tsao
Sabato Marco Siniscalchi
177
2
0
22 Jan 2025
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems
Amin Robatian
Mohammad Hajipour
Mohammad Reza Peyghan
Fatemeh Rajabi
Sajjad Amini
Shahrokh Ghaemmaghami
Iman Gholampour
110
2
0
18 Jan 2025
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping
Martin Pelikan
Sheikh Shams Azam
Vitaly Feldman
Jan Honza Silovsky
Kunal Talwar
Christopher G. Brinton
Tatiana Likhomanenko
97
8
0
29 Sep 2023
Leveraging Cross-Utterance Context For ASR Decoding
Robert Flynn
Anton Ragni
69
1
0
29 Jun 2023
External Language Model Integration for Factorized Neural Transducers
Michael Levit
S. Parthasarathy
Cem Aksoylar
Mohammad Sadegh Rasooli
Shuangyu Chang
84
2
0
26 May 2023
Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
Nilaksh Das
Monica Sunkara
S. Bodapati
Jason (Jinglun) Cai
Devang Kulshreshtha
Jeffrey J. Farris
Katrin Kirchhoff
55
3
0
05 May 2023
An Overview on Language Models: Recent Developments and Outlook
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
93
47
0
10 Mar 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Minglun Han
Feilong Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
83
13
0
30 Jan 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Yukun Feng
Ming Tu
Rui Xia
Chuanzeng Huang
Yuxuan Wang
RALM
81
0
0
30 Dec 2022
Evince the artifacts of Spoof Speech by blending Vocal Tract and Voice Source Features
T. U. K. Reddy
Sahukari Chaitanya Varun
Kota Pranav Kumar
Sankala Sreekanth
K. Murty
117
0
0
05 Dec 2022
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
Ao Zhang
F. Yu
Kaixun Huang
Linfu Xie
Longbiao Wang
Eng Siong Chng
Hui Bu
Binbin Zhang
Wei Chen
Xin Xu
79
5
0
03 Nov 2022
Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Rao Ma
Xiaobo Wu
Jin Qiu
Yanan Qin
Haihua Xu
Peihao Wu
Zejun Ma
50
2
0
02 Nov 2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition
Suyoun Kim
Ke Li
Lucas Kabela
Rongqing Huang
Jiedan Zhu
Ozlem Kalinli
Duc Le
75
8
0
31 Oct 2022
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
VLM
86
10
0
27 Oct 2022
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models
Vilém Zouhar
Marius Mosbach
Dietrich Klakow
57
1
0
04 Aug 2022
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
Yufei Liu
Rao Ma
Haihua Xu
Yi He
Zejun Ma
Weibin Zhang
63
12
0
26 Jan 2022
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
64
11
0
06 Jan 2022
Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching
Chia-Yu Li
Ngoc Thang Vu
67
12
0
19 Dec 2021
Context-Aware Transformer Transducer for Speech Recognition
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
66
85
0
05 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
163
377
0
02 Nov 2021
Automatic Learning of Subword Dependent Model Scales
Felix Meyer
Wilfried Michel
Mohammad Zeineldeen
Ralf Schlüter
Hermann Ney
30
0
0
18 Oct 2021
Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Timo Lohrenz
P. Schwarz
Zhengyang Li
Tim Fingscheidt
48
11
0
02 Jul 2021
On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech
Katrin Tomanek
Françoise Beaufays
Julie Cattiau
Angad Chandorkar
K. Sim
82
15
0
18 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
126
769
0
08 Jun 2021
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Wenjie Huang
Tara N. Sainath
Cal Peyser
Shankar Kumar
David Rybach
Trevor Strohman
RALM
LMTD
82
6
0
09 Apr 2021
Language model fusion for streaming end to end speech recognition
Rodrigo Cabrera
Xiaofeng Liu
M. Ghodsi
Zebulun Matteson
Eugene Weinstein
Anjuli Kannan
MoMe
AI4TS
68
14
0
09 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
62
18
0
08 Apr 2021
LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Anton Mitrofanov
Mariya Korenevskaya
Ivan Podluzhny
Yuri Y. Khokhlov
A. Laptev
A. Andrusenko
A. Ilin
M. Korenevsky
Ivan Medennikov
A. Romanenko
KELM
LRM
26
2
0
06 Apr 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Duc Le
Mahaveer Jain
Gil Keren
Suyoun Kim
Yangyang Shi
...
Yuan Shangguan
Christian Fuegen
Ozlem Kalinli
Yatharth Saraf
M. Seltzer
87
102
0
05 Apr 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation
Md. Akmal Haidar
Chao Xing
Mehdi Rezagholizadeh
118
6
0
17 Mar 2021
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial Networks
Md. Akmal Haidar
Mehdi Rezagholizadeh
111
9
0
10 Mar 2021
Brain Signals to Rescue Aphasia, Apraxia and Dysarthria Speech Recognition
G. Krishna
Mason Carnahan
Shilpa Shamapant
Yashitha Surendranath
Saumya Jain
Arundhati Ghosh
Co Tran
José del R. Millán
Ahmed H. Tewfik
11
3
0
28 Feb 2021
Personalization Strategies for End-to-End Speech Recognition Systems
Aditya Gourav
Linda Liu
Ankur Gandhe
Yile Gu
Guitang Lan
...
Gautam Tiwari
Denis Filimonov
Ariya Rastrow
A. Stolcke
I. Bulyko
54
39
0
15 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation
G. Sun
Chuxu Zhang
P. Woodland
111
32
0
12 Feb 2021
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition
Cheng Yi
Shiyu Zhou
Bo Xu
108
40
0
17 Jan 2021
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems
Xianrui Zheng
Yulan Liu
Deniz Gunceler
D. Willett
129
79
0
23 Nov 2020
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
S. Parthasarathy
Eric Sun
Yashesh Gaur
Naoyuki Kanda
Liang Lu
Xie Chen
Rui Zhao
Jinyu Li
Jiawei Liu
AuLLM
80
110
0
03 Nov 2020
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Suyoun Kim
Shangguan Yuan
Jay Mahadeokar
A. Bruguier
Christian Fuegen
M. Seltzer
Duc Le
66
29
0
26 Oct 2020
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Cal Peyser
S. Mavandadi
Tara N. Sainath
J. Apfel
Ruoming Pang
Shankar Kumar
84
46
0
24 Aug 2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami
Hirofumi Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
78
53
0
09 Aug 2020
Early Stage LM Integration Using Local and Global Log-Linear Combination
Wilfried Michel
Ralf Schlüter
Hermann Ney
55
11
0
20 May 2020
Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
Cal Peyser
Tara N. Sainath
Golan Pundak
62
14
0
19 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
85
40
0
16 May 2020
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
Hirofumi Inaguma
Yashesh Gaur
Liang Lu
Jinyu Li
Jiawei Liu
AI4TS
73
46
0
10 Apr 2020
A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition
Erik McDermott
Hasim Sak
Ehsan Variani
72
113
0
26 Feb 2020
Continuous Silent Speech Recognition using EEG
G. Krishna
Co Tran
Mason Carnahan
Ahmed H. Tewfik
26
4
0
06 Feb 2020
End-to-end training of a large vocabulary end-to-end speech recognition system
Chanwoo Kim
Sungsoo Kim
Kwangyoun Kim
Mehul Kumar
Jiyeon Kim
...
Eunhyang Kim
Minkyoo Shin
Shatrughan Singh
Larry Heck
Dhananjaya N. Gowda
52
27
0
22 Dec 2019
Semantic Mask for Transformer based End-to-End Speech Recognition
Chengyi Wang
Yu Wu
Yujiao Du
Jinyu Li
Shujie Liu
Liang Lu
Shuo Ren
Guoli Ye
Sheng Zhao
Ming Zhou
70
52
0
06 Dec 2019
Improving EEG based Continuous Speech Recognition
G. Krishna
Co Tran
Mason Carnahan
Yan Han
Ahmed H. Tewfik
23
13
0
24 Nov 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
96
73
0
18 Sep 2019