2111.03250
Context-Aware Transformer Transducer for Speech Recognition
5 November 2021
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
Papers citing "Context-Aware Transformer Transducer for Speech Recognition"
49 / 49 papers shown
Title
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
46
0
0
01 Nov 2024
Automatic Speech Recognition with BERT and CTC Transformers: A Review
Noussaiba Djeffal
Hamza Kheddar
Djamel Addou
A. Mazari
Yassine Himeur
37
16
0
12 Oct 2024
Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
L. Velikovich
Christopher Li
D. Caseiro
Shankar Kumar
Pat Rondon
Kandarp Joshi
Xavier Velez
KELM
29
0
0
24 Sep 2024
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems
Hitesh Tulsiani
David M. Chan
Shalini Ghosh
Garima Lalwani
Prabhat Pandey
Ankish Bansal
Sri Garimella
Ariya Rastrow
Björn Hoffmeister
31
0
0
16 Sep 2024
Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach
Siqi Li
Danni Liu
Jan Niehues
28
0
0
13 Sep 2024
Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR
Mingyu Cui
Yifan Yang
Jiajun Deng
Jiawen Kang
Shujie Hu
Tianzi Wang
Zhaoqing Li
Shiliang Zhang
Xie Chen
Xunying Liu
33
1
0
13 Sep 2024
Exploring Differences between Human Perception and Model Inference in Audio Event Recognition
Yizhou Tan
Yanru Wu
Yuanbo Hou
Xin Xu
Hui Bu
Shengchen Li
Dick Botteldooren
Mark D. Plumbley
38
0
0
10 Sep 2024
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation
Ruizhe Huang
M. Yarmohammadi
Sanjeev Khudanpur
Dan Povey
35
2
0
14 Jul 2024
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss
Muhammad Shakeel
Yui Sudo
Yifan Peng
Shinji Watanabe
AI4CE
31
2
0
23 Jun 2024
Multi-Modal Retrieval For Large Language Model Based Speech Recognition
J. Kolehmainen
Aditya Gourav
Prashanth Gurunath Shivakumar
Yile Gu
Ankur Gandhe
Ariya Rastrow
Grant P. Strimel
I. Bulyko
40
4
0
13 Jun 2024
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
Guanrou Yang
Ziyang Ma
Fan Yu
Zhifu Gao
Shiliang Zhang
Xie Chen
AuLLM
38
2
0
09 Jun 2024
Text Injection for Neural Contextual Biasing
Zhong Meng
Zelin Wu
Rohit Prabhavalkar
Cal Peyser
Weiran Wang
Nanxin Chen
Tara N. Sainath
Bhuvana Ramabhadran
28
3
0
05 Jun 2024
Keyword-Guided Adaptation of Automatic Speech Recognition
Aviv Shamsian
Aviv Navon
Neta Glazer
Gill Hetz
Joseph Keshet
33
1
0
04 Jun 2024
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Zelin Wu
Gan Song
Christopher Li
Pat Rondon
Zhong Meng
...
D. Caseiro
Golan Pundak
Tsendsuren Munkhdalai
Angad Chandorkar
Rohit Prabhavalkar
18
3
0
15 Apr 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition
Hainan Xu
Zhehuai Chen
Fei Jia
Boris Ginsburg
35
0
0
04 Apr 2024
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Yi-Cheng Wang
Hsin-Wei Wang
Bi-Cheng Yan
Chi-Han Lin
Berlin Chen
29
1
0
26 Mar 2024
Self-consistent context aware conformer transducer for speech recognition
Konstantin Kolokolov
Pavel Pekichev
Karthik Raghunathan
22
0
0
09 Feb 2024
Locality enhanced dynamic biasing and sampling strategies for contextual ASR
Md. Asif Jalal
Pablo Peso Parada
George Pavlidis
Vasileios Moschopoulos
Karthikeyan P. Saravanan
...
Jisi Zhang
Anastasios Drosou
Gil Ho Lee
Jungin Lee
Seokyeong Jung
26
2
0
23 Jan 2024
Improving ASR Contextual Biasing with Guided Attention
Jiyang Tang
Kwangyoun Kim
Suwon Shon
Felix Wu
Prashant Sridhar
Shinji Watanabe
23
8
0
16 Jan 2024
Promptformer: Prompted Conformer Transducer for ASR
Sergio Duarte Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
Leif Rädel
Volker Leutnant
34
3
0
14 Jan 2024
LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition
Fan Yu
Haoxu Wang
Xian Shi
Shiliang Zhang
21
3
0
12 Jan 2024
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings
Christopher Li
Gary Wang
Kyle Kastner
Heng Su
Allen Chen
...
Zelin Wu
L. Velikovich
Pat Rondon
D. Caseiro
Petar S. Aleksic
34
1
0
08 Jan 2024
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition
David M. Chan
Shalini Ghosh
Hitesh Tulsiani
Ariya Rastrow
Björn Hoffmeister
28
1
0
04 Jan 2024
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer
Jin Qiu
Lu Huang
Boyu Li
Jun Zhang
Lu Lu
Zejun Ma
21
3
0
15 Nov 2023
Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Ankitha Sudarshan
Vinay Samuel
Parth Patwa
Ibtihel Amara
Aman Chadha
21
2
0
14 Oct 2023
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition
Kaixun Huang
Aoting Zhang
Binbin Zhang
Tianyi Xu
Xingchen Song
Lei Xie
16
3
0
07 Oct 2023
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Weiran Wang
Zelin Wu
D. Caseiro
Tsendsuren Munkhdalai
K. Sim
...
Rohit Prabhavalkar
Zhong Meng
Ding Zhao
Tara N. Sainath
P. M. Mengibar
48
5
0
29 Sep 2023
Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization
Alexandra Antonova
38
0
0
29 Sep 2023
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting
Yuang Li
Min Zhang
Chang Su
Yinglu Li
Xiaosong Qiao
Mengxin Ren
Miaomiao Ma
Daimeng Wei
Shimin Tao
Hao Yang
24
5
0
18 Sep 2023
Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer
Peng Wang
Yifan Yang
Zheng Liang
Tian Tan
Shiliang Zhang
Xie Chen
15
0
0
14 Sep 2023
SlideSpeech: A Large-Scale Slide-Enriched Audio-Visual Corpus
Haoxu Wang
Fan Yu
Xian Shi
Yuezhang Wang
Shiliang Zhang
Ming Li
29
11
0
11 Sep 2023
Contextual Biasing of Named-Entities with Large Language Models
Chuanneng Sun
Zeeshan Ahmed
Yingyi Ma
Zhe Liu
Lucas Kabela
Yutong Pang
Ozlem Kalinli
KELM
20
7
0
01 Sep 2023
Personalization for BERT-based Discriminative Speech Recognition Rescoring
J. Kolehmainen
Yile Gu
Aditya Gourav
Prashanth Gurunath Shivakumar
Ankur Gandhe
Ariya Rastrow
I. Bulyko
32
2
0
13 Jul 2023
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
Mingyu Cui
Jiawen Kang
Jiajun Deng
Xiaoyue Yin
Yutao Xie
Xie Chen
Xunying Liu
29
8
0
23 Jun 2023
Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition
Tianyi Xu
Zhanheng Yang
Kaixun Huang
Pengcheng Guo
Aoting Zhang
Biao Li
Changru Chen
Chong Li
Linfu Xie
22
10
0
01 Jun 2023
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network
Kaixun Huang
Aoting Zhang
Zhanheng Yang
Pengcheng Guo
Bingshen Mu
Tianyi Xu
Linfu Xie
29
16
0
21 May 2023
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition
Xuandi Fu
Kanthashree Mysore Sathyendra
Ankur Gandhe
Jing Liu
Grant P. Strimel
Ross McGowan
Athanasios Mouchtaris
25
14
0
09 May 2023
Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Maurits J. R. Bleeker
P. Swietojanski
Stefan Braun
Xiaodan Zhuang
39
8
0
18 Apr 2023
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition
Saumya Yashmohini Sahai
Jing Liu
Thejaswi Muniyappa
Kanthashree Mysore Sathyendra
Anastasios Alexandridis
...
Ross McGowan
Ariya Rastrow
Feng-Ju Chang
Athanasios Mouchtaris
Siegfried Kunzmann
36
5
0
03 Apr 2023
Dialog act guided contextual adapter for personalized speech recognition
Feng-Ju Chang
Thejaswi Muniyappa
Kanthashree Mysore Sathyendra
Kailin Wei
Grant P. Strimel
Ross McGowan
16
4
0
31 Mar 2023
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
R. Pandey
Roger Ren
Qi Luo
Jing Liu
Ariya Rastrow
Ankur Gandhe
Denis Filimonov
Grant P. Strimel
A. Stolcke
I. Bulyko
29
13
0
30 Mar 2023
On-the-fly Text Retrieval for End-to-End ASR Adaptation
Bolaji Yusuf
Aditya Gourav
Ankur Gandhe
I. Bulyko
KELM
RALM
37
4
0
20 Mar 2023
Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer
Zhanheng Yang
Sining Sun
Xiong Wang
Yike Zhang
Long Ma
Linfu Xie
20
9
0
17 Jan 2023
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition
David M. Chan
Shalini Ghosh
Ariya Rastrow
Björn Hoffmeister
OffRL
18
6
0
06 Jan 2023
Contextual-Utterance Training for Automatic Speech Recognition
Alejandro Gomez-Alanis
Lukas Drude
A. Schwarz
R. Swaminathan
Simon Wiesler
26
1
0
27 Oct 2022
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?
Pradip Pramanick
Chayan Sarkar
21
7
0
21 Oct 2022
ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition
Martin H. Radfar
Rohit Barnwal
R. Swaminathan
Feng-Ju Chang
Grant P. Strimel
Nathan Susanj
Athanasios Mouchtaris
31
13
0
29 Sep 2022
Contextual Adapters for Personalized Speech Recognition in Neural Transducers
Kanthashree Mysore Sathyendra
Thejaswi Muniyappa
Feng-Ju Chang
Jing Liu
Jinru Su
Grant P. Strimel
Athanasios Mouchtaris
Siegfried Kunzmann
19
75
0
26 May 2022
Domain-aware Neural Language Models for Speech Recognition
Linda Liu
Yile Gu
Aditya Gourav
Ankur Gandhe
Shashank Kalmane
Denis Filimonov
Ariya Rastrow
I. Bulyko
33
21
0
05 Jan 2021