Listen, Attend and Spell


5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
    RALM

Papers citing "Listen, Attend and Spell"

50 / 1,033 papers shown
End-to-end contextual ASR based on posterior distribution adaptation for hybrid CTC/attention system
Zheng-Wei Zhang
Pan Zhou
47
6
0
18 Feb 2022
Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers
Yotaro Kubo
Shigeki Karita
M. Bacchiani
6
26
0
16 Feb 2022
Conversational Speech Recognition By Learning Conversation-level Characteristics
Kun Wei
Yike Zhang
Sining Sun
Lei Xie
Long Ma
43
7
0
16 Feb 2022
USTED: Improving ASR with a Unified Speech and Text Encoder-Decoder
Bolaji Yusuf
Ankur Gandhe
Alex Sokolov
40
8
0
12 Feb 2022
Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding
Peter Sullivan
Toshiko Shibano
Muhammad Abdul-Mageed
41
11
0
10 Feb 2022
ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
D. Pinto
J. Arnau
Antonio González
33
0
0
10 Feb 2022
Semantic-aware Speech to Text Transmission with Redundancy Removal
Tian Han
Qianqian Yang
Zhiguo Shi
Shibo He
Zhaoyang Zhang
20
16
0
07 Feb 2022
Joint Speech Recognition and Audio Captioning
Chaitanya Narisetty
E. Tsunoo
Xuankai Chang
Yosuke Kashiwagi
Michael Hentschel
Shinji Watanabe
19
10
0
03 Feb 2022
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT
Liyan Xu
Yile Gu
J. Kolehmainen
Haidar Khan
Ankur Gandhe
Ariya Rastrow
A. Stolcke
I. Bulyko
42
45
0
02 Feb 2022
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian
P. Mihajlik
A. Balog
T. E. Gráczi
A. Kohári
Balázs Tarján
K. Mády
25
8
0
01 Feb 2022
Transformer-based Models of Text Normalization for Speech Applications
Jae Hun Ro
Felix Stahlberg
Ke Wu
Shankar Kumar
14
7
0
01 Feb 2022
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
Minglun Han
Linhao Dong
Zhenlin Liang
Meng Cai
Shiyu Zhou
Zejun Ma
Bo Xu
24
45
0
30 Jan 2022
Reducing language context confusion for end-to-end code-switching automatic speech recognition
Shuai Zhang
Jiangyan Yi
Zhengkun Tian
J. Tao
Y. Yeung
Liqun Deng
27
11
0
28 Jan 2022
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR
Zhao Yang
Dianwen Ng
Xiao Fu
Liping Han
Wei Xi
Ruimeng Wang
Rui Jiang
Jizhong Zhao
40
2
0
26 Jan 2022
Improving the fusion of acoustic and text representations in RNN-T
Chao Zhang
Bo-wen Li
Zhiyun Lu
Tara N. Sainath
Shuo-yiin Chang
AI4CE
43
12
0
25 Jan 2022
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
14
2
0
25 Jan 2022
Recent Progress in the CUHK Dysarthric Speech Recognition System
Shansong Liu
Mengzhe Geng
Shoukang Hu
Xurong Xie
Mingyu Cui
Jianwei Yu
Xunying Liu
Helen Meng
16
58
0
15 Jan 2022
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Mengzhe Geng
Shansong Liu
Jianwei Yu
Xurong Xie
Shoukang Hu
Zi Ye
Zengrui Jin
Xunying Liu
Helen Meng
34
21
0
14 Jan 2022
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Shou-Yong Hu
Xurong Xie
Mingyu Cui
Jiajun Deng
Shansong Liu
Jianwei Yu
Mengzhe Geng
Xunying Liu
Helen Meng
44
26
0
08 Jan 2022
Two-Pass End-to-End ASR Model Compression
Nauman Dawalatabad
Tushar Vatsal
Ashutosh Gupta
Sungsoo Kim
Shatrughan Singh
Dhananjaya N. Gowda
Chanwoo Kim
24
5
0
08 Jan 2022
Sign Language Video Retrieval with Free-Form Textual Queries
A. Duarte
Samuel Albanie
Xavier Giró-i-Nieto
Gül Varol
SLR
50
29
0
07 Jan 2022
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
31
10
0
06 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
33
20
0
04 Jan 2022
Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Yuanfeng Song
Raymond Chi-Wing Wong
Xuefang Zhao
Di Jiang
39
13
0
04 Jan 2022
Voice Quality and Pitch Features in Transformer-Based Speech Recognition
Guillermo Cámbara
Jordi Luque
Mireia Farrús
24
0
0
21 Dec 2021
Saliency Grafting: Innocuous Attribution-Guided Mixup with Calibrated Label Mixing
Joonhyung Park
J. Yang
Jinwoo Shin
Sung Ju Hwang
Eunho Yang
33
23
0
16 Dec 2021
Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems
Saket Dingliwal
Ashish Shenoy
S. Bodapati
Ankur Gandhe
R. Gadde
Katrin Kirchhoff
VLM
33
4
0
16 Dec 2021
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
22
31
0
14 Dec 2021
PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition
Guodong Ma
Pengfei Hu
Nurmemet Yolwas
Shen Huang
Hao-Ming Huang
27
4
0
13 Dec 2021
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI
Jinchuan Tian
Jianwei Yu
Chao Weng
Shi-Xiong Zhang
Dan Su
Dong Yu
Yuexian Zou
AuLLM
45
13
0
05 Dec 2021
Deliberation of Streaming RNN-Transducer by Non-autoregressive Decoding
Weiran Wang
Ke Hu
Tara N. Sainath
35
21
0
01 Dec 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
27
13
0
29 Nov 2021
Lattention: Lattice-attention in ASR rescoring
Prabhat Pandey
Sergio Duarte Torres
Ali Orkan Bayer
Ankur Gandhe
Volker Leutnant
18
7
0
19 Nov 2021
A comparison of streaming models and data augmentation methods for robust speech recognition
Jiyeon Kim
Mehul Kumar
Dhananjaya N. Gowda
Abhinav Garg
Chanwoo Kim
31
5
0
19 Nov 2021
Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition
Yi-Chang Chen
Chun-Yen Cheng
Chien-An Chen
Ming-Chieh Sung
Yi-Ren Yeh
17
6
0
16 Nov 2021
Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Ondřej Klejch
E. Wallington
P. Bell
11
12
0
12 Nov 2021
Enhancing Backdoor Attacks with Multi-Level MMD Regularization
Pengfei Xia
Hongjing Niu
Ziqiang Li
Bin Li
AAML
46
29
0
09 Nov 2021
Conformer-based Hybrid ASR System for Switchboard Dataset
Mohammad Zeineldeen
Jingjing Xu
Christoph Lüscher
Wilfried Michel
Alexander Gerstenberger
Ralf Schlüter
Hermann Ney
22
24
0
05 Nov 2021
Context-Aware Transformer Transducer for Speech Recognition
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
21
79
0
05 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
35
363
0
02 Nov 2021
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition
Evangelos Kazakos
Jaesung Huh
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
50
45
0
01 Nov 2021
Revealing and Protecting Labels in Distributed Training
Trung D. Q. Dang
Om Thakkar
Swaroop Indra Ramaswamy
Rajiv Mathews
Peter Chin
Françoise Beaufays
12
25
0
31 Oct 2021
Pseudo-Labeling for Massively Multilingual Speech Recognition
Loren Lugosch
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
VLM
13
29
0
30 Oct 2021
Cross-attention conformer for context modeling in speech enhancement for ASR
A. Narayanan
Chung-Cheng Chiu
Tom O'Malley
Quan Wang
Yanzhang He
24
14
0
30 Oct 2021
An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR
Huaibo Zhao
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
17
4
0
20 Oct 2021
Automatic Learning of Subword Dependent Model Scales
Felix Meyer
Wilfried Michel
Mohammad Zeineldeen
Ralf Schlüter
Hermann Ney
19
0
0
18 Oct 2021
Sub-word Level Lip Reading With Visual Attention
Prajwal K R
Triantafyllos Afouras
Andrew Zisserman
17
92
0
14 Oct 2021
On Language Model Integration for RNN Transducer based Speech Recognition
Wei Zhou
Zuoyun Zheng
Ralf Schlüter
Hermann Ney
37
22
0
13 Oct 2021
Reason induced visual attention for explainable autonomous driving
Sikai Chen
Jiqian Dong
Runjia Du
Yujie Li
S. Labi
34
1
0
11 Oct 2021
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Yosuke Higuchi
Nanxin Chen
Yuya Fujita
Hirofumi Inaguma
Tatsuya Komatsu
Jaesong Lee
Jumon Nozaki
Tianzi Wang
Shinji Watanabe
38
41
0
11 Oct 2021