ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.01211
  4. Cited By
Listen, Attend and Spell
v1v2 (latest)

Listen, Attend and Spell

5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
    RALM
ArXiv (abs)PDFHTML

Papers citing "Listen, Attend and Spell"

50 / 1,041 papers shown
Title
Towards Lifelong Learning of End-to-end ASR
Towards Lifelong Learning of End-to-end ASR
Heng-Jui Chang
Hung-yi Lee
Lin-Shan Lee
KELMCLL
94
34
0
04 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding
  with Numbers
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers
Loren Lugosch
Piyush Papreja
Mirco Ravanelli
A. Heba
Titouan Parcollet
77
14
0
04 Apr 2021
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech
  Recognition
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
Xuefei Liu
56
19
0
04 Apr 2021
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
Lu Huang
J. Sun
Yu Tang
Junfeng Hou
Jinkun Chen
Jun Zhang
Zejun Ma
29
3
0
02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a
  Language-Independent Subword Discriminative Feature Representation
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation
Siyuan Feng
Piotr Żelasko
Laureano Moro-Velazquez
O. Scharenborg
71
4
0
02 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a
  SMART: a Monte Carlo-based approach and case study with longitudinal
  overdispersed count outcomes
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomes
Jamie Yap
John J. Dziak
David Kabiito
Claire Babirye
J. McKay
Bibhas Chakraborty
J. Nakatumba‐Nabende
64
0
0
31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Attention, please! A survey of Neural Attention Models in Deep Learning
Alana de Santana Correia
Esther Luna Colombini
HAI
128
198
0
31 Mar 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter Transformers
Quentin Fournier
G. Caron
Daniel Aloise
137
105
0
26 Mar 2021
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual
  Speech Separation
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation
Jiyoung Lee
Soo-Whan Chung
Sunok Kim
Hong-Goo Kang
Kwanghoon Sohn
64
51
0
25 Mar 2021
Advancing RNN Transducer Technology for Speech Recognition
Advancing RNN Transducer Technology for Speech Recognition
G. Saon
Zoltan Tueske
Daniel Bolaños
Brian Kingsbury
95
88
0
17 Mar 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning
  with Self-Knowledge Distillation
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation
Md. Akmal Haidar
Chao Xing
Mehdi Rezagholizadeh
118
6
0
17 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
72
14
0
13 Mar 2021
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource
  End-to-End Speech Recognition
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition
A. Laptev
A. Andrusenko
Ivan Podluzhny
Anton Mitrofanov
Ivan Medennikov
Yuri N. Matveev
VLM
57
14
0
12 Mar 2021
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative
  Adversarial Networks
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial Networks
Md. Akmal Haidar
Mehdi Rezagholizadeh
113
9
0
10 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers
End-to-end acoustic modelling for phone recognition of young readers
Lucile Gelin
Morgane Daniel
J. Pinquier
Thomas Pellegrini
60
13
0
04 Mar 2021
Alignment Knowledge Distillation for Online Streaming Attention-based
  Speech Recognition
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Hirofumi Inaguma
Tatsuya Kawahara
130
14
0
28 Feb 2021
MixSpeech: Data Augmentation for Low-resource Automatic Speech
  Recognition
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition
Linghui Meng
Jin Xu
Xu Tan
Jindong Wang
Tao Qin
Bo Xu
VLM
120
78
0
25 Feb 2021
Neural ranking models for document retrieval
Neural ranking models for document retrieval
M. Trabelsi
Zhiyu Zoey Chen
Brian D. Davison
J. Heflin
FedML
85
29
0
23 Feb 2021
Joint Intent Detection And Slot Filling Based on Continual Learning
  Model
Joint Intent Detection And Slot Filling Based on Continual Learning Model
Yanfei Hui
Jianzong Wang
Ning Cheng
Fengying Yu
Tianbo Wu
Jing Xiao
42
15
0
22 Feb 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets,
  Tracks, Baselines, Results and Methods
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Xian Shi
Fan Yu
Yizhou Lu
Yuhao Liang
Qiangze Feng
Daliang Wang
Y. Qian
Lei Xie
65
68
0
20 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An
  Empirical Study
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
55
54
0
19 Feb 2021
Vision-Aided 6G Wireless Communications: Blockage Prediction and
  Proactive Handoff
Vision-Aided 6G Wireless Communications: Blockage Prediction and Proactive Handoff
Gouranga Charan
Muhammad Alrabeiah
Ahmed Alkhateeb
52
136
0
18 Feb 2021
Do End-to-End Speech Recognition Models Care About Context?
Do End-to-End Speech Recognition Models Care About Context?
Lasse Borgholt
Jakob Drachmann Havtorn
Zeljko Agic
Anders Søgaard
Lars Maaløe
Christian Igel
61
7
0
17 Feb 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for
  air traffic control systems
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems
Yi Lin
Bo Yang
Linchao Li
Dongyue Guo
Jianwei Zhang
Hu Chen
Yi Zhang
80
29
0
17 Feb 2021
End-to-End Automatic Speech Recognition with Deep Mutual Learning
End-to-End Automatic Speech Recognition with Deep Mutual Learning
Ryo Masumura
Mana Ihori
Akihiko Takashima
Tomohiro Tanaka
Takanori Ashihara
36
5
0
16 Feb 2021
Exploring Transformers in Natural Language Generation: GPT, BERT, and
  XLNet
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet
M. O. Topal
Anil Bas
Imke van Heerden
LLMAGAI4CE
73
91
0
16 Feb 2021
Improving speech recognition models with small samples for air traffic
  control systems
Improving speech recognition models with small samples for air traffic control systems
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
104
32
0
16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and
  Cross-Modal Knowledge Transferring from BERT
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
91
52
0
15 Feb 2021
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and
  language Models for Intent Classification
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Bidisha Sharma
Maulik C. Madhavi
Haizhou Li
51
20
0
15 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural
  Networks for Automatic Speech Recognition
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Priyabrata Karmakar
S. Teng
Guojun Lu
58
27
0
14 Feb 2021
Do as I mean, not as I say: Sequence Loss Training for Spoken Language
  Understanding
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Milind Rao
Pranav Dheram
Gautam Tiwari
A. Raju
J. Droppo
Ariya Rastrow
A. Stolcke
50
17
0
12 Feb 2021
Sparsification via Compressed Sensing for Automatic Speech Recognition
Sparsification via Compressed Sensing for Automatic Speech Recognition
Kai Zhen
Hieu Duy Nguyen
Feng-Ju Chang
Athanasios Mouchtaris
Ariya Rastrow
.
63
13
0
09 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
Intermediate Loss Regularization for CTC-based Speech Recognition
Jaesong Lee
Shinji Watanabe
153
140
0
05 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech
  Recognition
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition
Zhong Meng
Naoyuki Kanda
Yashesh Gaur
S. Parthasarathy
Eric Sun
Liang Lu
Xie Chen
Jinyu Li
Jiawei Liu
AuLLM
104
53
0
02 Feb 2021
End2End Acoustic to Semantic Transduction
End2End Acoustic to Semantic Transduction
Valentin Pelloin
Nathalie Camelin
Antoine Laurent
R. Mori
Antoine Caubrière
Yannick Esteve
S. Meignier
43
15
0
01 Feb 2021
Speech Recognition by Simply Fine-tuning BERT
Speech Recognition by Simply Fine-tuning BERT
Wen-Chin Huang
Chia-Hua Wu
Shang-Bao Luo
Kuan-Yu Chen
Hsin-Min Wang
Tomoki Toda
126
28
0
30 Jan 2021
Transformer Based Deliberation for Two-Pass Speech Recognition
Transformer Based Deliberation for Two-Pass Speech Recognition
Ke Hu
Ruoming Pang
Tara N. Sainath
Trevor Strohman
76
38
0
27 Jan 2021
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for
  Uncertainty Aware Control of Sepsis Treatment
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis Treatment
Thesath Nanayakkara
G. Clermont
C. Langmead
D. Swigon
AI4CE
91
24
0
21 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and Human
Arabic Speech Recognition by End-to-End, Modular Systems and Human
A. Hussein
Shinji Watanabe
Ahmed M. Ali
VLM
75
50
0
21 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and
  Unlabeled Data
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OTSSL
92
115
0
19 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Min Zhang
OffRL
125
75
0
01 Jan 2021
The 2020 ESPnet update: new features, broadened applications,
  performance improvements, and future plans
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Shinji Watanabe
Florian Boyer
Xuankai Chang
Pengcheng Guo
Tomoki Hayashi
...
Shigeki Karita
Chenda Li
Jing Shi
Aswin Shanmugam Subramanian
Wangyou Zhang
VLM
110
38
0
23 Dec 2020
ConvMath: A Convolutional Sequence Network for Mathematical Expression
  Recognition
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
Zuoyu Yan
Xiaode Zhang
Liangcai Gao
Ke Yuan
Zhi Tang
63
17
0
23 Dec 2020
Adversarial Meta Sampling for Multilingual Low-Resource Speech
  Recognition
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition
Yubei Xiao
Ke Gong
Pan Zhou
Guolin Zheng
Xiaodan Liang
Liang Lin
80
35
0
22 Dec 2020
NeurST: Neural Speech Translation Toolkit
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
91
32
0
18 Dec 2020
The effectiveness of unsupervised subword modeling with autoregressive
  and cross-lingual phone-aware networks
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks
Siyuan Feng
O. Scharenborg
SSL
56
3
0
17 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech
  Recognition
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition
Minglun Han
Linhao Dong
Shiyu Zhou
Bo Xu
73
23
0
17 Dec 2020
AV Taris: Online Audio-Visual Speech Recognition
AV Taris: Online Audio-Visual Speech Recognition
George Sterpu
N. Harte
63
1
0
14 Dec 2020
Bayesian Learning for Deep Neural Network Adaptation
Bayesian Learning for Deep Neural Network Adaptation
Xurong Xie
Xunying Liu
Tan Lee
Lan Wang
BDL
120
22
0
14 Dec 2020
Less Is More: Improved RNN-T Decoding Using Limited Label Context and
  Path Merging
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging
Rohit Prabhavalkar
Yanzhang He
David Rybach
S. Campbell
A. Narayanan
Trevor Strohman
Tara N. Sainath
128
35
0
12 Dec 2020
Previous
123...101112...192021
Next