v1v2 (latest)

Listen, Attend and Spell

5 August 2015

Papers citing "Listen, Attend and Spell"

50 / 1,041 papers shown

Title
Towards Lifelong Learning of End-to-end ASR Heng-Jui Chang Hung-yi Lee Lin-Shan Lee KELM CLL 94 34 0 04 Apr 2021
Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers Loren Lugosch Piyush Papreja Mirco Ravanelli A. Heba Titouan Parcollet 77 14 0 04 Apr 2021
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition Zhengkun Tian Jiangyan Yi J. Tao Ye Bai Shuai Zhang Zhengqi Wen Xuefei Liu 56 19 0 04 Apr 2021
HMM-Free Encoder Pre-Training for Streaming RNN Transducer Lu Huang J. Sun Yu Tang Junfeng Hou Jinkun Chen Jun Zhang Zejun Ma 29 3 0 02 Apr 2021
Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation Siyuan Feng Piotr Żelasko Laureano Moro-Velazquez O. Scharenborg 71 4 0 02 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomes Jamie Yap John J. Dziak David Kabiito Claire Babirye J. McKay Bibhas Chakraborty J. Nakatumba‐Nabende 64 0 0 31 Mar 2021
Attention, please! A survey of Neural Attention Models in Deep Learning Alana de Santana Correia Esther Luna Colombini HAI 128 198 0 31 Mar 2021
A Practical Survey on Faster and Lighter Transformers Quentin Fournier G. Caron Daniel Aloise 137 105 0 26 Mar 2021
Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation Jiyoung Lee Soo-Whan Chung Sunok Kim Hong-Goo Kang Kwanghoon Sohn 64 51 0 25 Mar 2021
Advancing RNN Transducer Technology for Speech Recognition G. Saon Zoltan Tueske Daniel Bolaños Brian Kingsbury 95 88 0 17 Mar 2021
Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation Md. Akmal Haidar Chao Xing Mehdi Rezagholizadeh 118 6 0 17 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo Bonaventure F. P. Dossou Chris C. Emezue 72 14 0 13 Mar 2021
Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition A. Laptev A. Andrusenko Ivan Podluzhny Anton Mitrofanov Ivan Medennikov Yuri N. Matveev VLM 57 14 0 12 Mar 2021
Fine-tuning of Pre-trained End-to-end Speech Recognition with Generative Adversarial Networks Md. Akmal Haidar Mehdi Rezagholizadeh 113 9 0 10 Mar 2021
End-to-end acoustic modelling for phone recognition of young readers Lucile Gelin Morgane Daniel J. Pinquier Thomas Pellegrini 60 13 0 04 Mar 2021
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition Hirofumi Inaguma Tatsuya Kawahara 130 14 0 28 Feb 2021
MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition Linghui Meng Jin Xu Xu Tan Jindong Wang Tao Qin Bo Xu VLM 120 78 0 25 Feb 2021
Neural ranking models for document retrieval M. Trabelsi Zhiyu Zoey Chen Brian D. Davison J. Heflin FedML 85 29 0 23 Feb 2021
Joint Intent Detection And Slot Filling Based on Continual Learning Model Yanfei Hui Jianzong Wang Ning Cheng Fengying Yu Tianbo Wu Jing Xiao 42 15 0 22 Feb 2021
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods Xian Shi Fan Yu Yizhou Lu Yuhao Liang Qiangze Feng Daliang Wang Y. Qian Lei Xie 65 68 0 20 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study Prashanth Gurunath Shivakumar Shrikanth Narayanan 55 54 0 19 Feb 2021
Vision-Aided 6G Wireless Communications: Blockage Prediction and Proactive Handoff Gouranga Charan Muhammad Alrabeiah Ahmed Alkhateeb 52 136 0 18 Feb 2021
Do End-to-End Speech Recognition Models Care About Context? Lasse Borgholt Jakob Drachmann Havtorn Zeljko Agic Anders Søgaard Lars Maaløe Christian Igel 61 7 0 17 Feb 2021
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems Yi Lin Bo Yang Linchao Li Dongyue Guo Jianwei Zhang Hu Chen Yi Zhang 80 29 0 17 Feb 2021
End-to-End Automatic Speech Recognition with Deep Mutual Learning Ryo Masumura Mana Ihori Akihiko Takashima Tomohiro Tanaka Takanori Ashihara 36 5 0 16 Feb 2021
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet M. O. Topal Anil Bas Imke van Heerden LLMAG AI4CE 73 91 0 16 Feb 2021
Improving speech recognition models with small samples for air traffic control systems Yi Lin Qin Li Bo Yang Zhen Yan Huachun Tan Zhengmao Chen 104 32 0 16 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT Ye Bai Jiangyan Yi J. Tao Zhengkun Tian Zhengqi Wen Shuai Zhang RALM 91 52 0 15 Feb 2021
Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification Bidisha Sharma Maulik C. Madhavi Haizhou Li 51 20 0 15 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition Priyabrata Karmakar S. Teng Guojun Lu 58 27 0 14 Feb 2021
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding Milind Rao Pranav Dheram Gautam Tiwari A. Raju J. Droppo Ariya Rastrow A. Stolcke 50 17 0 12 Feb 2021
Sparsification via Compressed Sensing for Automatic Speech Recognition Kai Zhen Hieu Duy Nguyen Feng-Ju Chang Athanasios Mouchtaris Ariya Rastrow . 63 13 0 09 Feb 2021
Intermediate Loss Regularization for CTC-based Speech Recognition Jaesong Lee Shinji Watanabe 153 140 0 05 Feb 2021
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition Zhong Meng Naoyuki Kanda Yashesh Gaur S. Parthasarathy Eric Sun Liang Lu Xie Chen Jinyu Li Jiawei Liu AuLLM 104 53 0 02 Feb 2021
End2End Acoustic to Semantic Transduction Valentin Pelloin Nathalie Camelin Antoine Laurent R. Mori Antoine Caubrière Yannick Esteve S. Meignier 43 15 0 01 Feb 2021
Speech Recognition by Simply Fine-tuning BERT Wen-Chin Huang Chia-Hua Wu Shang-Bao Luo Kuan-Yu Chen Hsin-Min Wang Tomoki Toda 126 28 0 30 Jan 2021
Transformer Based Deliberation for Two-Pass Speech Recognition Ke Hu Ruoming Pang Tara N. Sainath Trevor Strohman 76 38 0 27 Jan 2021
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis Treatment Thesath Nanayakkara G. Clermont C. Langmead D. Swigon AI4CE 91 24 0 21 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and Human A. Hussein Shinji Watanabe Ahmed M. Ali VLM 75 50 0 21 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data Chengyi Wang Yu-Huan Wu Yao Qian K. Kumatani Shujie Liu Furu Wei Michael Zeng Xuedong Huang OT SSL 92 115 0 19 Jan 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications S. Latif Heriberto Cuayáhuitl Farrukh Pervez Fahad Shamshad Hafiz Shehbaz Ali Min Zhang OffRL 125 75 0 01 Jan 2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans Shinji Watanabe Florian Boyer Xuankai Chang Pengcheng Guo Tomoki Hayashi ... Shigeki Karita Chenda Li Jing Shi Aswin Shanmugam Subramanian Wangyou Zhang VLM 110 38 0 23 Dec 2020
ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition Zuoyu Yan Xiaode Zhang Liangcai Gao Ke Yuan Zhi Tang 63 17 0 23 Dec 2020
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition Yubei Xiao Ke Gong Pan Zhou Guolin Zheng Xiaodan Liang Liang Lin 80 35 0 22 Dec 2020
NeurST: Neural Speech Translation Toolkit Chengqi Zhao Mingxuan Wang Qianqian Dong Rong Ye Lei Li 91 32 0 18 Dec 2020
The effectiveness of unsupervised subword modeling with autoregressive and cross-lingual phone-aware networks Siyuan Feng O. Scharenborg SSL 56 3 0 17 Dec 2020
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition Minglun Han Linhao Dong Shiyu Zhou Bo Xu 73 23 0 17 Dec 2020
AV Taris: Online Audio-Visual Speech Recognition George Sterpu N. Harte 63 1 0 14 Dec 2020
Bayesian Learning for Deep Neural Network Adaptation Xurong Xie Xunying Liu Tan Lee Lan Wang BDL 120 22 0 14 Dec 2020
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging Rohit Prabhavalkar Yanzhang He David Rybach S. Campbell A. Narayanan Trevor Strohman Tara N. Sainath 128 35 0 12 Dec 2020