v1v2 (latest)

Listen, Attend and Spell

5 August 2015

Papers citing "Listen, Attend and Spell"

50 / 1,041 papers shown

Title
Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks Tae Jin Park P. Georgiou 63 37 0 28 May 2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces Yu-An Chung W. Weng S. Tong James R. Glass 91 100 0 18 May 2018
A comparable study of modeling units for end-to-end Mandarin speech recognition Wei Zou Dongwei Jiang Shuaijiang Zhao Xiangang Li 60 33 0 10 May 2018
Improved training of end-to-end attention models for speech recognition Albert Zeyer Kazuki Irie Ralf Schluter Hermann Ney VLM 83 270 0 08 May 2018
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech Akihiro Kato Tomi Kinnunen 43 7 0 08 May 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition Albert Haque Corinna Fukushima 23 0 0 30 Apr 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction Zihang Dai Qizhe Xie Eduard H. Hovy 46 6 0 29 Apr 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese Shiyu Zhou Linhao Dong Shuang Xu Bo Xu 99 118 0 28 Apr 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated) Dong Yu Jinyu Li VLM 77 160 0 25 Apr 2018
Multi-Head Decoder for End-to-End Speech Recognition Tomoki Hayashi Shinji Watanabe Tomoki Toda K. Takeda 57 16 0 22 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression Shihui Yin Gaurav Srivastava S. Venkataramanaiah C. Chakrabarti Visar Berisha Jae-sun Seo 27 8 0 19 Apr 2018
Conditional End-to-End Audio Transforms Albert Haque Michelle Guo Prateek Verma 114 41 0 30 Mar 2018
ESPnet: End-to-End Speech Processing Toolkit Shinji Watanabe Takaaki Hori Shigeki Karita Tomoki Hayashi Jiro Nishitoba ... Jahn Heymann Sanjeev Khudanpur Nanxin Chen Adithya Renduchintala Tsubasa Ochiai VLM 128 1,515 0 30 Mar 2018
Single Stream Parallelization of Recurrent Neural Networks for Low Power and Fast Inference Wonyong Sung Jinhwan Park 36 5 0 30 Mar 2018
Attention-based End-to-End Models for Small-Footprint Keyword Spotting Changhao Shan Junbo Zhang Yujun Wang Lei Xie AI4TS 61 110 0 29 Mar 2018
Machine Speech Chain with One-shot Speaker Adaptation Andros Tjandra S. Sakti Satoshi Nakamura 71 56 0 28 Mar 2018
Multi-Modal Data Augmentation for End-to-End ASR Adithya Renduchintala Shuoyang Ding Sanjeev Khudanpur Shinji Watanabe 80 36 0 27 Mar 2018
Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus Md Mahadi Hasan Nahid Md. Ashraful Islam Bishwajit Purkaystha Md. Saiful Islam 32 5 0 27 Mar 2018
Self-Attentional Acoustic Models Matthias Sperber Jan Niehues Graham Neubig Sebastian Stüker A. Waibel 62 153 0 26 Mar 2018
Leveraging translations for speech transcription in low-resource settings Antonios Anastasopoulos David Chiang 63 27 0 23 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning Lijun Li Boqing Gong 71 56 0 21 Mar 2018
ORGaNICs: A Theory of Working Memory in Brains and Machines D. Heeger Wayne E. Mackey 90 7 0 16 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC Kai Xu Dawei Li N. Cassimatis Xiaolong Wang 70 97 0 13 Mar 2018
Feature Selective Small Object Detection via Knowledge-based Recurrent Attentive Neural Network Kai Yi Zhiqiang Jian Shi-tao Chen N. Zheng ObjD 55 6 0 13 Mar 2018
Seq2Sick: Evaluating the Robustness of Sequence-to-Sequence Models with Adversarial Examples Minhao Cheng Jinfeng Yi Pin-Yu Chen Huan Zhang Cho-Jui Hsieh SILM AAML 118 245 0 03 Mar 2018
XNMT: The eXtensible Neural Machine Translation Toolkit Graham Neubig Matthias Sperber Xinyi Wang Matthieu Felix Austin Matthews ... Philip Arthur Pierre Godard John Hewitt Rachid Riad Liming Wang 79 67 0 01 Mar 2018
Learning Longer-term Dependencies in RNNs with Auxiliary Losses Trieu H. Trinh Andrew M. Dai Thang Luong Quoc V. Le 98 181 0 01 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis Tal Ben-Nun Torsten Hoefler GNN 87 713 0 26 Feb 2018
Towards end-to-end spoken language understanding Dmitriy Serdyuk Yongqiang Wang Christian Fuegen Anuj Kumar Baiyang Liu Yoshua Bengio 60 234 0 23 Feb 2018
Tied Multitask Learning for Neural Speech Translation Antonios Anastasopoulos David Chiang 182 174 0 19 Feb 2018
Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation Takatomo Kano S. Sakti Satoshi Nakamura 79 46 0 13 Feb 2018
Recurrent Neural Network-Based Semantic Variational Autoencoder for Sequence-to-Sequence Learning Myeongjun Jang Seungwan Seo Pilsung Kang DRL 90 57 0 09 Feb 2018
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition Xuesong Yang Kartik Audhkhasi Andrew Rosenberg Samuel Thomas Bhuvana Ramabhadran M. Hasegawa-Johnson 60 71 0 07 Feb 2018
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling Prashanth Gurunath Shivakumar Haoqi Li Kevin Knight P. Georgiou 66 28 0 07 Feb 2018
DeepHeart: Semi-Supervised Sequence Learning for Cardiovascular Risk Prediction Brandon Ballinger Johnson Hsieh Avesh Singh N. Sohoni Jack Wang ... G. Marcus Jose M. Sanchez Carol Maguire J. Olgin M. Pletcher HAI 92 132 0 07 Feb 2018
Letter-Based Speech Recognition with Gated ConvNets Vitaliy Liptchinsky Gabriel Synnaeve R. Collobert 83 72 0 22 Dec 2017
Subword and Crossword Units for CTC Acoustic Models Thomas Zenkel Ramon Sanabria Florian Metze A. Waibel 59 33 0 19 Dec 2017
Monotonic Chunkwise Attention Chung-Cheng Chiu Colin Raffel 98 256 0 14 Dec 2017
Building competitive direct acoustics-to-word models for English conversational speech recognition Kartik Audhkhasi Brian Kingsbury Bhuvana Ramabhadran G. Saon M. Picheny 72 152 0 08 Dec 2017
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models Rohit Prabhavalkar Tara N. Sainath Yonghui Wu Patrick Nguyen Zhiwen Chen Chung-Cheng Chiu Anjuli Kannan 82 162 0 05 Dec 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks Xin Wang Feng Yu Zi-Yi Dou Trevor Darrell Joseph E. Gonzalez 154 640 0 26 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks Nan Rosemary Ke Anirudh Goyal O. Bilaniuk Jonathan Binas Laurent Charlin C. Pal Yoshua Bengio 78 15 0 07 Nov 2017
Multilingual Speech Recognition With A Single End-To-End Model Shubham Toshniwal Tara N. Sainath Ron J. Weiss Yue Liu Pedro J. Moreno Eugene Weinstein Kanishka Rao 72 264 0 06 Nov 2017
Sequence-to-Sequence ASR Optimization via Reinforcement Learning Andros Tjandra S. Sakti Satoshi Nakamura AI4TS 96 26 0 30 Oct 2017
A Study of All-Convolutional Encoders for Connectionist Temporal Classification Kalpesh Krishna Liang Lu Kevin Gimpel Karen Livescu 59 11 0 28 Oct 2017
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models Yanzhang He Rohit Prabhavalkar Kanishka Rao Wei Li A. Bakhtin Ian McGraw AI4TS 73 91 0 26 Oct 2017
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR D. Lim 37 2 0 12 Oct 2017
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition Bowen Shi Karen Livescu 49 14 0 09 Oct 2017
Attention-based Wav2Text with Feature Transfer Learning Andros Tjandra S. Sakti Satoshi Nakamura 47 20 0 22 Sep 2017
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems Yonatan Belinkov James R. Glass 55 84 0 13 Sep 2017