v1v2 (latest)

Listen, Attend and Spell

5 August 2015

Papers citing "Listen, Attend and Spell"

50 / 1,041 papers shown

Title
End-to-End Speech Translation with Knowledge Distillation Yuchen Liu Hao Xiong Zhongjun He Jiajun Zhang Hua Wu Haifeng Wang Chengqing Zong 99 155 0 17 Apr 2019
Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition Jiabin Xue Jiqing Han Tieran Zheng Jiaxing Guo Boyong Wu 76 10 0 17 Apr 2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation Matthias Sperber Graham Neubig Jan Niehues A. Waibel 90 102 0 15 Apr 2019
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning Tao Tu Yuan-Jui Chen Cheng-chieh Yeh Hung-yi Lee 96 88 0 13 Apr 2019
Neuralogram: A Deep Neural Network Based Representation for Audio Signals Prateek Verma C. Chafe J. Berger AI4TS 13 9 0 10 Apr 2019
Performance Monitoring for End-to-End Speech Recognition Ruizhi Li Gregory Sell H. Hermansky 27 2 0 09 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition Tatiana Likhomanenko Gabriel Synnaeve R. Collobert 72 27 0 09 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation Fadi Biadsy Ron J. Weiss Pedro J. Moreno D. Kanvesky Ye Jia 97 115 0 08 Apr 2019
An Attentive Survey of Attention Models S. Chaudhari Varun Mithal Gungor Polatkan R. Ramanath 200 666 0 05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions Awni Y. Hannun Ann Lee Qiantong Xu R. Collobert 86 97 0 04 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models T. Nguyen Sebastian Stüker A. Waibel 45 2 0 31 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition Shiliang Zhang Ming Lei Zhijie Yan 36 16 0 27 Mar 2019
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition Yao Qin Nicholas Carlini Ian Goodfellow G. Cottrell Colin Raffel AAML 113 381 0 22 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model Yangyang Shi M. Hwang X. Lei AI4TS 32 14 0 12 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos Egor Lakomkin S. Magg C. Weber S. Wermter 41 19 0 01 Mar 2019
Neural Reverse Engineering of Stripped Binaries using Augmented Control Flow Graphs Yaniv David Uri Alon Eran Yahav 56 13 0 25 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences Albert Haque Michelle Guo Prateek Verma Li Fei-Fei 80 51 0 20 Feb 2019
A spelling correction model for end-to-end speech recognition Jinxi Guo Tara N. Sainath Ron J. Weiss AuLLM KELM 79 142 0 19 Feb 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings Shruti Palaskar Vikas Raunak Florian Metze 38 17 0 18 Feb 2019
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping Linhao Dong Feng Wang Bo Xu 59 91 0 18 Feb 2019
Insertion Transformer: Flexible Sequence Generation via Insertion Operations Mitchell Stern William Chan J. Kiros Jakob Uszkoreit KELM 103 252 0 08 Feb 2019
End-to-end Anchored Speech Recognition Yiming Wang Xing Fan I-Fan Chen Yuzong Liu Tongfei Chen Björn Hoffmeister 80 20 0 06 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition Kazuki Irie Rohit Prabhavalkar Anjuli Kannan A. Bruguier David Rybach Patrick Nguyen 78 37 0 05 Feb 2019
Attention in Natural Language Processing Andrea Galassi Marco Lippi Paolo Torroni GNN 83 484 0 04 Feb 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition Julian Salazar Katrin Kirchhoff Zhiheng Huang AI4TS 57 118 0 22 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units Amit Das Jinyu Li Guoli Ye Rui Zhao Jiawei Liu 61 26 0 31 Dec 2018
Greedy Layerwise Learning Can Scale to ImageNet Eugene Belilovsky Michael Eickenberg Edouard Oyallon 157 181 0 29 Dec 2018
Using an Ancillary Neural Network to Capture Weekends and Holidays in an Adjoint Neural Network Architecture for Intelligent Building Management Zhicheng Ding Mehmet Kerem Turkcan A. Boulanger 23 3 0 26 Dec 2018
An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition Devesh Walawalkar Yihui He R. Pillai 50 1 0 21 Dec 2018
End-to-End Classification of Reverberant Rooms using DNNs C. Papayiannis C. Evers Patrick A. Naylor 97 12 0 21 Dec 2018
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks Raphael Tang Gefei Yang H. Wei Yajie Mao Ferhan Ture Jimmy J. Lin 48 3 0 19 Dec 2018
Sequence Prediction using Spectral RNNs Moritz Wolter Juergen Gall Angela Yao AI4TS 44 2 0 13 Dec 2018
Bayesian Sparsification of Gated Recurrent Neural Networks E. Lobacheva Nadezhda Chirkova Dmitry Vetrov BDL 33 2 0 12 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings Sanjeev Khudanpur Adithya Renduchintala Shinji Watanabe Shuoyang Ding Najim Dehak Sanjeev Khudanpur 91 31 0 10 Dec 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition Jan Kremer Lasse Borgholt Lars Maaløe 64 6 0 28 Nov 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes Yue Liu Yu Zhang Tara N. Sainath Yonghui Wu William Chan AuLLM 79 131 0 22 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition Pan Zhou Wenwen Yang Wei Chen Yanfeng Wang Jia Jia 92 69 0 13 Nov 2018
An Online Attention-based Model for Speech Recognition Ruchao Fan Pan Zhou Wei Chen Jia Jia Gang Liu 71 48 0 13 Nov 2018
Exploring RNN-Transducer for Chinese Speech Recognition Senmao Wang Pan Zhou Wei Chen Jia Jia Lei Xie 87 31 0 13 Nov 2018
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training Yao Wan Wenqiang Yan Jianwei Gao Zhou Zhao Jian Wu Philip S. Yu 78 10 0 12 Nov 2018
Stream attention-based multi-array end-to-end speech recognition Xiaofei Wang Ruizhi Li Sri Harish Reddy Mallidi Takaaki Hori Shinji Watanabe H. Hermansky 77 21 0 12 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech recognition Ruizhi Li Xiaofei Wang Sri Harish Reddy Mallidi Takaaki Hori Shinji Watanabe H. Hermansky 58 13 0 12 Nov 2018
Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition Hiroshi Seki Takaaki Hori Shinji Watanabe 39 2 0 12 Nov 2018
Multimodal Grounding for Sequence-to-Sequence Speech Recognition Ozan Caglayan Ramon Sanabria Shruti Palaskar Loïc Barrault Florian Metze 80 25 0 09 Nov 2018
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Kou Tanaka Hirokazu Kameoka Takuhiro Kaneko Nobukatsu Hojo 79 112 0 09 Nov 2018
Few-shot learning with attention-based sequence-to-sequence models Bertrand Higy P. Bell 60 6 0 08 Nov 2018
Phonetic-attention scoring for deep speaker features in speaker verification Lantian Li Zhiyuan Tang Ying Shi Dong Wang 24 3 0 08 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems Jiayang Liu M. Baskar Weiming Zhang Takaaki Hori Sanjeev Khudanpur Jan ''Honza'' Cernocký 84 18 0 07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language model fusion S. Hariri Jaejin Cho M. Baskar Tatsuya Kawahara R. Brunner 81 43 0 06 Nov 2018
Language model integration based on memory control for sequence to sequence speech recognition Aaron Springer Shinji Watanabe Takaaki Hori M. Baskar Hirofumi Inaguma Jesus Villalba Najim Dehak KELM 77 5 0 06 Nov 2018