1508.01211
v1
v2 (latest)
Listen, Attend and Spell
5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
Papers citing "Listen, Attend and Spell"
50 / 1,041 papers shown
Title
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
99
155
0
17 Apr 2019
Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition
Jiabin Xue
Jiqing Han
Tieran Zheng
Jiaxing Guo
Boyong Wu
76
10
0
17 Apr 2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
Matthias Sperber
Graham Neubig
Jan Niehues
A. Waibel
90
102
0
15 Apr 2019
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning
Tao Tu
Yuan-Jui Chen
Cheng-chieh Yeh
Hung-yi Lee
96
88
0
13 Apr 2019
Neuralogram: A Deep Neural Network Based Representation for Audio Signals
Prateek Verma
C. Chafe
J. Berger
AI4TS
13
9
0
10 Apr 2019
Performance Monitoring for End-to-End Speech Recognition
Ruizhi Li
Gregory Sell
H. Hermansky
27
2
0
09 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
72
27
0
09 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanevsky
Ye Jia
97
115
0
08 Apr 2019
An Attentive Survey of Attention Models
S. Chaudhari
Varun Mithal
Gungor Polatkan
R. Ramanath
200
666
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
86
97
0
04 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models
T. Nguyen
Sebastian Stüker
A. Waibel
45
2
0
31 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
36
16
0
27 Mar 2019
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition
Yao Qin
Nicholas Carlini
Ian Goodfellow
G. Cottrell
Colin Raffel
AAML
113
381
0
22 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
32
14
0
12 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
41
19
0
01 Mar 2019
Neural Reverse Engineering of Stripped Binaries using Augmented Control Flow Graphs
Yaniv David
Uri Alon
Eran Yahav
56
13
0
25 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
80
51
0
20 Feb 2019
A spelling correction model for end-to-end speech recognition
Jinxi Guo
Tara N. Sainath
Ron J. Weiss
AuLLM
KELM
79
142
0
19 Feb 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
38
17
0
18 Feb 2019
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping
Linhao Dong
Feng Wang
Bo Xu
59
91
0
18 Feb 2019
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
Mitchell Stern
William Chan
J. Kiros
Jakob Uszkoreit
KELM
103
252
0
08 Feb 2019
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
80
20
0
06 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
78
37
0
05 Feb 2019
Attention in Natural Language Processing
Andrea Galassi
Marco Lippi
Paolo Torroni
GNN
83
484
0
04 Feb 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
57
118
0
22 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
61
26
0
31 Dec 2018
Greedy Layerwise Learning Can Scale to ImageNet
Eugene Belilovsky
Michael Eickenberg
Edouard Oyallon
157
181
0
29 Dec 2018
Using an Ancillary Neural Network to Capture Weekends and Holidays in an Adjoint Neural Network Architecture for Intelligent Building Management
Zhicheng Ding
Mehmet Kerem Turkcan
A. Boulanger
23
3
0
26 Dec 2018
An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition
Devesh Walawalkar
Yihui He
R. Pillai
50
1
0
21 Dec 2018
End-to-End Classification of Reverberant Rooms using DNNs
C. Papayiannis
C. Evers
Patrick A. Naylor
97
12
0
21 Dec 2018
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks
Raphael Tang
Gefei Yang
H. Wei
Yajie Mao
Ferhan Ture
Jimmy J. Lin
48
3
0
19 Dec 2018
Sequence Prediction using Spectral RNNs
Moritz Wolter
Juergen Gall
Angela Yao
AI4TS
44
2
0
13 Dec 2018
Bayesian Sparsification of Gated Recurrent Neural Networks
E. Lobacheva
Nadezhda Chirkova
Dmitry Vetrov
BDL
33
2
0
12 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
91
31
0
10 Dec 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition
Jan Kremer
Lasse Borgholt
Lars Maaløe
64
6
0
28 Nov 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Yue Liu
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
AuLLM
79
131
0
22 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou
Wenwen Yang
Wei Chen
Yanfeng Wang
Jia Jia
92
69
0
13 Nov 2018
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
71
48
0
13 Nov 2018
Exploring RNN-Transducer for Chinese Speech Recognition
Senmao Wang
Pan Zhou
Wei Chen
Jia Jia
Lei Xie
87
31
0
13 Nov 2018
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Yao Wan
Wenqiang Yan
Jianwei Gao
Zhou Zhao
Jian Wu
Philip S. Yu
78
10
0
12 Nov 2018
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
77
21
0
12 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
58
13
0
12 Nov 2018
Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
39
2
0
12 Nov 2018
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Ozan Caglayan
Ramon Sanabria
Shruti Palaskar
Loïc Barrault
Florian Metze
80
25
0
09 Nov 2018
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms
Kou Tanaka
Hirokazu Kameoka
Takuhiro Kaneko
Nobukatsu Hojo
79
112
0
09 Nov 2018
Few-shot learning with attention-based sequence-to-sequence models
Bertrand Higy
P. Bell
60
6
0
08 Nov 2018
Phonetic-attention scoring for deep speaker features in speaker verification
Lantian Li
Zhiyuan Tang
Ying Shi
Dong Wang
24
3
0
08 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Sanjeev Khudanpur
Jan "Honza" Černocký
84
18
0
07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
81
43
0
06 Nov 2018
Language model integration based on memory control for sequence to sequence speech recognition
Aaron Springer
Shinji Watanabe
Takaaki Hori
M. Baskar
Hirofumi Inaguma
Jesus Villalba
Najim Dehak
KELM
77
5
0
06 Nov 2018