1508.01211
Cited By
Listen, Attend and Spell
5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
Papers citing "Listen, Attend and Spell"
50 / 1,033 papers shown
Title
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models
T. Nguyen
Sebastian Stüker
A. Waibel
22
2
0
31 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
22
15
0
27 Mar 2019
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition
Yao Qin
Nicholas Carlini
Ian Goodfellow
G. Cottrell
Colin Raffel
AAML
38
376
0
22 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
20
14
0
12 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
18
19
0
01 Mar 2019
Neural Reverse Engineering of Stripped Binaries using Augmented Control Flow Graphs
Yaniv David
Uri Alon
Eran Yahav
11
13
0
25 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
28
51
0
20 Feb 2019
A spelling correction model for end-to-end speech recognition
Jinxi Guo
Tara N. Sainath
Ron J. Weiss
AuLLM
KELM
32
139
0
19 Feb 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
22
17
0
18 Feb 2019
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping
Linhao Dong
Feng Wang
Bo Xu
20
90
0
18 Feb 2019
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
Mitchell Stern
William Chan
J. Kiros
Jakob Uszkoreit
KELM
31
247
0
08 Feb 2019
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
13
20
0
06 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
10
37
0
05 Feb 2019
Attention in Natural Language Processing
Andrea Galassi
Marco Lippi
Paolo Torroni
GNN
36
469
0
04 Feb 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
14
117
0
22 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
11
26
0
31 Dec 2018
Greedy Layerwise Learning Can Scale to ImageNet
Eugene Belilovsky
Michael Eickenberg
Edouard Oyallon
9
180
0
29 Dec 2018
Using an Ancillary Neural Network to Capture Weekends and Holidays in an Adjoint Neural Network Architecture for Intelligent Building Management
Zhicheng Ding
Mehmet Kerem Turkcan
A. Boulanger
11
3
0
26 Dec 2018
An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition
Devesh Walawalkar
Yihui He
R. Pillai
28
1
0
21 Dec 2018
End-to-End Classification of Reverberant Rooms using DNNs
C. Papayiannis
C. Evers
Patrick A. Naylor
8
12
0
21 Dec 2018
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks
Raphael Tang
Gefei Yang
H. Wei
Yajie Mao
Ferhan Ture
Jimmy J. Lin
30
3
0
19 Dec 2018
Sequence Prediction using Spectral RNNs
Moritz Wolter
Juergen Gall
Angela Yao
AI4TS
19
2
0
13 Dec 2018
Bayesian Sparsification of Gated Recurrent Neural Networks
E. Lobacheva
Nadezhda Chirkova
Dmitry Vetrov
BDL
18
2
0
12 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Matthew Wiesner
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
21
32
0
10 Dec 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition
Jan Kremer
Lasse Borgholt
Lars Maaløe
34
6
0
28 Nov 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Bo-wen Li
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
AuLLM
22
129
0
22 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou
Wenwen Yang
Wei Chen
Yanfeng Wang
Jia Jia
24
69
0
13 Nov 2018
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
11
48
0
13 Nov 2018
Exploring RNN-Transducer for Chinese Speech Recognition
Senmao Wang
Pan Zhou
Wei Chen
Jia Jia
Lei Xie
27
30
0
13 Nov 2018
Improved Dynamic Memory Network for Dialogue Act Classification with Adversarial Training
Yao Wan
Wenqiang Yan
Jianwei Gao
Zhou Zhao
Jian Wu
Philip S. Yu
27
10
0
12 Nov 2018
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
25
21
0
12 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
22
13
0
12 Nov 2018
Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
19
2
0
12 Nov 2018
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Ozan Caglayan
Ramon Sanabria
Shruti Palaskar
Loïc Barrault
Florian Metze
26
25
0
09 Nov 2018
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms
Kou Tanaka
Hirokazu Kameoka
Takuhiro Kaneko
Nobukatsu Hojo
17
111
0
09 Nov 2018
Few-shot learning with attention-based sequence-to-sequence models
Bertrand Higy
P. Bell
19
6
0
08 Nov 2018
Phonetic-attention scoring for deep speaker features in speaker verification
Lantian Li
Zhiyuan Tang
Ying Shi
Dong Wang
8
3
0
08 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Matthew Wiesner
Jan "Honza" Cernocký
33
18
0
07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
6
42
0
06 Nov 2018
Language model integration based on memory control for sequence to sequence speech recognition
Aaron Springer
Shinji Watanabe
Takaaki Hori
M. Baskar
Hirofumi Inaguma
Jesus Villalba
Najim Dehak
KELM
41
5
0
06 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
16
76
0
05 Nov 2018
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
31
159
0
05 Nov 2018
Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs
Themos Stafylakis
M. H. Khan
Georgios Tzimiropoulos
VLM
16
59
0
03 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
6
46
0
02 Nov 2018
Cycle-consistency training for end-to-end speech recognition
Takaaki Hori
Ramón Fernández Astudillo
Tomoki Hayashi
Yu Zhang
Shinji Watanabe
Jonathan Le Roux
20
87
0
02 Nov 2018
Improving the Robustness of Speech Translation
Xiang-Yang Li
Haiyang Xue
Wei Chen
Yang Liu
Yang Feng
Qun Liu
14
17
0
02 Nov 2018
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
24
286
0
01 Nov 2018
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Zhiping Zeng
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Chng Eng Siong
Haizhou Li
16
92
0
01 Nov 2018
On The Inductive Bias of Words in Acoustics-to-Word Models
Hao Tang
James R. Glass
16
0
0
31 Oct 2018