Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1508.01211
Cited By
v1
v2 (latest)
Listen, Attend and Spell
5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Listen, Attend and Spell"
50 / 1,041 papers shown
Title
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
87
77
0
05 Nov 2018
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
98
163
0
05 Nov 2018
Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs
Themos Stafylakis
M. H. Khan
Georgios Tzimiropoulos
VLM
70
60
0
03 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
85
47
0
02 Nov 2018
Cycle-consistency training for end-to-end speech recognition
Takaaki Hori
Ramón Fernández Astudillo
Tomoki Hayashi
Yu Zhang
Shinji Watanabe
Jonathan Le Roux
97
87
0
02 Nov 2018
Improving the Robustness of Speech Translation
Xiang-Yang Li
Haiyang Xue
Wei Chen
Yang Liu
Yang Feng
Qun Liu
33
17
0
02 Nov 2018
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
128
292
0
01 Nov 2018
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Zhiping Zeng
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Chng Eng Siong
Haizhou Li
78
92
0
01 Nov 2018
On The Inductive Bias of Words in Acoustics-to-Word Models
Hao Tang
James R. Glass
50
0
0
31 Oct 2018
Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
David B. Ramsay
Kevin Kilgour
Dominik Roblek
Matthew Sharifi
BDL
29
3
0
31 Oct 2018
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator
Andros Tjandra
S. Sakti
Satoshi Nakamura
74
44
0
31 Oct 2018
Towards End-to-End Code-Switching Speech Recognition
Ne Luo
Dongwei Jiang
Shuaijiang Zhao
Caixia Gong
Wei Zou
Xiangang Li
60
47
0
31 Oct 2018
Towards End-to-end Automatic Code-Switching Speech Recognition
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
53
12
0
30 Oct 2018
Contextual Speech Recognition with Difficult Negative Training Examples
Uri Alon
Golan Pundak
Tara N. Sainath
45
40
0
29 Oct 2018
An improved hybrid CTC-Attention model for speech recognition
Zhe Yuan
Zhuoran Lyu
Jiwei Li
Xi Zhou
35
9
0
29 Oct 2018
Bayesian Compression for Natural Language Processing
Nadezhda Chirkova
E. Lobacheva
Dmitry Vetrov
BDL
65
15
0
25 Oct 2018
Tackling Sequence to Sequence Mapping Problems with Neural Networks
Lei Yu
AIMat
41
3
0
25 Oct 2018
The MeMAD Submission to the IWSLT 2018 Speech Translation Task
U. Sulubacak
Jörg Tiedemann
Aku Rouhe
Stig-Arne Gronroos
M. Kurimo
21
3
0
24 Oct 2018
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
85
130
0
16 Oct 2018
The State of Speech in HCI: Trends, Themes and Challenges
L. Clark
Philip R. Doyle
Diego Garaialde
E. Gilmartin
Stephan Schlögl
Jens Edlund
M. Aylett
João P. Cabral
Cosmin Munteanu
Benjamin R. Cowan
75
211
0
16 Oct 2018
Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes
Letizia Marchegiani
Paul Newman
64
38
0
11 Oct 2018
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling
Jaejin Cho
M. Baskar
Ruizhi Li
Sanjeev Khudanpur
Sri Harish Reddy Mallidi
Nelson Yalta
M. Karafiát
Shinji Watanabe
Takaaki Hori
84
122
0
04 Oct 2018
Optimal Completion Distillation for Sequence Learning
S. Sabour
William Chan
Mohammad Norouzi
89
45
0
02 Oct 2018
From Audio to Semantics: Approaches to end-to-end spoken language understanding
Parisa Haghani
A. Narayanan
M. Bacchiani
Galen Chuang
Neeraj Gaur
Pedro J. Moreno
Rohit Prabhavalkar
Zhongdi Qu
Austin Waters
63
152
0
24 Sep 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
67
23
0
19 Sep 2018
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Matthias Dorfer
Jan Hajic
Gerhard Widmer
25
2
0
15 Sep 2018
Searching for Efficient Multi-Scale Architectures for Dense Image Prediction
Liang-Chieh Chen
Maxwell D. Collins
Yukun Zhu
George Papandreou
Barret Zoph
Florian Schroff
Hartwig Adam
Jonathon Shlens
3DV
97
412
0
11 Sep 2018
Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Michael C. Mozer
C. Pal
Yoshua Bengio
CLL
83
86
0
11 Sep 2018
Indicatements that character language models learn English morpho-syntactic units and regularities
Yova Kementchedjhieva
Adam Lopez
65
10
0
31 Aug 2018
End-to-end Speech Recognition with Adaptive Computation Steps
Mohan Li
Min Liu
Masanori Hattori
44
34
0
30 Aug 2018
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
89
96
0
29 Aug 2018
Quantum enhanced cross-validation for near-optimal neural networks architecture selection
P. D. Santos
Rodrigo S. Sousa
Ismael C. S. Araújo
A. J. D. Silva
50
7
0
27 Aug 2018
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Soojeong Kim
Gyeong-In Yu
Hojin Park
Sungwoo Cho
Eunji Jeong
Hyeonmin Ha
Sanha Lee
Joo Seong Jeong
Byung-Gon Chun
67
75
0
08 Aug 2018
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Shubham Toshniwal
Anjuli Kannan
Chung-Cheng Chiu
Yonghui Wu
Tara N. Sainath
Karen Livescu
84
157
0
27 Jul 2018
A small Griko-Italian speech translation corpus
Marcely Zanon Boito
Antonios Anastasopoulos
M. Lekakou
Aline Villavicencio
Laurent Besacier
53
11
0
27 Jul 2018
Zero-shot keyword spotting for visual speech recognition in-the-wild
Themos Stafylakis
Georgios Tzimiropoulos
68
38
0
23 Jul 2018
Acoustic-to-Word Recognition with Sequence-to-Sequence Models
Shruti Palaskar
Florian Metze
47
19
0
23 Jul 2018
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model
Andros Tjandra
S. Sakti
Satoshi Nakamura
26
13
0
22 Jul 2018
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
56
83
0
18 Jul 2018
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units
Zhangyu Xiao
Zhijian Ou
Wei Chu
Hui-Ching Lin
88
38
0
13 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
126
72
0
11 Jul 2018
Detecting Visual Relationships Using Box Attention
Alexander Kolesnikov
Alina Kuznetsova
Christoph H. Lampert
V. Ferrari
99
65
0
05 Jul 2018
Exploring End-to-End Techniques for Low-Resource Speech Recognition
Vladimir Bataev
M. Korenevsky
Ivan Medennikov
Alexander Zatvornitsky
43
9
0
02 Jul 2018
Punctuation Prediction Model for Conversational Speech
Piotr Żelasko
Piotr Szymañski
Jan Mizgajski
Adrian Szymczak
Yishay Carmiel
Najim Dehak
53
54
0
02 Jul 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
Linhao Dong
Shiyu Zhou
Wei Chen
Bo Xu
59
22
0
17 Jun 2018
Fusing Recency into Neural Machine Translation with an Inter-Sentence Gate Model
Shaohui Kuang
Deyi Xiong
78
27
0
12 Jun 2018
Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Rosemary Ke
Konrad Zolna
Alessandro Sordoni
Zhouhan Lin
Adam Trischler
Yoshua Bengio
Joelle Pineau
Laurent Charlin
C. Pal
AIMat
63
25
0
12 Jun 2018
Natural Language Generation for Electronic Health Records
Scott H. Lee
SyDa
59
82
0
01 Jun 2018
Learn to Combine Modalities in Multimodal Deep Learning
Kuan Liu
Yanen Li
N. Xu
Premkumar Natarajan
93
150
0
29 May 2018
Can DNNs Learn to Lipread Full Sentences?
George Sterpu
Christian Saam
N. Harte
51
8
0
29 May 2018
Previous
1
2
3
...
18
19
20
21
Next