ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.01211
  4. Cited By
Listen, Attend and Spell

Listen, Attend and Spell

5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
    RALM
ArXivPDFHTML

Papers citing "Listen, Attend and Spell"

50 / 1,033 papers shown
Title
Low-Dimensional Bottleneck Features for On-Device Continuous Speech
  Recognition
Low-Dimensional Bottleneck Features for On-Device Continuous Speech Recognition
David B. Ramsay
Kevin Kilgour
Dominik Roblek
Matthew Sharifi
BDL
6
3
0
31 Oct 2018
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through
  Estimator
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator
Andros Tjandra
S. Sakti
Satoshi Nakamura
13
44
0
31 Oct 2018
Towards End-to-End Code-Switching Speech Recognition
Towards End-to-End Code-Switching Speech Recognition
Ne Luo
Dongwei Jiang
Shuaijiang Zhao
Caixia Gong
Wei Zou
Xiangang Li
11
47
0
31 Oct 2018
Towards End-to-end Automatic Code-Switching Speech Recognition
Towards End-to-end Automatic Code-Switching Speech Recognition
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
8
12
0
30 Oct 2018
Contextual Speech Recognition with Difficult Negative Training Examples
Contextual Speech Recognition with Difficult Negative Training Examples
Uri Alon
Golan Pundak
Tara N. Sainath
9
39
0
29 Oct 2018
An improved hybrid CTC-Attention model for speech recognition
An improved hybrid CTC-Attention model for speech recognition
Zhe Yuan
Zhuoran Lyu
Jiwei Li
Xi Zhou
13
9
0
29 Oct 2018
Bayesian Compression for Natural Language Processing
Bayesian Compression for Natural Language Processing
Nadezhda Chirkova
E. Lobacheva
Dmitry Vetrov
BDL
19
15
0
25 Oct 2018
Tackling Sequence to Sequence Mapping Problems with Neural Networks
Tackling Sequence to Sequence Mapping Problems with Neural Networks
Lei Yu
AIMat
20
2
0
25 Oct 2018
The MeMAD Submission to the IWSLT 2018 Speech Translation Task
The MeMAD Submission to the IWSLT 2018 Speech Translation Task
U. Sulubacak
Jörg Tiedemann
Aku Rouhe
Stig-Arne Gronroos
M. Kurimo
14
3
0
24 Oct 2018
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
11
129
0
16 Oct 2018
The State of Speech in HCI: Trends, Themes and Challenges
The State of Speech in HCI: Trends, Themes and Challenges
L. Clark
Philip R. Doyle
Diego Garaialde
E. Gilmartin
Stephan Schlögl
Jens Edlund
M. Aylett
João P. Cabral
Cosmin Munteanu
Benjamin R. Cowan
15
206
0
16 Oct 2018
Listening for Sirens: Locating and Classifying Acoustic Alarms in City
  Scenes
Listening for Sirens: Locating and Classifying Acoustic Alarms in City Scenes
Letizia Marchegiani
Paul Newman
11
35
0
11 Oct 2018
Multilingual sequence-to-sequence speech recognition: architecture,
  transfer learning, and language modeling
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling
Jaejin Cho
M. Baskar
Ruizhi Li
Matthew Wiesner
Sri Harish Reddy Mallidi
Nelson Yalta
M. Karafiát
Shinji Watanabe
Takaaki Hori
28
120
0
04 Oct 2018
Optimal Completion Distillation for Sequence Learning
Optimal Completion Distillation for Sequence Learning
S. Sabour
William Chan
Mohammad Norouzi
24
45
0
02 Oct 2018
From Audio to Semantics: Approaches to end-to-end spoken language
  understanding
From Audio to Semantics: Approaches to end-to-end spoken language understanding
Parisa Haghani
A. Narayanan
M. Bacchiani
Galen Chuang
Neeraj Gaur
Pedro J. Moreno
Rohit Prabhavalkar
Zhongdi Qu
Austin Waters
13
150
0
24 Sep 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
24
23
0
19 Sep 2018
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Attention as a Perspective for Learning Tempo-invariant Audio Queries
Matthias Dorfer
Jan Hajic
Gerhard Widmer
14
2
0
15 Sep 2018
Searching for Efficient Multi-Scale Architectures for Dense Image
  Prediction
Searching for Efficient Multi-Scale Architectures for Dense Image Prediction
Liang-Chieh Chen
Maxwell D. Collins
Yukun Zhu
George Papandreou
Barret Zoph
Florian Schroff
Hartwig Adam
Jonathon Shlens
3DV
24
408
0
11 Sep 2018
Sparse Attentive Backtracking: Temporal CreditAssignment Through
  Reminding
Sparse Attentive Backtracking: Temporal CreditAssignment Through Reminding
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Michael C. Mozer
C. Pal
Yoshua Bengio
CLL
19
85
0
11 Sep 2018
Indicatements that character language models learn English
  morpho-syntactic units and regularities
Indicatements that character language models learn English morpho-syntactic units and regularities
Yova Kementchedjhieva
Adam Lopez
11
10
0
31 Aug 2018
End-to-end Speech Recognition with Adaptive Computation Steps
End-to-end Speech Recognition with Adaptive Computation Steps
Mohan Li
Min Liu
Masanori Hattori
6
33
0
30 Aug 2018
Revisiting Character-Based Neural Machine Translation with Capacity and
  Compression
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
Colin Cherry
George F. Foster
Ankur Bapna
Orhan Firat
Wolfgang Macherey
23
94
0
29 Aug 2018
Quantum enhanced cross-validation for near-optimal neural networks
  architecture selection
Quantum enhanced cross-validation for near-optimal neural networks architecture selection
P. D. Santos
Rodrigo S. Sousa
Ismael C. S. Araújo
A. J. D. Silva
19
7
0
27 Aug 2018
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Soojeong Kim
Gyeong-In Yu
Hojin Park
Sungwoo Cho
Eunji Jeong
Hyeonmin Ha
Sanha Lee
Joo Seong Jeong
Byung-Gon Chun
23
73
0
08 Aug 2018
A Comparison of Techniques for Language Model Integration in
  Encoder-Decoder Speech Recognition
A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition
Shubham Toshniwal
Anjuli Kannan
Chung-Cheng Chiu
Yonghui Wu
Tara N. Sainath
Karen Livescu
19
157
0
27 Jul 2018
A small Griko-Italian speech translation corpus
A small Griko-Italian speech translation corpus
Marcely Zanon Boito
Antonios Anastasopoulos
M. Lekakou
Aline Villavicencio
Laurent Besacier
18
11
0
27 Jul 2018
Zero-shot keyword spotting for visual speech recognition in-the-wild
Zero-shot keyword spotting for visual speech recognition in-the-wild
Themos Stafylakis
Georgios Tzimiropoulos
32
38
0
23 Jul 2018
Acoustic-to-Word Recognition with Sequence-to-Sequence Models
Acoustic-to-Word Recognition with Sequence-to-Sequence Models
Shruti Palaskar
Florian Metze
6
19
0
23 Jul 2018
Multi-scale Alignment and Contextual History for Attention Mechanism in
  Sequence-to-sequence Model
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model
Andros Tjandra
S. Sakti
Satoshi Nakamura
6
12
0
22 Jul 2018
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech
  Synthesis
Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
13
83
0
18 Jul 2018
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword
  Units
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units
Zhangyu Xiao
Zhijian Ou
Wei Chu
Hui-Ching Lin
38
38
0
13 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable
  Precision LSTM Networks on FPGAs
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
19
72
0
11 Jul 2018
Detecting Visual Relationships Using Box Attention
Detecting Visual Relationships Using Box Attention
Alexander Kolesnikov
Alina Kuznetsova
Christoph H. Lampert
V. Ferrari
45
65
0
05 Jul 2018
Exploring End-to-End Techniques for Low-Resource Speech Recognition
Exploring End-to-End Techniques for Low-Resource Speech Recognition
Vladimir Bataev
M. Korenevsky
Ivan Medennikov
Alexander Zatvornitsky
19
9
0
02 Jul 2018
Punctuation Prediction Model for Conversational Speech
Punctuation Prediction Model for Conversational Speech
Piotr Żelasko
Piotr Szymañski
Jan Mizgajski
Adrian Szymczak
Yishay Carmiel
Najim Dehak
19
54
0
02 Jul 2018
Extending Recurrent Neural Aligner for Streaming End-to-End Speech
  Recognition in Mandarin
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin
Linhao Dong
Shiyu Zhou
Wei Chen
Bo Xu
24
22
0
17 Jun 2018
Fusing Recency into Neural Machine Translation with an Inter-Sentence
  Gate Model
Fusing Recency into Neural Machine Translation with an Inter-Sentence Gate Model
Shaohui Kuang
Deyi Xiong
34
26
0
12 Jun 2018
Focused Hierarchical RNNs for Conditional Sequence Processing
Focused Hierarchical RNNs for Conditional Sequence Processing
Nan Rosemary Ke
Konrad Zolna
Alessandro Sordoni
Zhouhan Lin
Adam Trischler
Yoshua Bengio
Joelle Pineau
Laurent Charlin
C. Pal
AIMat
24
25
0
12 Jun 2018
Natural Language Generation for Electronic Health Records
Natural Language Generation for Electronic Health Records
Scott H. Lee
SyDa
16
81
0
01 Jun 2018
Learn to Combine Modalities in Multimodal Deep Learning
Learn to Combine Modalities in Multimodal Deep Learning
Kuan Liu
Yanen Li
N. Xu
Premkumar Natarajan
14
149
0
29 May 2018
Can DNNs Learn to Lipread Full Sentences?
Can DNNs Learn to Lipread Full Sentences?
George Sterpu
Christian Saam
N. Harte
16
8
0
29 May 2018
Multimodal Speaker Segmentation and Diarization using Lexical and
  Acoustic Cues via Sequence to Sequence Neural Networks
Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks
Tae Jin Park
P. Georgiou
19
37
0
28 May 2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Yu-An Chung
W. Weng
S. Tong
James R. Glass
17
99
0
18 May 2018
A comparable study of modeling units for end-to-end Mandarin speech
  recognition
A comparable study of modeling units for end-to-end Mandarin speech recognition
Wei Zou
Dongwei Jiang
Shuaijiang Zhao
Xiangang Li
21
32
0
10 May 2018
Improved training of end-to-end attention models for speech recognition
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schluter
Hermann Ney
VLM
16
269
0
08 May 2018
A Regression Model of Recurrent Deep Neural Networks for Noise Robust
  Estimation of the Fundamental Frequency Contour of Speech
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech
Akihiro Kato
Tomi Kinnunen
14
7
0
08 May 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition
Automatic Documentation of ICD Codes with Far-Field Speech Recognition
Albert Haque
Corinna Fukushima
11
0
0
30 Apr 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for
  Neural Sequence Prediction
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
29
6
0
29 Apr 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the
  Transformer in Mandarin Chinese
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
21
116
0
28 Apr 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
26
160
0
25 Apr 2018
Previous
123...18192021
Next