Listen, Attend and Spell

5 August 2015
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
    RALM

Papers citing "Listen, Attend and Spell"

50 / 1,041 papers shown
Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks
Tae Jin Park
P. Georgiou
63
37
0
28 May 2018
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces
Yu-An Chung
W. Weng
S. Tong
James R. Glass
91
100
0
18 May 2018
A comparable study of modeling units for end-to-end Mandarin speech recognition
Wei Zou
Dongwei Jiang
Shuaijiang Zhao
Xiangang Li
60
33
0
10 May 2018
Improved training of end-to-end attention models for speech recognition
Albert Zeyer
Kazuki Irie
Ralf Schlüter
Hermann Ney
VLM
83
270
0
08 May 2018
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech
Akihiro Kato
Tomi Kinnunen
43
7
0
08 May 2018
Automatic Documentation of ICD Codes with Far-Field Speech Recognition
Albert Haque
Corinna Fukushima
23
0
0
30 Apr 2018
From Credit Assignment to Entropy Regularization: Two New Algorithms for Neural Sequence Prediction
Zihang Dai
Qizhe Xie
Eduard H. Hovy
46
6
0
29 Apr 2018
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese
Shiyu Zhou
Linhao Dong
Shuang Xu
Bo Xu
99
118
0
28 Apr 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
77
160
0
25 Apr 2018
Multi-Head Decoder for End-to-End Speech Recognition
Tomoki Hayashi
Shinji Watanabe
Tomoki Toda
K. Takeda
57
16
0
22 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression
Shihui Yin
Gaurav Srivastava
S. Venkataramanaiah
C. Chakrabarti
Visar Berisha
Jae-sun Seo
27
8
0
19 Apr 2018
Conditional End-to-End Audio Transforms
Albert Haque
Michelle Guo
Prateek Verma
114
41
0
30 Mar 2018
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
128
1,515
0
30 Mar 2018
Single Stream Parallelization of Recurrent Neural Networks for Low Power and Fast Inference
Wonyong Sung
Jinhwan Park
36
5
0
30 Mar 2018
Attention-based End-to-End Models for Small-Footprint Keyword Spotting
Changhao Shan
Junbo Zhang
Yujun Wang
Lei Xie
AI4TS
61
110
0
29 Mar 2018
Machine Speech Chain with One-shot Speaker Adaptation
Andros Tjandra
S. Sakti
Satoshi Nakamura
71
56
0
28 Mar 2018
Multi-Modal Data Augmentation for End-to-End ASR
Adithya Renduchintala
Shuoyang Ding
Sanjeev Khudanpur
Shinji Watanabe
80
36
0
27 Mar 2018
Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus
Md Mahadi Hasan Nahid
Md. Ashraful Islam
Bishwajit Purkaystha
Md. Saiful Islam
32
5
0
27 Mar 2018
Self-Attentional Acoustic Models
Matthias Sperber
Jan Niehues
Graham Neubig
Sebastian Stüker
A. Waibel
62
153
0
26 Mar 2018
Leveraging translations for speech transcription in low-resource settings
Antonios Anastasopoulos
David Chiang
63
27
0
23 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
71
56
0
21 Mar 2018
ORGaNICs: A Theory of Working Memory in Brains and Machines
D. Heeger
Wayne E. Mackey
90
7
0
16 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
70
97
0
13 Mar 2018
Feature Selective Small Object Detection via Knowledge-based Recurrent Attentive Neural Network
Kai Yi
Zhiqiang Jian
Shi-tao Chen
N. Zheng
ObjD
55
6
0
13 Mar 2018
Seq2Sick: Evaluating the Robustness of Sequence-to-Sequence Models with Adversarial Examples
Minhao Cheng
Jinfeng Yi
Pin-Yu Chen
Huan Zhang
Cho-Jui Hsieh
SILM AAML
118
245
0
03 Mar 2018
XNMT: The eXtensible Neural Machine Translation Toolkit
Graham Neubig
Matthias Sperber
Xinyi Wang
Matthieu Felix
Austin Matthews
...
Philip Arthur
Pierre Godard
John Hewitt
Rachid Riad
Liming Wang
79
67
0
01 Mar 2018
Learning Longer-term Dependencies in RNNs with Auxiliary Losses
Trieu H. Trinh
Andrew M. Dai
Thang Luong
Quoc V. Le
98
181
0
01 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
87
713
0
26 Feb 2018
Towards end-to-end spoken language understanding
Dmitriy Serdyuk
Yongqiang Wang
Christian Fuegen
Anuj Kumar
Baiyang Liu
Yoshua Bengio
60
234
0
23 Feb 2018
Tied Multitask Learning for Neural Speech Translation
Antonios Anastasopoulos
David Chiang
182
174
0
19 Feb 2018
Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation
Takatomo Kano
S. Sakti
Satoshi Nakamura
79
46
0
13 Feb 2018
Recurrent Neural Network-Based Semantic Variational Autoencoder for Sequence-to-Sequence Learning
Myeongjun Jang
Seungwan Seo
Pilsung Kang
DRL
90
57
0
09 Feb 2018
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Xuesong Yang
Kartik Audhkhasi
Andrew Rosenberg
Samuel Thomas
Bhuvana Ramabhadran
M. Hasegawa-Johnson
60
71
0
07 Feb 2018
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling
Prashanth Gurunath Shivakumar
Haoqi Li
Kevin Knight
P. Georgiou
66
28
0
07 Feb 2018
DeepHeart: Semi-Supervised Sequence Learning for Cardiovascular Risk Prediction
Brandon Ballinger
Johnson Hsieh
Avesh Singh
N. Sohoni
Jack Wang
...
G. Marcus
Jose M. Sanchez
Carol Maguire
J. Olgin
M. Pletcher
HAI
92
132
0
07 Feb 2018
Letter-Based Speech Recognition with Gated ConvNets
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
83
72
0
22 Dec 2017
Subword and Crossword Units for CTC Acoustic Models
Thomas Zenkel
Ramon Sanabria
Florian Metze
A. Waibel
59
33
0
19 Dec 2017
Monotonic Chunkwise Attention
Chung-Cheng Chiu
Colin Raffel
98
256
0
14 Dec 2017
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
72
152
0
08 Dec 2017
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models
Rohit Prabhavalkar
Tara N. Sainath
Yonghui Wu
Patrick Nguyen
Zhiwen Chen
Chung-Cheng Chiu
Anjuli Kannan
82
162
0
05 Dec 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks
Xin Wang
Feng Yu
Zi-Yi Dou
Trevor Darrell
Joseph E. Gonzalez
154
640
0
26 Nov 2017
Sparse Attentive Backtracking: Long-Range Credit Assignment in Recurrent Networks
Nan Rosemary Ke
Anirudh Goyal
O. Bilaniuk
Jonathan Binas
Laurent Charlin
C. Pal
Yoshua Bengio
78
15
0
07 Nov 2017
Multilingual Speech Recognition With A Single End-To-End Model
Shubham Toshniwal
Tara N. Sainath
Ron J. Weiss
Yue Liu
Pedro J. Moreno
Eugene Weinstein
Kanishka Rao
72
264
0
06 Nov 2017
Sequence-to-Sequence ASR Optimization via Reinforcement Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
AI4TS
96
26
0
30 Oct 2017
A Study of All-Convolutional Encoders for Connectionist Temporal Classification
Kalpesh Krishna
Liang Lu
Kevin Gimpel
Karen Livescu
59
11
0
28 Oct 2017
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models
Yanzhang He
Rohit Prabhavalkar
Kanishka Rao
Wei Li
A. Bakhtin
Ian McGraw
AI4TS
73
91
0
26 Oct 2017
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
D. Lim
37
2
0
12 Oct 2017
Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition
Bowen Shi
Karen Livescu
49
14
0
09 Oct 2017
Attention-based Wav2Text with Feature Transfer Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
47
20
0
22 Sep 2017
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Yonatan Belinkov
James R. Glass
55
84
0
13 Sep 2017