Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1506.07503
Cited By
Attention-Based Models for Speech Recognition
24 June 2015
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention-Based Models for Speech Recognition"
50 / 395 papers shown
Title
Attention Forcing for Sequence-to-sequence Model Training
Qingyun Dou
Yiting Lu
Joshua Efiong
Mark Gales
27
6
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
27
37
0
22 Sep 2019
Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition
Lei Kang
Marçal Rusiñol
Alicia Fornés
Pau Riba
M. Villegas
16
23
0
18 Sep 2019
Acoustic scene analysis with multi-head attention networks
Weimin Wang
Weiran Wang
Ming Sun
Chao Wang
19
3
0
16 Sep 2019
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
190
237
0
13 Sep 2019
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
21
8
0
30 Aug 2019
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
32
147
0
29 Aug 2019
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Bin Ding
Chengjiang Long
Ling Zhang
Chunxia Xiao
GAN
3DH
33
151
0
04 Aug 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
Cesare Alippi
AI4TS
27
220
0
22 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
24
38
0
13 Jul 2019
Learning Blended, Precise Semantic Program Embeddings
Ke Wang
Z. Su
NAI
30
25
0
03 Jul 2019
Attention model for articulatory features detection
I. Karaulov
Dmytro Tkanov
14
6
0
02 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
36
797
0
25 Jun 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
22
99
0
25 Jun 2019
Saliency-driven Word Alignment Interpretation for Neural Machine Translation
Shuoyang Ding
Hainan Xu
Philipp Koehn
22
55
0
25 Jun 2019
Query-based Interactive Recommendation by Meta-Path and Adapted Attention-GRU
Yu Zhu
Yu Gong
Qingwen Liu
Yingcai Ma
Wenwu Ou
Junxiong Zhu
Beidou Wang
Ziyu Guan
Deng Cai
LRM
19
15
0
24 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
13
27
0
17 Jun 2019
Real to H-space Encoder for Speech Recognition
Titouan Parcollet
Mohamed Morchid
G. Linarès
R. Mori
23
0
0
17 Jun 2019
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
36
50
0
13 Jun 2019
Gradual Machine Learning for Aspect-level Sentiment Analysis
Yanyan Wang
Qun Chen
Jiquan Shen
Boyi Hou
Ahmed Murtadha
Zhanhuai Li
25
1
0
06 Jun 2019
Sequential Neural Networks as Automata
William Merrill
23
74
0
04 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
27
125
0
27 May 2019
Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks
Guanzhong Tian
Yi Yuan
Yong-Jin Liu
CVBM
18
45
0
27 May 2019
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
22
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
16
4
0
17 May 2019
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
27
209
0
14 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
44
101
0
13 May 2019
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition
Zecheng Xie
Yaoxiong Huang
Yuanzhi Zhu
Lianwen Jin
Yuliang Liu
Lele Xie
25
92
0
17 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
21
112
0
08 Apr 2019
Relation-Aware Global Attention for Person Re-identification
Zhizheng Zhang
Cuiling Lan
Wenjun Zeng
Xin Jin
Zhibo Chen
3DPC
28
476
0
05 Apr 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
21
112
0
18 Mar 2019
Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees
J. Chorowski
A. Lancucki
Bartosz Kostka
Michal Zapotoczny
19
5
0
14 Jan 2019
Speaker Adaptation for End-to-End CTC Models
Ke Li
Jinyu Li
Yong Zhao
Kshitiz Kumar
Jiawei Liu
18
24
0
04 Jan 2019
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
18
156
0
18 Dec 2018
Automatic Grammar Augmentation for Robust Voice Command Recognition
Yang Yang
Anusha Lalitha
Jinwon Lee
Chris Lott
21
3
0
14 Nov 2018
Exploring RNN-Transducer for Chinese Speech Recognition
Senmao Wang
Pan Zhou
Wei Chen
Jia Jia
Lei Xie
27
30
0
13 Nov 2018
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
25
21
0
12 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
22
13
0
12 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
30
25
0
12 Nov 2018
Few-shot learning with attention-based sequence-to-sequence models
Bertrand Higy
P. Bell
19
6
0
08 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Matthew Wiesner
Jan ''Honza'' Cernocký
33
18
0
07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
6
42
0
06 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
6
46
0
02 Nov 2018
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition
Hui Li
Peng Wang
Chunhua Shen
Guyu Zhang
16
373
0
02 Nov 2018
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Zhiping Zeng
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Chng Eng Siong
Haizhou Li
16
92
0
01 Nov 2018
Multi-Head Attention with Disagreement Regularization
Jian Li
Zhaopeng Tu
Baosong Yang
Michael R. Lyu
Tong Zhang
27
145
0
24 Oct 2018
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Georgios Tzimiropoulos
M. Pantic
14
129
0
28 Sep 2018
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
27
687
0
06 Sep 2018
Previous
1
2
3
4
5
6
7
8
Next