ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1506.07503
  4. Cited By
Attention-Based Models for Speech Recognition

Attention-Based Models for Speech Recognition

24 June 2015
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
ArXivPDFHTML

Papers citing "Attention-Based Models for Speech Recognition"

50 / 395 papers shown
Title
Attention Forcing for Sequence-to-sequence Model Training
Attention Forcing for Sequence-to-sequence Model Training
Qingyun Dou
Yiting Lu
Joshua Efiong
Mark Gales
27
6
0
26 Sep 2019
Learning Visual Relation Priors for Image-Text Matching and Image
  Captioning with Neural Scene Graph Generators
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators
Kuang-Huei Lee
Hamid Palangi
Xi Chen
Houdong Hu
Jianfeng Gao
VLM
27
37
0
22 Sep 2019
Unsupervised Adaptation for Synthetic-to-Real Handwritten Word
  Recognition
Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition
Lei Kang
Marçal Rusiñol
Alicia Fornés
Pau Riba
M. Villegas
16
23
0
18 Sep 2019
Acoustic scene analysis with multi-head attention networks
Acoustic scene analysis with multi-head attention networks
Weimin Wang
Weiran Wang
Ming Sun
Chao Wang
19
3
0
16 Sep 2019
End-to-End Neural Speaker Diarization with Self-attention
End-to-End Neural Speaker Diarization with Self-attention
Yusuke Fujita
Naoyuki Kanda
Shota Horiguchi
Yawen Xue
Kenji Nagamatsu
Shinji Watanabe
190
237
0
13 Sep 2019
Initial investigation of an encoder-decoder end-to-end TTS framework
  using marginalization of monotonic hard latent alignments
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments
Yusuke Yasuda
Xin Wang
Junichi Yamagishi
21
8
0
30 Aug 2019
Two-Pass End-to-End Speech Recognition
Two-Pass End-to-End Speech Recognition
Tara N. Sainath
Ruoming Pang
David Rybach
Yanzhang He
Rohit Prabhavalkar
...
Qiao Liang
Trevor Strohman
Yonghui Wu
Ian McGraw
Chung-Cheng Chiu
32
147
0
29 Aug 2019
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow
  Detection and Removal
ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal
Bin Ding
Chengjiang Long
Ling Zhang
Chunxia Xiao
GAN
3DH
33
151
0
04 Aug 2019
Deep Learning for Time Series Forecasting: The Electric Load Case
Deep Learning for Time Series Forecasting: The Electric Load Case
Alberto Gasparin
S. Lukovic
Cesare Alippi
AI4TS
27
220
0
22 Jul 2019
Learn Spelling from Teachers: Transferring Knowledge from Language
  Models to Sequence-to-Sequence Speech Recognition
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
KELM
24
38
0
13 Jul 2019
Learning Blended, Precise Semantic Program Embeddings
Learning Blended, Precise Semantic Program Embeddings
Ke Wang
Z. Su
NAI
30
25
0
03 Jul 2019
Attention model for articulatory features detection
Attention model for articulatory features detection
I. Karaulov
Dmytro Tkanov
14
6
0
02 Jul 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu
Jun Yu
Yuhao Cui
Dacheng Tao
Q. Tian
36
797
0
25 Jun 2019
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled
  Linguistic and Speaker Representations
Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
22
99
0
25 Jun 2019
Saliency-driven Word Alignment Interpretation for Neural Machine
  Translation
Saliency-driven Word Alignment Interpretation for Neural Machine Translation
Shuoyang Ding
Hainan Xu
Philipp Koehn
22
55
0
25 Jun 2019
Query-based Interactive Recommendation by Meta-Path and Adapted
  Attention-GRU
Query-based Interactive Recommendation by Meta-Path and Adapted Attention-GRU
Yu Zhu
Yu Gong
Qingwen Liu
Yingcai Ma
Wenwu Ou
Junxiong Zhu
Beidou Wang
Ziyu Guan
Deng Cai
LRM
19
15
0
24 Jun 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep
  Pre-Trained Language Models
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models
Wei Fang
Yu-An Chung
James R. Glass
13
27
0
17 Jun 2019
Real to H-space Encoder for Speech Recognition
Real to H-space Encoder for Speech Recognition
Titouan Parcollet
Mohamed Morchid
G. Linarès
R. Mori
23
0
0
17 Jun 2019
2D Attentional Irregular Scene Text Recognizer
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
36
50
0
13 Jun 2019
Gradual Machine Learning for Aspect-level Sentiment Analysis
Gradual Machine Learning for Aspect-level Sentiment Analysis
Yanyan Wang
Qun Chen
Jiquan Shen
Boyi Hou
Ahmed Murtadha
Zhanhuai Li
25
1
0
06 Jun 2019
Sequential Neural Networks as Automata
Sequential Neural Networks as Automata
William Merrill
23
74
0
04 Jun 2019
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong
Bo Xu
27
125
0
27 May 2019
Audio2Face: Generating Speech/Face Animation from Single Audio with
  Attention-Based Bidirectional LSTM Networks
Audio2Face: Generating Speech/Face Animation from Single Audio with Attention-Based Bidirectional LSTM Networks
Guanzhong Tian
Yi Yuan
Yong-Jin Liu
CVBM
18
45
0
27 May 2019
Acoustic-to-Word Models with Conversational Context Information
Acoustic-to-Word Models with Conversational Context Information
Suyoun Kim
Florian Metze
22
7
0
21 May 2019
End-to-end Adaptation with Backpropagation through WFST for On-device
  Speech Recognition System
End-to-end Adaptation with Backpropagation through WFST for On-device Speech Recognition System
E. Tsunoo
Yosuke Kashiwagi
S. Asakawa
Toshiyuki Kumakura
16
4
0
17 May 2019
Sparse Sequence-to-Sequence Models
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
27
209
0
14 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
44
101
0
13 May 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo-wen Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
24
586
0
30 Apr 2019
Aggregation Cross-Entropy for Sequence Recognition
Aggregation Cross-Entropy for Sequence Recognition
Zecheng Xie
Yaoxiong Huang
Yuanzhi Zhu
Lianwen Jin
Yuliang Liu
Lele Xie
25
92
0
17 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and
  Knowledge Distillation
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
16
46
0
17 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its
  Applications to Hearing-Impaired Speech and Speech Separation
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
21
112
0
08 Apr 2019
Relation-Aware Global Attention for Person Re-identification
Relation-Aware Global Attention for Person Re-identification
Zhizheng Zhang
Cuiling Lan
Wenjun Zeng
Xin Jin
Zhibo Chen
3DPC
28
476
0
05 Apr 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
21
112
0
18 Mar 2019
Towards Using Context-Dependent Symbols in CTC Without State-Tying
  Decision Trees
Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees
J. Chorowski
A. Lancucki
Bartosz Kostka
Michal Zapotoczny
19
5
0
14 Jan 2019
Speaker Adaptation for End-to-End CTC Models
Speaker Adaptation for End-to-End CTC Models
Ke Li
Jinyu Li
Yong Zhao
Kshitiz Kumar
Jiawei Liu
18
24
0
04 Jan 2019
wav2letter++: The Fastest Open-source Speech Recognition System
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap
Awni Y. Hannun
Qiantong Xu
Jeff Cai
Jacob Kahn
Gabriel Synnaeve
Vitaliy Liptchinsky
R. Collobert
VLM
18
156
0
18 Dec 2018
Automatic Grammar Augmentation for Robust Voice Command Recognition
Automatic Grammar Augmentation for Robust Voice Command Recognition
Yang Yang
Anusha Lalitha
Jinwon Lee
Chris Lott
21
3
0
14 Nov 2018
Exploring RNN-Transducer for Chinese Speech Recognition
Exploring RNN-Transducer for Chinese Speech Recognition
Senmao Wang
Pan Zhou
Wei Chen
Jia Jia
Lei Xie
27
30
0
13 Nov 2018
Stream attention-based multi-array end-to-end speech recognition
Stream attention-based multi-array end-to-end speech recognition
Xiaofei Wang
Ruizhi Li
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
25
21
0
12 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech
  recognition
Multi-encoder multi-resolution framework for end-to-end speech recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
22
13
0
12 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of
  Attention-based Sequence-to-Sequence Speech Recognition
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
30
25
0
12 Nov 2018
Few-shot learning with attention-based sequence-to-sequence models
Few-shot learning with attention-based sequence-to-sequence models
Bertrand Higy
P. Bell
19
6
0
08 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Matthew Wiesner
Jan ''Honza'' Cernocký
33
18
0
07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language
  model fusion
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
6
42
0
06 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a
  Criticizing Language Model
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
6
46
0
02 Nov 2018
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text
  Recognition
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition
Hui Li
Peng Wang
Chunhua Shen
Guyu Zhang
16
373
0
02 Nov 2018
On the End-to-End Solution to Mandarin-English Code-switching Speech
  Recognition
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Zhiping Zeng
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Chng Eng Siong
Haizhou Li
16
92
0
01 Nov 2018
Multi-Head Attention with Disagreement Regularization
Multi-Head Attention with Disagreement Regularization
Jian Li
Zhaopeng Tu
Baosong Yang
Michael R. Lyu
Tong Zhang
27
145
0
24 Oct 2018
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Georgios Tzimiropoulos
M. Pantic
14
129
0
28 Sep 2018
Deep Audio-Visual Speech Recognition
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
27
687
0
06 Sep 2018
Previous
12345678
Next