ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.01769
  4. Cited By
State-of-the-art Speech Recognition With Sequence-to-Sequence Models

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

5 December 2017
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhehuai Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Katya Gonina
Navdeep Jaitly
Bo Li
J. Chorowski
M. Bacchiani
    AI4TS
ArXivPDFHTML

Papers citing "State-of-the-art Speech Recognition With Sequence-to-Sequence Models"

50 / 501 papers shown
Title
DoPa: A Comprehensive CNN Detection Methodology against Physical
  Adversarial Attacks
DoPa: A Comprehensive CNN Detection Methodology against Physical Adversarial Attacks
Zirui Xu
Fuxun Yu
Xiang Chen
AAML
19
0
0
21 May 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
44
101
0
13 May 2019
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data
  Augmentation
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation
Christoph Luscher
Eugen Beck
Kazuki Irie
M. Kitza
Wilfried Michel
Albert Zeyer
Ralf Schluter
Hermann Ney
VLM
13
234
0
08 May 2019
Deep Learning for Audio Signal Processing
Deep Learning for Audio Signal Processing
Hendrik Purwins
Bo Li
Tuomas Virtanen
Jan Schlüter
Shuo-yiin Chang
Tara N. Sainath
VLM
34
587
0
30 Apr 2019
Incorporating Symbolic Sequential Modeling for Speech Enhancement
Incorporating Symbolic Sequential Modeling for Speech Enhancement
Chien-Feng Liao
Yu Tsao
Xugang Lu
Hisashi Kawai
27
18
0
30 Apr 2019
A Comparison of Online Automatic Speech Recognition Systems and the
  Nonverbal Responses to Unintelligible Speech
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Joshua Y. Kim
Chunfeng Liu
R. Calvo
K. McCabe
Silas C. R. Taylor
Björn W. Schuller
Kaihang Wu
26
38
0
29 Apr 2019
Towards Efficient Model Compression via Learned Global Ranking
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
16
170
0
28 Apr 2019
An Investigation of End-to-End Multichannel Speech Recognition for
  Reverberant and Mismatch Conditions
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Aswin Shanmugam Subramanian
Xiaofei Wang
Shinji Watanabe
T. Taniguchi
Dung T. Tran
Yuya Fujita
14
20
0
19 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
31
3,414
0
18 Apr 2019
End-to-End Speech Translation with Knowledge Distillation
End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu
Hao Xiong
Zhongjun He
Jiajun Zhang
Hua Wu
Haifeng Wang
Chengqing Zong
36
151
0
17 Apr 2019
Direct speech-to-speech translation with a sequence-to-sequence model
Direct speech-to-speech translation with a sequence-to-sequence model
Ye Jia
Ron J. Weiss
Fadi Biadsy
Wolfgang Macherey
Melvin Johnson
Zhehuai Chen
Yonghui Wu
23
223
0
12 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
16
27
0
09 Apr 2019
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its
  Applications to Hearing-Impaired Speech and Speech Separation
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
Fadi Biadsy
Ron J. Weiss
Pedro J. Moreno
D. Kanvesky
Ye Jia
27
113
0
08 Apr 2019
Completely Unsupervised Speech Recognition By A Generative Adversarial
  Network Harmonized With Iteratively Refined Hidden Markov Models
Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
Kuan-Yu Chen
Che-Ping Tsai
Da-Rong Liu
Hung-yi Lee
Lin-Shan Lee
GAN
36
23
0
08 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
28
95
0
04 Apr 2019
Massively Multilingual Adversarial Speech Recognition
Massively Multilingual Adversarial Speech Recognition
Oliver Adams
Sanjeev Khudanpur
Shinji Watanabe
David Yarowsky
9
76
0
03 Apr 2019
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word
  Speech Recognition
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition
Shane Settle
Kartik Audhkhasi
Karen Livescu
M. Picheny
30
34
0
29 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End
  Speech Recognition
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
22
15
0
27 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
25
14
0
12 Mar 2019
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence
  Modeling
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
Jonathan Shen
Patrick Nguyen
Yonghui Wu
Zhehuai Chen
Mengzhao Chen
...
William Chan
Shubham Toshniwal
Baohua Liao
M. Nirschl
Pat Rondon
VLM
27
209
0
21 Feb 2019
A spelling correction model for end-to-end speech recognition
A spelling correction model for end-to-end speech recognition
Jinxi Guo
Tara N. Sainath
Ron J. Weiss
AuLLM
KELM
32
139
0
19 Feb 2019
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Learned In Speech Recognition: Contextual Acoustic Word Embeddings
Shruti Palaskar
Vikas Raunak
Florian Metze
22
17
0
18 Feb 2019
Grids versus Graphs: Partitioning Space for Improved Taxi Demand-Supply
  Forecasts
Grids versus Graphs: Partitioning Space for Improved Taxi Demand-Supply Forecasts
Neema Davis
G. Raina
Krishna Jagannathan
AI4TS
30
27
0
18 Feb 2019
KINN: Incorporating Expert Knowledge in Neural Networks
KINN: Incorporating Expert Knowledge in Neural Networks
M. A. Chattha
Shoaib Ahmed Siddiqui
M. I. Malik
L. V. Elst
Andreas Dengel
Sheraz Ahmed
14
6
0
15 Feb 2019
End-to-end Anchored Speech Recognition
End-to-end Anchored Speech Recognition
Yiming Wang
Xing Fan
I-Fan Chen
Yuzong Liu
Tongfei Chen
Björn Hoffmeister
21
20
0
06 Feb 2019
On the Choice of Modeling Unit for Sequence-to-Sequence Speech
  Recognition
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
Kazuki Irie
Rohit Prabhavalkar
Anjuli Kannan
A. Bruguier
David Rybach
Patrick Nguyen
26
37
0
05 Feb 2019
Unsupervised speech representation learning using WaveNet autoencoders
Unsupervised speech representation learning using WaveNet autoencoders
J. Chorowski
Ron J. Weiss
Samy Bengio
Aaron van den Oord
SSL
25
318
0
25 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in
  Speech Recognition
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
25
117
0
22 Jan 2019
Speaker Adaptation for End-to-End CTC Models
Speaker Adaptation for End-to-End CTC Models
Ke Li
Jinyu Li
Yong Zhao
Kshitiz Kumar
Jiawei Liu
18
24
0
04 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
19
26
0
31 Dec 2018
Pansori: ASR Corpus Generation from Open Online Video Contents
Pansori: ASR Corpus Generation from Open Online Video Contents
Yoona Choi
Bowon Lee
22
6
0
23 Dec 2018
Streaming Voice Query Recognition using Causal Convolutional Recurrent
  Neural Networks
Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks
Raphael Tang
Gefei Yang
H. Wei
Yajie Mao
Ferhan Ture
Jimmy J. Lin
30
3
0
19 Dec 2018
Pretraining by Backtranslation for End-to-end ASR in Low-Resource
  Settings
Pretraining by Backtranslation for End-to-end ASR in Low-Resource Settings
Sanjeev Khudanpur
Adithya Renduchintala
Shinji Watanabe
Shuoyang Ding
Najim Dehak
Sanjeev Khudanpur
21
32
0
10 Dec 2018
Deep-RBF Networks Revisited: Robust Classification with Rejection
Deep-RBF Networks Revisited: Robust Classification with Rejection
P. Zadeh
Reshad Hosseini
S. Sra
AAML
OOD
11
28
0
07 Dec 2018
Context-Aware Dialog Re-Ranking for Task-Oriented Dialog Systems
Context-Aware Dialog Re-Ranking for Task-Oriented Dialog Systems
Junki Ohmura
M. Eskénazi
16
5
0
28 Nov 2018
Bytes are All You Need: End-to-End Multilingual Speech Recognition and
  Synthesis with Bytes
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Bo Li
Yu Zhang
Tara N. Sainath
Yonghui Wu
William Chan
AuLLM
24
129
0
22 Nov 2018
GPipe: Efficient Training of Giant Neural Networks using Pipeline
  Parallelism
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
Yanping Huang
Yonglong Cheng
Ankur Bapna
Orhan Firat
Mia Xu Chen
...
HyoukJoong Lee
Jiquan Ngiam
Quoc V. Le
Yonghui Wu
Zhifeng Chen
GNN
MoE
27
7
0
16 Nov 2018
Streaming End-to-end Speech Recognition For Mobile Devices
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
54
624
0
15 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou
Wenwen Yang
Wei Chen
Yanfeng Wang
Jia Jia
24
69
0
13 Nov 2018
An Online Attention-based Model for Speech Recognition
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
21
48
0
13 Nov 2018
Multi-encoder multi-resolution framework for end-to-end speech
  recognition
Multi-encoder multi-resolution framework for end-to-end speech recognition
Ruizhi Li
Xiaofei Wang
Sri Harish Reddy Mallidi
Takaaki Hori
Shinji Watanabe
H. Hermansky
22
13
0
12 Nov 2018
Vectorization of hypotheses and speech for faster beam search in encoder
  decoder-based speech recognition
Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition
Hiroshi Seki
Takaaki Hori
Shinji Watanabe
27
2
0
12 Nov 2018
Improving End-to-end Speech Recognition with Pronunciation-assisted
  Sub-word Modeling
Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling
Hainan Xu
Shuoyang Ding
Shinji Watanabe
45
37
0
10 Nov 2018
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and
  Context Preservation Mechanisms
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms
Kou Tanaka
Hirokazu Kameoka
Takuhiro Kaneko
Nobukatsu Hojo
19
111
0
09 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Sanjeev Khudanpur
Jan ''Honza'' Cernocký
33
18
0
07 Nov 2018
CNN-based MultiChannel End-to-End Speech Recognition for everyday home
  environments
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments
Hyungjun Lim
Younggwan Kim
Takaaki Hori
Myunghun Jung
Hoirin Kim
29
12
0
07 Nov 2018
Transfer learning of language-independent end-to-end ASR with language
  model fusion
Transfer learning of language-independent end-to-end ASR with language model fusion
S. Hariri
Jaejin Cho
M. Baskar
Tatsuya Kawahara
R. Brunner
22
42
0
06 Nov 2018
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text
  Translation
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia
Melvin Johnson
Wolfgang Macherey
Ron J. Weiss
Yuan Cao
Chung-Cheng Chiu
Naveen Ari
Stella Laurenzo
Yonghui Wu
31
159
0
05 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a
  Criticizing Language Model
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
22
46
0
02 Nov 2018
Improving the Robustness of Speech Translation
Improving the Robustness of Speech Translation
Xiang-Yang Li
Haiyang Xue
Wei Chen
Yang Liu
Yang Feng
Qun Liu
19
17
0
02 Nov 2018
Previous
123...101189
Next