ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.01769
  4. Cited By
State-of-the-art Speech Recognition With Sequence-to-Sequence Models

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

5 December 2017
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
Zhehuai Chen
Anjuli Kannan
Ron J. Weiss
Kanishka Rao
Katya Gonina
Navdeep Jaitly
Bo Li
J. Chorowski
M. Bacchiani
    AI4TS
ArXivPDFHTML

Papers citing "State-of-the-art Speech Recognition With Sequence-to-Sequence Models"

50 / 501 papers shown
Title
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Asset Allocation: From Markowitz to Deep Reinforcement Learning
Ricard Durall
33
4
0
14 Jul 2022
Revisiting Label Smoothing and Knowledge Distillation Compatibility:
  What was Missing?
Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing?
Keshigeyan Chandrasegaran
Ngoc-Trung Tran
Yunqing Zhao
Ngai-man Cheung
95
41
0
29 Jun 2022
Sequence-level Speaker Change Detection with Difference-based Continuous
  Integrate-and-fire
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire
Zhiyun Fan
Linhao Dong
Meng Cai
Zejun Ma
Bo Xu
36
4
0
27 Jun 2022
On Comparison of Encoders for Attention based End to End Speech
  Recognition in Standalone and Rescoring Mode
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
Raviraj Joshi
Subodh Kumar
36
2
0
26 Jun 2022
Detecting the Severity of Major Depressive Disorder from Speech: A Novel HARD-Training Methodology
Edward L. Campbell
J. Dineley
Pauline Conde
F. Matcham
F. Lamers
S. Siddi
Laura Docío-Fernández
C. García-Mateo
N. Cummins
the RADAR-CNS Consortium
39
4
0
02 Jun 2022
Minimising Biasing Word Errors for Contextual ASR with the
  Tree-Constrained Pointer Generator
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Guangzhi Sun
Chuxu Zhang
P. Woodland
36
14
0
18 May 2022
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes
Shaojin Ding
Weiran Wang
Ding Zhao
Tara N. Sainath
Yanzhang He
...
Qiao Liang
Dongseong Hwang
Ian McGraw
Rohit Prabhavalkar
Trevor Strohman
35
17
0
13 Apr 2022
Combining Spectral and Self-Supervised Features for Low Resource Speech
  Recognition and Translation
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
Dan Berrebbi
Jiatong Shi
Brian Yan
Osbel López-Francisco
Jonathan D. Amith
Shinji Watanabe
15
26
0
05 Apr 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text
  for Low-Resource Automatic Speech Recognition
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Ye Du
Jie Zhang
Qiu-shi Zhu
Lirong Dai
Ming Wu
Xin Fang
Zhouwang Yang
34
2
0
05 Apr 2022
CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained
  ASR Embeddings for Speech Emotion Recognition
CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
Chengxin Chen
Pengyuan Zhang
AI4TS
26
10
0
31 Mar 2022
Analyzing the factors affecting usefulness of Self-Supervised
  Pre-trained Representations for Speech Recognition
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition
Ashish Seth
L. D. Prasad
Sreyan Ghosh
S. Umesh
33
3
0
31 Mar 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Jaesong Lee
Lukas Lee
Shinji Watanabe
38
8
0
31 Mar 2022
4-bit Conformer with Native Quantization Aware Training for Speech
  Recognition
4-bit Conformer with Native Quantization Aware Training for Speech Recognition
Shaojin Ding
Phoenix Meadowlark
Yanzhang He
Lukasz Lew
Shivani Agrawal
Oleg Rybakov
MQ
31
32
0
29 Mar 2022
An Overview & Analysis of Sequence-to-Sequence Emotional Voice
  Conversion
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion
Zijiang Yang
Xin Jing
Andreas Triantafyllopoulos
Meishu Song
Ilhan Aslan
Björn W. Schuller
20
14
0
29 Mar 2022
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain
  Data
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Chen Chen
Nana Hou
Yuchen Hu
Shashank Shirol
Chng Eng Siong
NoLa
25
43
0
29 Mar 2022
Transformer-based Streaming ASR with Cumulative Attention
Transformer-based Streaming ASR with Cumulative Attention
Mohan Li
Shucong Zhang
Catalin Zorila
R. Doddipatla
27
9
0
11 Mar 2022
Language Adaptive Cross-lingual Speech Representation Learning with
  Sparse Sharing Sub-networks
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks
Yizhou Lu
Mingkun Huang
Xinghua Qu
Pengfei Wei
Zejun Ma
32
19
0
09 Mar 2022
Language technology practitioners as language managers: arbitrating data
  bias and predictive bias in ASR
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR
Nina Markl
S. McNulty
26
9
0
25 Feb 2022
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end
  Long-form Speech Recognition
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition
Jinhan Wang
Xiaosu Tong
Jinxi Guo
Di He
Roland Maas
29
5
0
22 Feb 2022
End-to-end contextual asr based on posterior distribution adaptation for
  hybrid ctc/attention system
End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Zheng-Wei Zhang
Pan Zhou
47
6
0
18 Feb 2022
Conversational Speech Recognition By Learning Conversation-level
  Characteristics
Conversational Speech Recognition By Learning Conversation-level Characteristics
Kun Wei
Yike Zhang
Sining Sun
Lei Xie
Long Ma
43
7
0
16 Feb 2022
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
I'm Hearing (Different) Voices: Anonymous Voices to Protect User Privacy
H.C.M. Turner
Giulio Lovisotto
Simon Eberz
Ivan Martinovic
16
1
0
13 Feb 2022
Conversational Agents: Theory and Applications
Conversational Agents: Theory and Applications
M. Wahde
M. Virgolin
LLMAG
37
25
0
07 Feb 2022
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End
  Mandarin Chinese ASR
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR
Zhao Yang
Dianwen Ng
Xiao Fu
Liping Han
Wei Xi
Ruimeng Wang
Rui Jiang
Jizhong Zhao
40
2
0
26 Jan 2022
Internal Language Model Estimation Through Explicit Context Vector
  Learning for Attention-based Encoder-decoder ASR
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
Yufei Liu
Rao Ma
Haihua Xu
Yi He
Zejun Ma
Weibin Zhang
28
12
0
26 Jan 2022
Graph Neural Networks: a bibliometrics overview
Graph Neural Networks: a bibliometrics overview
Abdalsamad Keramatfar
Mohadeseh Rafiee
Hossein Amirkhani
GNN
AI4CE
43
24
0
03 Jan 2022
Multi-Variant Consistency based Self-supervised Learning for Robust
  Automatic Speech Recognition
Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
30
4
0
23 Dec 2021
Neural Networks for Infectious Diseases Detection: Prospects and
  Challenges
Neural Networks for Infectious Diseases Detection: Prospects and Challenges
Muhammad Azeem
Shumaila Javaid
Hamza Fahim
Nasir Saeed
24
6
0
07 Dec 2021
Attention based end to end Speech Recognition for Voice Search in Hindi
  and English
Attention based end to end Speech Recognition for Voice Search in Hindi and English
Raviraj Joshi
Venkateshan Kannan
28
7
0
15 Nov 2021
Context-Aware Transformer Transducer for Speech Recognition
Context-Aware Transformer Transducer for Speech Recognition
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
21
79
0
05 Nov 2021
Sequence-to-Sequence Modeling for Action Identification at High Temporal
  Resolution
Sequence-to-Sequence Modeling for Action Identification at High Temporal Resolution
Aakash Kaku
Kangning Liu
A. Parnandi
H. Rajamohan
Kannan Venkataramanan
Anita Venkatesan
Audre Wirtanen
Natasha Pandit
Heidi M. Schambra
C. Fernandez‐Granda
27
5
0
03 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
40
363
0
02 Nov 2021
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial
  Attack Framework
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework
Lifan Yuan
Yichi Zhang
Yangyi Chen
Wei Wei
AAML
29
33
0
28 Oct 2021
Understanding How Encoder-Decoder Architectures Attend
Understanding How Encoder-Decoder Architectures Attend
Kyle Aitken
V. Ramasesh
Yuan Cao
Niru Maheswaranathan
39
17
0
28 Oct 2021
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End
  Speech Recognition and Understanding
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding
Wei Wang
Shuo Ren
Yao Qian
Shujie Liu
Yu Shi
Y. Qian
Michael Zeng
40
17
0
23 Oct 2021
An Investigation of Enhancing CTC Model for Triggered Attention-based
  Streaming ASR
An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR
Huaibo Zhao
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
22
4
0
20 Oct 2021
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
Jing Pan
Tao Lei
Kwangyoun Kim
Kyu Jeong Han
Shinji Watanabe
VLM
34
9
0
11 Oct 2021
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text
  Generation
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Yosuke Higuchi
Nanxin Chen
Yuya Fujita
Hirofumi Inaguma
Tatsuya Komatsu
Jaesong Lee
Jumon Nozaki
Tianzi Wang
Shinji Watanabe
38
41
0
11 Oct 2021
Advancing Momentum Pseudo-Labeling with Conformer and Initialization
  Strategy
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
19
11
0
11 Oct 2021
DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent
  Adaptation
DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation
Suraj Kothawade
Anmol Reddy Mekala
D. ChandraSekhara
Mayank Kothyari
Rishabh K. Iyer
Ganesh Ramakrishnan
Preethi Jyothi
33
5
0
10 Oct 2021
Have best of both worlds: two-pass hybrid and E2E cascading framework
  for speech recognition
Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Guoli Ye
V. Mazalov
Jinyu Li
Jiawei Liu
25
9
0
10 Oct 2021
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular
  Subword Units
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Yosuke Higuchi
Keita Karube
Tetsuji Ogawa
Tetsunori Kobayashi
18
23
0
08 Oct 2021
Explaining the Attention Mechanism of End-to-End Speech Recognition
  Using Decision Trees
Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Yuanchao Wang
Wenjing Du
Chenghao Cai
Yanyan Xu
47
1
0
08 Oct 2021
ABCP: Automatic Block-wise and Channel-wise Network Pruning via Joint
  Search
ABCP: Automatic Block-wise and Channel-wise Network Pruning via Joint Search
Jiaqi Li
Haoran Li
Yaran Chen
Zixiang Ding
Nannan Li
Mingjun Ma
Zicheng Duan
Dong Zhao
42
9
0
08 Oct 2021
Back from the future: bidirectional CTC decoding using future
  information in speech recognition
Back from the future: bidirectional CTC decoding using future information in speech recognition
Namkyu Jung
Geon-min Kim
Han-Gyu Kim
35
3
0
07 Oct 2021
Internal Language Model Adaptation with Text-Only Data for End-to-End
  Speech Recognition
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Zhong Meng
Yashesh Gaur
Naoyuki Kanda
Jinyu Li
Xie Chen
Yu Wu
Yifan Gong
AuLLM
24
32
0
06 Oct 2021
Integrating Categorical Features in End-to-End ASR
Integrating Categorical Features in End-to-End ASR
Rongqing Huang
26
1
0
06 Oct 2021
GAN-based Reactive Motion Synthesis with Class-aware Discriminators for
  Human-human Interaction
GAN-based Reactive Motion Synthesis with Class-aware Discriminators for Human-human Interaction
Qianhui Men
Hubert P. H. Shum
Edmond S. L. Ho
Howard Leung
36
28
0
01 Oct 2021
Factorized Neural Transducer for Efficient Language Model Adaptation
Factorized Neural Transducer for Efficient Language Model Adaptation
Xie Chen
Zhong Meng
S. Parthasarathy
Jinyu Li
23
39
0
27 Sep 2021
A Survey on Cost Types, Interaction Schemes, and Annotator Performance
  Models in Selection Algorithms for Active Learning in Classification
A Survey on Cost Types, Interaction Schemes, and Annotator Performance Models in Selection Algorithms for Active Learning in Classification
M. Herde
Denis Huseljic
Bernhard Sick
A. Calma
42
25
0
23 Sep 2021
Previous
123456...91011
Next