Streaming end-to-end multi-talker speech recognition
arXiv: 2011.13148

26 November 2020
Liang Lu
Naoyuki Kanda
Jinyu Li
Jiawei Liu

Papers citing "Streaming end-to-end multi-talker speech recognition"

26 / 26 papers shown
Guided Speaker Embedding
Shota Horiguchi
Takafumi Moriya
Atsushi Ando
Takanori Ashihara
Hiroshi Sato
Naohiro Tawara
Marc Delcroix
60
0
0
03 Jan 2025
Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC
Jiawen Kang
Lingwei Meng
Mingyu Cui
Yuejiao Wang
Xixin Wu
Xunying Liu
Helen Meng
58
2
0
19 Sep 2024
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Naoyuki Kanda
Zhong Meng
Liang Lu
Yashesh Gaur
Xiaofei Wang
Zhuo Chen
Takuya Yoshioka
41
17
0
03 Nov 2020
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen
Yu-Huan Wu
Zhenghao Wang
Shujie Liu
Jinyu Li
66
170
0
22 Oct 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
Jinyu Li
Rui Zhao
Zhong Meng
Yanqing Liu
Wenning Wei
...
V. Mazalov
Zhenghao Wang
Lei He
Sheng Zhao
Jiawei Liu
22
107
0
30 Jul 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
27
75
0
19 Jun 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
41
113
0
28 Mar 2020
Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Qian Zhang
Han Lu
Hasim Sak
Anshuman Tripathi
Erik McDermott
Stephen Koo
Shankar Kumar
40
477
0
07 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
39
212
0
30 Jan 2020
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
63
624
0
15 Nov 2018
End-to-End Monaural Multi-speaker ASR System without Pretraining
Xuankai Chang
Y. Qian
Yi Liang
Deming Chen
35
76
0
05 Nov 2018
Advancing Acoustic-to-Word CTC Model
Jinyu Li
Guoli Ye
Amit Das
Rui Zhao
Jiawei Liu
36
97
0
15 Mar 2018
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
46
151
0
08 Dec 2017
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu
Tara N. Sainath
Yonghui Wu
Rohit Prabhavalkar
Patrick Nguyen
...
Katya Gonina
Navdeep Jaitly
Yue Liu
J. Chorowski
M. Bacchiani
AI4TS
65
1,150
0
05 Dec 2017
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo
N. Mesgarani
52
626
0
01 Nov 2017
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training
Y. Qian
Xuankai Chang
Dong Yu
15
79
0
19 Jul 2017
Recognizing Multi-talker Speech with Permutation Invariant Training
Dong Yu
Xuankai Chang
Y. Qian
31
90
0
22 Mar 2017
Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation
Dong Yu
Morten Kolbæk
Zheng-Hua Tan
Jesper Jensen
66
854
0
01 Jul 2016
Segmental Recurrent Neural Networks for End-to-end Speech Recognition
Liang Lu
Lingpeng Kong
Chris Dyer
Noah A. Smith
Steve Renals
42
81
0
01 Mar 2016
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
126
7,683
0
31 Aug 2015
Deep clustering: Discriminative embeddings for segmentation and separation
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
31
1,316
0
18 Aug 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
119
2,257
0
05 Aug 2015
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding
Yajie Miao
M. Gowayyed
Florian Metze
68
753
0
29 Jul 2015
Attention-Based Models for Speech Recognition
J. Chorowski
Dzmitry Bahdanau
Dmitriy Serdyuk
Kyunghyun Cho
Yoshua Bengio
76
2,602
0
24 Jun 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
290
149,474
0
22 Dec 2014
Sequence Transduction with Recurrent Neural Networks
Alex Graves
70
1,858
0
14 Nov 2012