Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.08549
Cited By
Speaker Change Detection for Transformer Transducer ASR
16 February 2023
Jian Wu
Zhuo Chen
Min Hu
Xiong Xiao
Jinyu Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Speaker Change Detection for Transformer Transducer ASR"
20 / 20 papers shown
Title
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Zhiyun Fan
Zhenlin Liang
Linhao Dong
Yi Liu
Shiyu Zhou
Meng Cai
Jun Zhang
Zejun Ma
Bo Xu
45
2
0
17 Nov 2022
Turn-Taking Prediction for Natural Conversational Speech
Shuo-yiin Chang
Yue Liu
Tara N. Sainath
Chaoyang Zhang
Trevor Strohman
Qiao Liang
Yanzhang He
53
21
0
29 Aug 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Naoyuki Kanda
Jian Wu
Yu Wu
Xiong Xiao
Zhong Meng
Xiaofei Wang
Yashesh Gaur
Zhuo Chen
Jinyu Li
Takuya Yoshioka
69
57
0
02 Feb 2022
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
110
369
0
02 Nov 2021
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Sanyuan Chen
Chengyi Wang
Zhengyang Chen
Yu-Huan Wu
Shujie Liu
...
Yao Qian
Jian Wu
Micheal Zeng
Xiangzhan Yu
Furu Wei
SSL
223
1,855
0
26 Oct 2021
Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection
Wei Xia
Han Lu
Quan Wang
Anshuman Tripathi
Yiling Huang
Ignacio López Moreno
Hasim Sak
65
51
0
23 Sep 2021
On Addressing Practical Challenges for RNN-Transducer
Rui Zhao
Jian Xue
Jinyu Li
Wenning Wei
Lei He
Jiawei Liu
42
31
0
27 Apr 2021
Continuous Speech Separation with Ad Hoc Microphone Arrays
Dongmei Wang
Takuya Yoshioka
Zhuo Chen
Xiaofei Wang
Tianyan Zhou
Zhong Meng
39
27
0
03 Mar 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
315
331
0
24 Jan 2021
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen
Yu-Huan Wu
Zhenghao Wang
Shujie Liu
Jinyu Li
108
174
0
22 Oct 2020
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Jiahui Yu
Chung-Cheng Chiu
Yue Liu
Shuo-yiin Chang
Tara N. Sainath
...
A. Narayanan
Wei Han
Anmol Gulati
Yonghui Wu
Ruoming Pang
56
92
0
21 Oct 2020
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
Jinyu Li
Yu-Huan Wu
Yashesh Gaur
Chengyi Wang
Rui Zhao
Shujie Liu
39
137
0
28 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
48
40
0
16 May 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
63
213
0
30 Jan 2020
Joint Speech Recognition and Speaker Diarization via Sequence Transduction
Laurent El Shafey
H. Soltau
Izhak Shafran
67
102
0
09 Jul 2019
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He
Tara N. Sainath
Rohit Prabhavalkar
Ian McGraw
R. Álvarez
...
K. Sim
Tom Bagby
Shuo-yiin Chang
Kanishka Rao
A. Gruenstein
103
627
0
15 Nov 2018
Multimodal Speaker Segmentation and Diarization using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks
Tae Jin Park
P. Georgiou
52
37
0
28 May 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
677
131,414
0
12 Jun 2017
Speaker Change Detection Using Features through A Neural Network Speaker Classifier
Zhenhao Ge
A. N. Iyer
S. Cheluvaraja
A. Ganapathiraju
37
9
0
08 Feb 2017
Sequence Transduction with Recurrent Neural Networks
Alex Graves
187
1,868
0
14 Nov 2012
1