2112.05863
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
10 December 2021
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
Papers citing
"Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech"
28 / 28 papers shown
Continuous Streaming Multi-Talker ASR with Dual-path Transducers
Desh Raj
Liang Lu
Zhuo Chen
Yashesh Gaur
Jinyu Li
50
18
0
17 Sep 2021
End-to-End Speaker-Attributed ASR with Transformer
Naoyuki Kanda
Guoli Ye
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
54
49
0
05 Apr 2021
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
Xuankai Chang
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
RALM
53
15
0
06 Jan 2021
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording
Cong Han
Yi Luo
Chenda Li
Tianyan Zhou
K. Kinoshita
...
Marc Delcroix
Hakan Erdogan
J. Hershey
N. Mesgarani
Zhuo Chen
58
8
0
17 Dec 2020
Attention is All You Need in Speech Separation
Cem Subakan
Mirco Ravanelli
Samuele Cornell
Mirko Bronzi
Jianyuan Zhong
95
561
0
25 Oct 2020
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Naoyuki Kanda
Xuankai Chang
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
52
49
0
11 Aug 2020
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jing-jing Chen
Qi-rong Mao
Dong Liu
85
287
0
28 Jul 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Tianyan Zhou
Takuya Yoshioka
53
78
0
19 Jun 2020
Asteroid: the PyTorch-based audio source separation toolkit for researchers
Manuel Pariente
Samuele Cornell
Joris Cosentino
S. Sivasankaran
Efthymios Tzinis
...
Juan M. Martín-Donas
David Ditter
Ariel Frank
Antoine Deleforge
Emmanuel Vincent
68
155
0
08 May 2020
In defence of metric learning for speaker recognition
Joon Son Chung
Jaesung Huh
Seongkyu Mun
Minjae Lee
Hee-Soo Heo
Soyeon Choe
Chiheon Ham
Sung-Ye Jung
Bong-Jin Lee
Icksang Han
67
437
0
26 Mar 2020
Voice Separation with an Unknown Number of Multiple Speakers
Eliya Nachmani
Yossi Adi
Lior Wolf
61
175
0
29 Feb 2020
Wavesplit: End-to-End Speech Separation by Speaker Clustering
Neil Zeghidour
David Grangier
VLM
90
264
0
20 Feb 2020
An empirical study of Conv-TasNet
Berkan Kadıoğlu
Michael Horgan
Xiaoyu Liu
Jordi Pons
Dan Darcy
Vivek Kumar
40
44
0
20 Feb 2020
Continuous speech separation: dataset and analysis
Zhuo Chen
Takuya Yoshioka
Liang Lu
Tianyan Zhou
Zhong Meng
Yi Luo
Jian Wu
Xiong Xiao
Jinyu Li
71
216
0
30 Jan 2020
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Yi Luo
Zhuo Chen
Takuya Yoshioka
AI4TS
86
771
0
14 Oct 2019
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
T. Menne
Ilya Sklyar
Ralf Schlüter
Hermann Ney
145
35
0
09 May 2019
SDR - half-baked or well done?
Jonathan Le Roux
Scott Wisdom
Hakan Erdogan
J. Hershey
153
1,202
0
06 Nov 2018
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang
Hannah Muckenhirn
K. Wilson
Prashant Sridhar
Zelin Wu
J. Hershey
Rif A. Saurous
Ron J. Weiss
Ye Jia
Ignacio López Moreno
71
370
0
11 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
55
82
0
08 Oct 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
159
1,794
0
20 Sep 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker
Shinji Watanabe
Emmanuel Vincent
J. Trmal
59
685
0
28 Mar 2018
Speaker Diarization with LSTM
Quan Wang
Carlton Downey
Li Wan
Philip Mansfield
Ignacio López Moreno
65
319
0
28 Oct 2017
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
77
1,374
0
24 Aug 2017
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
56
247
0
12 Jul 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
57
726
0
18 Mar 2017
English Conversational Telephone Speech Recognition by Humans and Machines
G. Saon
Gakuto Kurata
Tom Sercu
Kartik Audhkhasi
Samuel Thomas
...
Bhuvana Ramabhadran
M. Picheny
L. Lim
Bergul Roomi
Phil Hall
65
365
0
06 Mar 2017
Achieving Human Parity in Conversational Speech Recognition
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
89
581
0
17 Oct 2016
Deep clustering: Discriminative embeddings for segmentation and separation
J. Hershey
Zhuo Chen
Jonathan Le Roux
Shinji Watanabe
62
1,319
0
18 Aug 2015