Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.05030
Cited By
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
4 March 2018
Shiliang Zhang
Ming Lei
Zhijie Yan
Lirong Dai
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep-FSMN for Large Vocabulary Continuous Speech Recognition"
27 / 27 papers shown
Title
Artifact-free Sound Quality in DNN-based Closed-loop Systems for Audio Processing
chuan Wen
Guy Torfs
Sarah Verhulst
43
0
0
17 Feb 2025
Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures
Lingyun Zuo
Keyu An
Shiliang Zhang
Zhijie Yan
30
1
0
19 Dec 2023
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation
Shengkui Zhao
Yukun Ma
Chongjia Ni
Chong Zhang
Hao Wang
Trung Hieu Nguyen
Kun Zhou
J. Yip
Dianwen Ng
Bin Ma
36
23
0
19 Dec 2023
Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring
Kaiqi Fu
Shaojun Gao
Shuju Shi
Xiaohai Tian
Wei Li
Zejun Ma
31
2
0
19 May 2023
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
Zhifu Gao
Zerui Li
Jiaming Wang
Haoneng Luo
Xian Shi
...
Yabin Li
Lingyun Zuo
Zhihao Du
Zhangyu Xiao
Shiliang Zhang
37
54
0
18 May 2023
BiBench: Benchmarking and Analyzing Network Binarization
Haotong Qin
Mingyuan Zhang
Yifu Ding
Aoyu Li
Zhongang Cai
Ziwei Liu
Feng Yu
Xianglong Liu
MQ
AAML
44
36
0
26 Jan 2023
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
36
2
0
09 Dec 2022
Phonemic Adversarial Attack against Audio Recognition in Real World
Jiakai Wang
Zhendong Chen
Zixin Yin
Qinghong Yang
Xianglong Liu
AAML
40
3
0
19 Nov 2022
Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis
Zhihao Du
Shiliang Zhang
Siqi Zheng
Zhijie Yan
24
14
0
18 Nov 2022
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance
Haotong Qin
Xudong Ma
Yifu Ding
X. Li
Yang Zhang
Zejun Ma
Jiakai Wang
Jie Luo
Xianglong Liu
MQ
40
20
0
13 Nov 2022
Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Rao Ma
Xiaobo Wu
Jin Qiu
Yanan Qin
Haihua Xu
Peihao Wu
Zejun Ma
32
2
0
02 Nov 2022
Boosting Tail Neural Network for Realtime Custom Keyword Spotting
Sihao Xue
Qianyao Shen
Guoqing Li
37
0
0
24 May 2022
Integrating Lattice-Free MMI into End-to-End Speech Recognition
Jinchuan Tian
Jianwei Yu
Chao Weng
Yuexian Zou
Dong Yu
35
8
0
29 Mar 2022
Improving Non-native Word-level Pronunciation Scoring with Phone-level Mixup Data Augmentation and Multi-source Information
Kaiqi Fu
Shaojun Gao
Kai Wang
Wei Li
Xiaohai Tian
Zejun Ma
19
8
0
01 Mar 2022
Multi-Task Deep Residual Echo Suppression with Echo-aware Loss
Shimin Zhang
Ziteng Wang
Jiayao Sun
Yihui Fu
Biao Tian
Q. Fu
Lei Xie
27
31
0
14 Feb 2022
Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection
Jing Du
Shiliang Pu
Qinbo Dong
Chao Jin
Xin Qi
Dian Gu
Ru Wu
Hongwei Zhou
30
9
0
10 Jan 2022
Controllable Multichannel Speech Dereverberation based on Deep Neural Networks
Ziteng Wang
Yueyue Na
Biao Tian
Q. Fu
21
0
0
16 Oct 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
18
352
0
29 Jun 2021
DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion
Songxiang Liu
Yuewen Cao
Dan Su
Helen Meng
DiffM
29
56
0
28 May 2021
Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge
Ziteng Wang
Yueyue Na
Zhang Liu
Biao Tian
Q. Fu
24
36
0
17 Feb 2021
PPG-based singing voice conversion with adversarial representation learning
Zhonghao Li
Benlai Tang
Xiang Yin
Yuan Wan
Linjia Xu
Chen Shen
Zejun Ma
19
37
0
28 Oct 2020
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng Luo
Shiliang Zhang
Ming Lei
Lei Xie
35
33
0
21 May 2020
Automatic Dialogic Instruction Detection for K-12 Online One-on-one Classes
Shiting Xu
Wenbiao Ding
Zitao Liu
VLM
14
12
0
16 May 2020
DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition
Zhao You
Dan Su
Jie Chen
Chao Weng
Dong Yu
28
13
0
28 Oct 2019
Almost Unsupervised Text to Speech and Automatic Speech Recognition
Yi Ren
Xu Tan
Tao Qin
Sheng Zhao
Zhou Zhao
Tie-Yan Liu
44
101
0
13 May 2019
Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models
T. Nguyen
Sebastian Stüker
A. Waibel
33
1
0
02 Feb 2019
Improving Gated Recurrent Unit Based Acoustic Modeling with Batch Normalization and Enlarged Context
Jie Li
Yahui Shan
Xiaorui Wang
Yan Li
16
3
0
26 Nov 2018
1