2305.12450
Cited By
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
21 May 2023
Mohan Shi
Yuchun Shu
Lingyun Zuo
Qiang Chen
Shiliang Zhang
Jie Zhang
Lirong Dai
VLM
Papers citing
"Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction"
6 / 6 papers shown
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems
Shaan Bijwadia
Shuo-yiin Chang
Yue Liu
Tara N. Sainath
Chaoyang Zhang
Yanzhang He
70
8
0
01 Nov 2022
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR
Wenjie Huang
Shuo-yiin Chang
David Rybach
Rohit Prabhavalkar
Tara N. Sainath
Cyril Allauzen
Cal Peyser
Zhiyun Lu
VLM
88
24
0
22 Apr 2022
Universal ASR: Unifying Streaming and Non-Streaming ASR Using a Single Encoder-Decoder Model
Zhifu Gao
Shiliang Zhang
Ming Lei
Ian Mcloughlin
CVBM
54
15
0
27 Oct 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
229
3,164
0
16 May 2020
Transformer ASR with Contextual Block Processing
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
110
64
0
16 Oct 2019
ESPnet: End-to-End Speech Processing Toolkit
Shinji Watanabe
Takaaki Hori
Shigeki Karita
Tomoki Hayashi
Jiro Nishitoba
...
Jahn Heymann
Sanjeev Khudanpur
Nanxin Chen
Adithya Renduchintala
Tsubasa Ochiai
VLM
122
1,515
0
30 Mar 2018