Title |
---|
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, ..., Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou |
BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5. Zhehuai Chen, He Huang, Oleksii Hrinchuk, Krishna Puvvada, Nithin Rao Koluguri, Piotr Żelasko, Jagadeesh Balam, Boris Ginsburg |
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity. Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, ..., Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen |
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition. Peng Shen, Xugang Lu, Hisashi Kawai |