2404.05466
Cited By
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
8 April 2024
He Wang
Pengcheng Guo
Xucheng Wan
Huan Zhou
Lei Xie
Papers citing "Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder"
4 / 4 papers shown
Title
The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024
He Wang
Lei Xie
19
0
0
05 Aug 2024
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
58
105
0
30 Sep 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
89
40
0
25 Jan 2022
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
84
225
0
12 Feb 2021