ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.05466
  4. Cited By
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder

Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder

8 April 2024
He Wang
Pengcheng Guo
Xucheng Wan
Huan Zhou
Lei Xie
ArXivPDFHTML

Papers citing "Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder"

4 / 4 papers shown
Title
The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC
  2024
The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024
He Wang
Lei Xie
19
0
0
05 Aug 2024
E-Branchformer: Branchformer with Enhanced merging for speech
  recognition
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
58
105
0
30 Sep 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition
  for Single and Multi-Person Video
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
89
40
0
25 Jan 2022
End-to-end Audio-visual Speech Recognition with Conformers
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
84
225
0
12 Feb 2021
1