ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.04988
  4. Cited By
LCANet: End-to-End Lipreading with Cascaded Attention-CTC

LCANet: End-to-End Lipreading with Cascaded Attention-CTC

13 March 2018
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
ArXivPDFHTML

Papers citing "LCANet: End-to-End Lipreading with Cascaded Attention-CTC"

15 / 15 papers shown
Title
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
54
0
0
07 May 2025
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
Zhuojiang Cai
Yuhan Ma
Feng Lu
30
0
0
26 Jan 2024
Show Me Your Face, And I'll Tell You How You Speak
Show Me Your Face, And I'll Tell You How You Speak
Christen Millerdurai
L. A. Khaliq
Timon Ulrich
CVBM
68
0
0
28 Jun 2022
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction
  and Lip Reading
LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading
Leyuan Qu
C. Weber
S. Wermter
38
23
0
09 Dec 2021
"Notic My Speech" -- Blending Speech Patterns With Multimedia
"Notic My Speech" -- Blending Speech Patterns With Multimedia
Dhruva Sahrawat
Yaman Kumar Singla
Shashwat Aggarwal
Yifang Yin
R. Shah
Roger Zimmermann
33
3
0
12 Jun 2020
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Prajwal K R
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
18
110
0
17 May 2020
Mutual Information Maximization for Effective Lip Reading
Mutual Information Maximization for Effective Lip Reading
Xingyuan Zhao
Shuang Yang
Shiguang Shan
Xilin Chen
18
58
0
13 Mar 2020
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep
  Visual Speech Recognition
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition
Yuanhang Zhang
Shuang Yang
Jingyun Xiao
Shiguang Shan
Xilin Chen
12
64
0
06 Mar 2020
Audio-Visual Decision Fusion for WFST-based and seq2seq Models
Audio-Visual Decision Fusion for WFST-based and seq2seq Models
R. Aralikatti
Sharad Roy
Abhinav Thanda
D. Margam
Pujitha Appan Kandala
Tanay Sharma
S. Venkatesan
19
1
0
29 Jan 2020
Spatial Group-wise Enhance: Improving Semantic Feature Learning in
  Convolutional Networks
Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks
Xiang Li
Xiaolin Hu
Jian Yang
24
193
0
23 May 2019
Selective Kernel Networks
Selective Kernel Networks
Xiang Li
Wenhai Wang
Xiaolin Hu
Jian Yang
25
2,001
0
15 Mar 2019
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Georgios Tzimiropoulos
M. Pantic
14
128
0
28 Sep 2018
Zero-shot keyword spotting for visual speech recognition in-the-wild
Zero-shot keyword spotting for visual speech recognition in-the-wild
Themos Stafylakis
Georgios Tzimiropoulos
32
38
0
23 Jul 2018
Large-Scale Visual Speech Recognition
Large-Scale Visual Speech Recognition
Brendan Shillingford
Yannis Assael
Matthew W. Hoffman
T. Paine
Cían Hughes
...
Marie Mulville
Ben Coppin
Ben Laurie
A. Senior
Nando de Freitas
29
152
0
13 Jul 2018
Lip Reading Sentences in the Wild
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
185
784
0
16 Nov 2016
1