Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition

20 February 2024
David Gimeno-Gómez
Carlos David Martínez Hinarejos

Papers citing "Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition"

13 / 13 papers shown
LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild
David Gimeno-Gómez
Carlos David Martínez Hinarejos
29
8
0
21 Nov 2023
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Pingchuan Ma
A. Haliassos
Adriana Fernandez-Lopez
Honglie Chen
Stavros Petridis
Maja Pantic
39
109
0
25 Mar 2023
Visual Keyword Spotting with Attention
Prajwal K R
Liliane Momeni
Triantafyllos Afouras
Andrew Zisserman
31
13
0
29 Oct 2021
Look Who's Talking: Active Speaker Detection in the Wild
You Jin Kim
Hee-Soo Heo
Soyeon Choe
Soo-Whan Chung
Yoohwan Kwon
Bong-Jin Lee
Youngki Kwon
Joon Son Chung
73
20
0
17 Aug 2021
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection
Ruijie Tao
Zexu Pan
Rohan Kumar Das
Xinyuan Qian
Mike Zheng Shou
Haizhou Li
41
179
0
14 Jul 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
Maja Pantic
98
228
0
12 Feb 2021
Silent Speech Interfaces for Speech Restoration: A Review
Jose Andres Gonzalez Lopez
Alejandro Gomez-Alanis
Juan M. Martín-Donas
J. L. Pérez-Córdoba
A. Gómez
48
85
0
04 Sep 2020
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati
James Qin
Chung-Cheng Chiu
Niki Parmar
Yu Zhang
...
Wei Han
Shibo Wang
Zhengdong Zhang
Yonghui Wu
Ruoming Pang
195
3,082
0
16 May 2020
LRS3-TED: a large-scale dataset for visual speech recognition
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
56
437
0
03 Sep 2018
Scaling Neural Machine Translation
Myle Ott
Sergey Edunov
David Grangier
Michael Auli
AIMat
159
612
0
01 Jun 2018
Resolution limits on visual speech recognition
Helen L. Bear
R. Harvey
B. Theobald
Yuxuan Lan
32
21
0
03 Oct 2017
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
235
788
0
16 Nov 2016
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
136
2,261
0
05 Aug 2015