Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition

20 February 2024

Papers citing "Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition"


LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | David Gimeno-Gómez, Carlos David Martínez Hinarejos | 29 8 0 | 21 Nov 2023
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | Pingchuan Ma, A. Haliassos, Adriana Fernandez-Lopez, Honglie Chen, Stavros Petridis, Maja Pantic | 39 109 0 | 25 Mar 2023
Visual Keyword Spotting with Attention | Prajwal K R, Liliane Momeni, Triantafyllos Afouras, Andrew Zisserman | 31 13 0 | 29 Oct 2021
Look Who's Talking: Active Speaker Detection in the Wild | You Jin Kim, Hee-Soo Heo, Soyeon Choe, Soo-Whan Chung, Yoohwan Kwon, Bong-Jin Lee, Youngki Kwon, Joon Son Chung | 73 20 0 | 17 Aug 2021
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection | Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li | 41 179 0 | 14 Jul 2021
End-to-end Audio-visual Speech Recognition with Conformers | Pingchuan Ma, Stavros Petridis, Maja Pantic | 98 228 0 | 12 Feb 2021
Silent Speech Interfaces for Speech Restoration: A Review | Jose Andres Gonzalez Lopez, Alejandro Gomez-Alanis, Juan M. Martín-Donas, J. L. Pérez-Córdoba, A. Gómez | 48 85 0 | 04 Sep 2020
Conformer: Convolution-augmented Transformer for Speech Recognition | Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, ..., Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang | 195 3,082 0 | 16 May 2020
LRS3-TED: a large-scale dataset for visual speech recognition | Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman | 56 437 0 | 03 Sep 2018
Scaling Neural Machine Translation | Myle Ott, Sergey Edunov, David Grangier, Michael Auli | AIMat | 159 612 0 | 01 Jun 2018
Resolution limits on visual speech recognition | Helen L. Bear, R. Harvey, B. Theobald, Yuxuan Lan | 32 21 0 | 03 Oct 2017
Lip Reading Sentences in the Wild | Joon Son Chung, A. Senior, Oriol Vinyals, Andrew Zisserman | 235 788 0 | 16 Nov 2016
Listen, Attend and Spell | William Chan, Navdeep Jaitly, Quoc V. Le, Oriol Vinyals | RALM | 136 2,261 0 | 05 Aug 2015