Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.01768
Cited By
Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception
5 September 2022
Jiadong Wang
Xinyuan Qian
Haizhou Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predict-and-Update Network: Audio-Visual Speech Recognition Inspired by Human Speech Perception"
6 / 6 papers shown
Title
Target Active Speaker Detection with Audio-visual Cues
Yiding Jiang
Ruijie Tao
Zexu Pan
Haizhou Li
28
16
0
22 May 2023
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert
Jiadong Wang
Xinyuan Qian
Malu Zhang
R. Tan
Haizhou Li
EGVM
22
93
0
29 Mar 2023
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
89
40
0
25 Jan 2022
Fusing information streams in end-to-end audio-visual speech recognition
Wentao Yu
Steffen Zeiler
D. Kolossa
81
12
0
19 Apr 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
84
225
0
12 Feb 2021
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
164
784
0
16 Nov 2016
1