Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.10839
Cited By
Deep Learning for Visual Speech Analysis: A Survey
22 May 2022
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Y. Guo
Xin Xu
M. Pietikäinen
Li Liu
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Learning for Visual Speech Analysis: A Survey"
15 / 15 papers shown
Title
OXSeg: Multidimensional attention UNet-based lip segmentation using semi-supervised lip contours
Hanie Moghaddasi
Christina Chambers
Sarah N. Mattson
Jeffrey R. Wozniak
Claire D. Coles
Raja Mukherjee
Michael Suttie
34
0
0
08 May 2025
CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking
Yiming Li
Kaiying Yan
Shuo Shao
Tongqing Zhai
Shu-Tao Xia
Zhanyue Qin
D. Tao
AAML
146
0
0
02 Mar 2025
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning
William A. Stigall
62
0
0
14 Oct 2024
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
Yixiang Zhuang
Baoping Cheng
Yao Cheng
Yuntao Jin
Renshuai Liu
Chengyang Li
Xuan Cheng
Jing Liao
Juncong Lin
CVBM
3DH
34
6
0
19 Apr 2024
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading
Wenhao Zhang
Jun Wang
Yong Luo
Lei Yu
Wei Yu
Zheng He
Jialie Shen
43
0
0
18 Apr 2024
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
Ziqiao Peng
Hao Wu
Zhenbo Song
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
CVBM
24
99
0
20 Mar 2023
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning
Suzhe Wang
Lincheng Li
Yueqing Ding
Xin Yu
CVBM
66
117
0
06 Dec 2021
End-to-end Audio-visual Speech Recognition with Conformers
Pingchuan Ma
Stavros Petridis
M. Pantic
84
225
0
12 Feb 2021
Lipreading using Temporal Convolutional Networks
Brais Martínez
Pingchuan Ma
Stavros Petridis
M. Pantic
168
239
0
23 Jan 2020
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Z. Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
207
820
0
12 Jun 2018
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,567
0
17 Apr 2017
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,450
0
21 Nov 2016
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
162
784
0
16 Nov 2016
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
233
7,904
0
13 Jun 2015
1