1804.04121
The Conversation: Deep Audio-Visual Speech Enhancement
11 April 2018
Triantafyllos Afouras
Joon Son Chung
Andrew Zisserman
Papers citing "The Conversation: Deep Audio-Visual Speech Enhancement"
19 / 19 papers shown
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
Akam Rahimi
Triantafyllos Afouras
Andrew Zisserman
116
29
0
02 Jan 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
67
0
0
04 Oct 2024
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
353
2,279
0
14 Jun 2018
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Andrew Owens
Alexei A. Efros
SSL
98
752
0
10 Apr 2018
End-to-end Audiovisual Speech Recognition
Stavros Petridis
Themos Stafylakis
Pingchuan Ma
Feipeng Cai
Georgios Tzimiropoulos
Maja Pantic
69
251
0
18 Feb 2018
Visual Speech Enhancement
Aviv Gabbay
Asaph Shamir
Shmuel Peleg
54
16
0
23 Nov 2017
Does Phase Matter For Monaural Source Separation?
M. L. Dubey
Garrett Kenyon
Nils Carlson
A. Thresher
55
8
0
02 Nov 2017
Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
Jen-Cheng Hou
Syu-Siang Wang
Ying-Hui Lai
Yu Tsao
Hsiu-Wen Chang
H. Wang
77
198
0
01 Sep 2017
Supervised Speech Separation Based on Deep Learning: An Overview
DeLiang Wang
Jitong Chen
SSL
77
1,373
0
24 Aug 2017
Improved Speech Reconstruction from Silent Video
Ariel Ephrat
Tavi Halperin
Shmuel Peleg
71
89
0
01 Aug 2017
You said that?
Joon Son Chung
A. Jamaludin
Andrew Zisserman
CVBM
72
259
0
08 May 2017
Complex spectrogram enhancement by convolutional neural network with multi-metrics learning
Szu-Wei Fu
Ting-Yao Hu
Yu Tsao
Xugang Lu
55
178
0
27 Apr 2017
Combining Residual Networks with LSTMs for Lipreading
Themos Stafylakis
Georgios Tzimiropoulos
VLM
74
308
0
12 Mar 2017
Lip Reading Sentences in the Wild
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
261
790
0
16 Nov 2016
LipNet: End-to-End Sentence-level Lipreading
Yannis Assael
Brendan Shillingford
Shimon Whiteson
Nando de Freitas
82
397
0
05 Nov 2016
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
1.4K
14,575
0
07 Oct 2016
Identity Mappings in Deep Residual Networks
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
354
10,184
0
16 Mar 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
Speech Recognition by Machine, A Review
M. Anusuya
S. Katti
89
393
0
13 Jan 2010