arXiv: 1908.02590
Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders
7 August 2019
M. Sadeghi
Simon Leglaive
Xavier Alameda-Pineda
Laurent Girin
Radu Horaud
DiffM
Papers citing
"Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders"
14 / 14 papers shown
Title
Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation
Akam Rahimi
Triantafyllos Afouras
Andrew Zisserman
40
28
0
02 Jan 2025
Diffusion-based Unsupervised Audio-visual Speech Enhancement
Jean-Eudes Ayilo
Mostafa Sadeghi
Romain Serizel
Xavier Alameda-Pineda
DiffM
25
0
0
04 Oct 2024
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
Chaeyoung Jung
Suyeon Lee
Ji-Hoon Kim
Joon Son Chung
DiffM
47
4
0
13 Jun 2024
Missingness-resilient Video-enhanced Multimodal Disfluency Detection
Payal Mohapatra
Shamika Likhite
Subrata Biswas
Bashima Islam
Qi Zhu
49
2
0
11 Jun 2024
Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
Tassadaq Hussain
K. Dashtipour
Yu Tsao
Amir Hussain
29
2
0
26 Feb 2024
Audio-visual video-to-speech synthesis with synthesized input audio
Triantafyllos Kefalas
Yannis Panagakis
M. Pantic
VGen
DiffM
38
1
0
31 Jul 2023
Integrating Uncertainty into Neural Network-based Speech Enhancement
Hu Fang
Dennis Becker
S. Wermter
Timo Gerkmann
UQCV
32
2
0
15 May 2023
Neural Target Speech Extraction: An Overview
Kateřina Žmolíková
Marc Delcroix
Tsubasa Ochiai
K. Kinoshita
Jan "Honza" Černocký
Dong Yu
23
86
0
31 Jan 2023
A weighted-variance variational autoencoder model for speech enhancement
A. Golmakani
M. Sadeghi
Xavier Alameda-Pineda
Romain Serizel
30
1
0
02 Nov 2022
Visual Acoustic Matching
Changan Chen
Ruohan Gao
P. Calamia
Kristen Grauman
21
56
0
14 Feb 2022
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement
Tassadaq Hussain
Wei-Chien Wang
M. Gogate
K. Dashtipour
Yu Tsao
Xugang Lu
A. Ahsan
Amir Hussain
21
3
0
24 Jan 2022
SINVAD: Search-based Image Space Navigation for DNN Image Classifier Test Input Generation
Sungmin Kang
R. Feldt
S. Yoo
AAML
26
32
0
19 May 2020
Robust Speaker Recognition Using Speech Enhancement And Attention Model
Yanpei Shi
Qiang Huang
Thomas Hain
27
25
0
14 Jan 2020
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
M. Sadeghi
Xavier Alameda-Pineda
13
21
0
23 Dec 2019