
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
Papers citing "AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement"
10 / 10 papers shown
Title |
---|
![]() ModDrop: adaptive multi-modal gesture recognition Natalia Neverova Christian Wolf Graham W. Taylor Florian Nebout |