Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.09034
Cited By
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
14 December 2023
Davide Berghi
Peipei Wu
Jinzheng Zhao
Wenwu Wang
Philip J. B. Jackson
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"
2 / 2 papers shown
Title
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module
Xinyu Wang
Qian Wang
Haolin Huang
Yu Fang
Mengjie Xu
Qian Wang
31
0
0
31 Aug 2024
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Davide Berghi
Philip J. B. Jackson
42
0
0
01 Jun 2024
1