ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.09034
  4. Cited By
Fusion of Audio and Visual Embeddings for Sound Event Localization and
  Detection

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

14 December 2023
Davide Berghi
Peipei Wu
Jinzheng Zhao
Wenwu Wang
Philip J. B. Jackson
ArXivPDFHTML

Papers citing "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"

2 / 2 papers shown
Title
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module
Xinyu Wang
Qian Wang
Haolin Huang
Yu Fang
Mengjie Xu
Qian Wang
31
0
0
31 Aug 2024
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Davide Berghi
Philip J. B. Jackson
42
0
0
01 Jun 2024
1