Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.02172
Cited By
A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognition
3 November 2021
Ziwang Fu
Feng Liu
Hanyang Wang
Jiayin Qi
Xiangling Fu
Aimin Zhou
Zhibin Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognition"
7 / 7 papers shown
Title
Multimodal Emotion Recognition using Audio-Video Transformer Fusion with Cross Attention
Joe Dhanith
Shravan Venkatraman
Modigari Narendra
Vigya Sharma
Santhosh Malarvannan
84
0
0
20 Feb 2025
MultiMAE-DER: Multimodal Masked Autoencoder for Dynamic Emotion Recognition
Peihao Xiang
Chaohao Lin
Kaida Wu
Ou Bai
34
3
0
28 Apr 2024
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Samir Sadok
Simon Leglaive
Renaud Séguier
SSL
81
6
0
05 May 2023
Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition
Vincent Karas
M. Tellamekala
Adria Mallol-Ragolta
M. Valstar
Björn W. Schuller
22
13
0
24 Mar 2022
Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
Hengshun Zhou
Debin Meng
Yuanyuan Zhang
Xiaojiang Peng
Jun Du
Kai Wang
Yu Qiao
39
64
0
27 Dec 2020
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
297
10,225
0
16 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
167
1,464
0
06 Jun 2016
1