Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2003.00832
Cited By
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
12 February 2020
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos"
24 / 24 papers shown
Title
Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion
Jun-chen Yu
Yang Zheng
Lei Wang
Yongqi Wang
Shengfan Xu
CVBM
75
0
0
15 Mar 2025
Smile upon the Face but Sadness in the Eyes: Emotion Recognition based on Facial Expressions and Eye Behaviors
Yuanyuan Liu
Lin Wei
Kejun Liu
Yibing Zhan
Zijing Chen
Zhe Chen
Shiguang Shan
33
1
0
08 Nov 2024
VEMOCLAP: A video emotion classification web application
Serkan Sulun
Paula Viana
M. Davies
VLM
30
0
0
22 Oct 2024
Dual-path Collaborative Generation Network for Emotional Video Captioning
Cheng Ye
Weidong Chen
Jingyu Li
L. Zhang
Zhendong Mao
92
1
0
06 Aug 2024
Benchmarking Micro-action Recognition: Dataset, Methods, and Applications
Dan Guo
Kun Li
Bin Hu
Yan Zhang
Meng Wang
62
38
0
08 Mar 2024
Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics
Pengcheng Zhao
Yanxiang Chen
Yang Zhao
Wei Jia
Zhao Zhang
Ronggang Wang
Richang Hong
DiffM
22
1
0
24 Jan 2024
Affective Video Content Analysis: Decade Review and New Perspectives
Junxiao Xue
Jie Wang
Xuecheng Wu
Qian Zhang
25
0
0
26 Oct 2023
MACP: Efficient Model Adaptation for Cooperative Perception
Yunsheng Ma
Juanwu Lu
Can Cui
Sicheng Zhao
Xu Cao
Wenqian Ye
Ziran Wang
24
11
0
25 Oct 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
36
30
0
25 Jul 2023
CORAE: A Tool for Intuitive and Continuous Retrospective Evaluation of Interactions
Michael J. Sack
Maria Teresa Parreira
Jenny Xiyu Fu
Asher Lipman
Hifza Javed
Nawid Jamali
Malte Jung
12
3
0
29 Jun 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Amr Abdelraouf
Kyungtae Han
Rohit Gupta
Ziran Wang
43
11
0
13 May 2023
Noise-Resistant Multimodal Transformer for Emotion Recognition
Y. Liu
Haoyu Zhang
Yibing Zhan
Zijing Chen
Guanghao Yin
Lin Wei
Zhe Chen
ViT
37
3
0
04 May 2023
ViT-DD: Multi-Task Vision Transformer for Semi-Supervised Driver Distraction Detection
Yunsheng Ma
Ziran Wang
ViT
41
14
0
19 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
56
31
0
19 Sep 2022
Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition
Yi Zhang
Mingyuan Chen
Jundong Shen
Chongjun Wang
21
59
0
15 Jan 2022
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
17
20
0
14 Dec 2021
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
31
40
0
09 Nov 2021
Affective Image Content Analysis: Two Decades Review and New Perspectives
Sicheng Zhao
Xingxu Yao
Jufeng Yang
G. Jia
Guiguang Ding
Tat-Seng Chua
Björn W. Schuller
Kurt Keutzer
3DV
33
78
0
30 Jun 2021
Computational Emotion Analysis From Images: Recent Advances and Future Directions
Sicheng Zhao
Quanwei Huang
Youbao Tang
Xingxu Yao
Jufeng Yang
Guiguang Ding
Björn W. Schuller
24
18
0
19 Mar 2021
Privacy-Preserving Video Classification with Convolutional Neural Networks
Sikha Pentyala
Rafael Dowsley
Martine De Cock
PICV
27
21
0
06 Feb 2021
Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources
Sicheng Zhao
Yang Xiao
Jiang Guo
Xiangyu Yue
Jufeng Yang
Ravi Krishna
Pengfei Xu
Kurt Keutzer
27
17
0
17 Nov 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
Amirhossein Hajavi
Ali Etemad
18
2
0
03 Sep 2020
Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space
Sicheng Zhao
Yaxian Li
Xingxu Yao
Weizhi Nie
Pengfei Xu
Jufeng Yang
Kurt Keutzer
19
29
0
22 Aug 2020
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
33
62
0
11 Sep 2019
1