
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos

12 February 2020
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer

Papers citing "An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos"

24 / 24 papers shown
Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion
Jun-chen Yu
Yang Zheng
Lei Wang
Yongqi Wang
Shengfan Xu
CVBM
75
0
0
15 Mar 2025
Smile upon the Face but Sadness in the Eyes: Emotion Recognition based on Facial Expressions and Eye Behaviors
Yuanyuan Liu
Lin Wei
Kejun Liu
Yibing Zhan
Zijing Chen
Zhe Chen
Shiguang Shan
33
1
0
08 Nov 2024
VEMOCLAP: A video emotion classification web application
Serkan Sulun
Paula Viana
M. Davies
VLM
30
0
0
22 Oct 2024
Dual-path Collaborative Generation Network for Emotional Video Captioning
Cheng Ye
Weidong Chen
Jingyu Li
L. Zhang
Zhendong Mao
92
1
0
06 Aug 2024
Benchmarking Micro-action Recognition: Dataset, Methods, and Applications
Dan Guo
Kun Li
Bin Hu
Yan Zhang
Meng Wang
62
38
0
08 Mar 2024
Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics
Pengcheng Zhao
Yanxiang Chen
Yang Zhao
Wei Jia
Zhao Zhang
Ronggang Wang
Richang Hong
DiffM
22
1
0
24 Jan 2024
Affective Video Content Analysis: Decade Review and New Perspectives
Junxiao Xue
Jie Wang
Xuecheng Wu
Qian Zhang
25
0
0
26 Oct 2023
MACP: Efficient Model Adaptation for Cooperative Perception
Yunsheng Ma
Juanwu Lu
Can Cui
Sicheng Zhao
Xu Cao
Wenqian Ye
Ziran Wang
24
11
0
25 Oct 2023
Unlocking the Emotional World of Visual Media: An Overview of the Science, Research, and Impact of Understanding Emotion
James Z. Wang
Sicheng Zhao
Chenyan Wu
Reginald B. Adams
M. Newman
T. Shafir
Rachelle Tsachor
36
30
0
25 Jul 2023
CORAE: A Tool for Intuitive and Continuous Retrospective Evaluation of Interactions
Michael J. Sack
Maria Teresa Parreira
Jenny Xiyu Fu
Asher Lipman
Hifza Javed
Nawid Jamali
Malte Jung
12
3
0
29 Jun 2023
CEMFormer: Learning to Predict Driver Intentions from In-Cabin and External Cameras via Spatial-Temporal Transformers
Yunsheng Ma
Wenqian Ye
Xu Cao
Amr Abdelraouf
Kyungtae Han
Rohit Gupta
Ziran Wang
43
11
0
13 May 2023
Noise-Resistant Multimodal Transformer for Emotion Recognition
Y. Liu
Haoyu Zhang
Yibing Zhan
Zijing Chen
Guanghao Yin
Lin Wei
Zhe Chen
ViT
37
3
0
04 May 2023
ViT-DD: Multi-Task Vision Transformer for Semi-Supervised Driver Distraction Detection
Yunsheng Ma
Ziran Wang
ViT
41
14
0
19 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
56
31
0
19 Sep 2022
Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition
Yi Zhang
Mingyuan Chen
Jundong Shen
Chongjun Wang
21
59
0
15 Jan 2022
Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking
Yidi Li
Hong Liu
Hao Tang
17
20
0
14 Dec 2021
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
31
40
0
09 Nov 2021
Affective Image Content Analysis: Two Decades Review and New Perspectives
Sicheng Zhao
Xingxu Yao
Jufeng Yang
G. Jia
Guiguang Ding
Tat-Seng Chua
Björn W. Schuller
Kurt Keutzer
3DV
33
78
0
30 Jun 2021
Computational Emotion Analysis From Images: Recent Advances and Future Directions
Sicheng Zhao
Quanwei Huang
Youbao Tang
Xingxu Yao
Jufeng Yang
Guiguang Ding
Björn W. Schuller
24
18
0
19 Mar 2021
Privacy-Preserving Video Classification with Convolutional Neural Networks
Sikha Pentyala
Rafael Dowsley
Martine De Cock
PICV
27
21
0
06 Feb 2021
Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources
Sicheng Zhao
Yang Xiao
Jiang Guo
Xiangyu Yue
Jufeng Yang
Ravi Krishna
Pengfei Xu
Kurt Keutzer
27
17
0
17 Nov 2020
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning
Amirhossein Hajavi
Ali Etemad
18
2
0
03 Sep 2020
Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space
Sicheng Zhao
Yaxian Li
Xingxu Yao
Weizhi Nie
Pengfei Xu
Jufeng Yang
Kurt Keutzer
19
29
0
22 Aug 2020
PDANet: Polarity-consistent Deep Attention Network for Fine-grained Visual Emotion Regression
Sicheng Zhao
Zizhou Jia
Hui Chen
Leida Li
Guiguang Ding
Kurt Keutzer
33
62
0
11 Sep 2019