ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.06405
  4. Cited By
Heterogeneous bimodal attention fusion for speech emotion recognition

Heterogeneous bimodal attention fusion for speech emotion recognition

9 March 2025
Jiachen Luo
Huy Phan
Lin Wang
Joshua Reiss
ArXivPDFHTML

Papers citing "Heterogeneous bimodal attention fusion for speech emotion recognition"

26 / 26 papers shown
Title
Revisiting Multi-modal Emotion Learning with Broad State Space Models
  and Probability-guidance Fusion
Revisiting Multi-modal Emotion Learning with Broad State Space Models and Probability-guidance Fusion
Yuntao Shou
Tao Meng
Fuchen Zhang
Nan Yin
Keqin Li
Mamba
67
22
0
27 Apr 2024
Learning Emotion Representations from Verbal and Nonverbal Communication
Learning Emotion Representations from Verbal and Nonverbal Communication
Sitao Zhang
Yimu Pan
Jianmin Wang
VLM
94
24
0
22 May 2023
cross-modal fusion techniques for utterance-level emotion recognition
  from text and speech
cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Jiacheng Luo
Huy P Phan
Joshua Reiss
44
12
0
05 Feb 2023
M2R2: Missing-Modality Robust emotion Recognition framework with
  iterative data augmentation
M2R2: Missing-Modality Robust emotion Recognition framework with iterative data augmentation
Ning Wang
50
13
0
05 May 2022
EmoCaps: Emotion Capsule based Model for Conversational Emotion
  Recognition
EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition
Zaijing Li
Fengxiao Tang
Ming Zhao
Yusen Zhu
73
99
0
25 Mar 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A Survey
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
66
109
0
02 Mar 2022
data2vec: A General Framework for Self-supervised Learning in Speech,
  Vision and Language
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSL
VLM
ViT
89
852
0
07 Feb 2022
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA
AI4CE
76
168
0
11 Oct 2021
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion
  Recognition in Conversation
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation
Jingwen Hu
Yuchen Liu
Jinming Zhao
Qin Jin
63
205
0
14 Jul 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
46
360
0
08 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
820
29,167
0
26 Feb 2021
COSMIC: COmmonSense knowledge for eMotion Identification in
  Conversations
COSMIC: COmmonSense knowledge for eMotion Identification in Conversations
Deepanway Ghosal
Navonil Majumder
Alexander Gelbukh
Rada Mihalcea
Soujanya Poria
93
317
0
06 Oct 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech
  Representations
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
Alexei Baevski
Henry Zhou
Abdel-rahman Mohamed
Michael Auli
SSL
236
5,774
0
20 Jun 2020
EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's
  Principle
EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's Principle
Trisha Mittal
P. Guhan
Uttaran Bhattacharya
Rohan Chandra
Aniket Bera
Tianyi Zhou
102
133
0
14 Mar 2020
Multimodal Affective States Recognition Based on Multiscale CNNs and
  Biologically Inspired Decision Fusion Model
Multimodal Affective States Recognition Based on Multiscale CNNs and Biologically Inspired Decision Fusion Model
Yuxuan Zhao
Xinyan Cao
Jinlong Lin
Dunshan Yu
Xixin Cao
33
28
0
29 Nov 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
529
24,351
0
26 Jul 2019
COMET: Commonsense Transformers for Automatic Knowledge Graph
  Construction
COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
Antoine Bosselut
Hannah Rashkin
Maarten Sap
Chaitanya Malaviya
Asli Celikyilmaz
Yejin Choi
82
910
0
12 Jun 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai
Shaojie Bai
Paul Pu Liang
J. Zico Kolter
Louis-Philippe Morency
Ruslan Salakhutdinov
72
1,296
0
01 Jun 2019
Multitask learning for frame-level instrument recognition
Multitask learning for frame-level instrument recognition
Yun-Ning Hung
Yian Chen
Yi-Hsuan Yang
116
33
0
03 Nov 2018
DialogueRNN: An Attentive RNN for Emotion Detection in Conversations
DialogueRNN: An Attentive RNN for Emotion Detection in Conversations
Navonil Majumder
Soujanya Poria
Devamanyu Hazarika
Rada Mihalcea
Alexander Gelbukh
Min Zhang
64
718
0
01 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.6K
94,511
0
11 Oct 2018
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in
  Conversations
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Gautam Naik
Min Zhang
Rada Mihalcea
98
1,065
0
05 Oct 2018
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
642
130,942
0
12 Jun 2017
Semi-supervised sequence tagging with bidirectional language models
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters
Bridger Waleed Ammar
Chandra Bhagavatula
Russell Power
76
635
0
29 Apr 2017
Distributed Representations of Sentences and Documents
Distributed Representations of Sentences and Documents
Quoc V. Le
Tomas Mikolov
FaML
241
9,235
0
16 May 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
363
33,500
0
16 Oct 2013
1