1808.05561
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
16 August 2018
Samuel Albanie
Arsha Nagrani
Andrea Vedaldi
Andrew Zisserman
CVBM
Papers citing
"Emotion Recognition in Speech using Cross-Modal Transfer in the Wild"
35 / 35 papers shown
Title
A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
Hui Ma
Jian Wang
Hongfei Lin
Bo Zhang
Yijia Zhang
Bo Xu
23
40
0
31 Oct 2023
VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Yanan Wang
Donghuo Zeng
Shinya Wada
Satoshi Kurihara
32
6
0
27 Sep 2023
Teacher-Student Architecture for Knowledge Distillation: A Survey
Chengming Hu
Xuan Li
Danyang Liu
Haolun Wu
Xi Chen
Ju Wang
Xue Liu
21
16
0
08 Aug 2023
Recursive Joint Attention for Audio-Visual Fusion in Regression based Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
19
10
0
17 Apr 2023
Speaker Recognition in Realistic Scenario Using Multimodal Data
Saqlain Hussain Shah
M. S. Saeed
Shah Nawaz
Muhammad Haroon Yousaf
CVBM
26
8
0
25 Feb 2023
Audio Representation Learning by Distilling Video as Privileged Information
Amirhossein Hajavi
Ali Etemad
18
4
0
06 Feb 2023
Vision Transformer with Attentive Pooling for Robust Facial Expression Recognition
Fanglei Xue
Qiangchang Wang
Zichang Tan
Zhongsong Ma
G. Guo
ViT
35
67
0
11 Dec 2022
Teacher-Student Architecture for Knowledge Learning: A Survey
Chengming Hu
Xuan Li
Dan Liu
Xi Chen
Ju Wang
Xue Liu
20
35
0
28 Oct 2022
Learning Diversified Feature Representations for Facial Expression Recognition in the Wild
Negar Heidari
Alexandros Iosifidis
CVBM
26
3
0
17 Oct 2022
Rethinking the Learning Paradigm for Facial Expression Recognition
Weijie Wang
N. Sebe
Bruno Lepri
36
2
0
30 Sep 2022
Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
48
31
0
19 Sep 2022
CIAO! A Contrastive Adaptation Mechanism for Non-Universal Facial Expression Recognition
Pablo V. A. Barros
A. Sciutti
CVBM
29
7
0
10 Aug 2022
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss
Riccardo Franceschini
Enrico Fini
Cigdem Beyan
Alessandro Conti
F. Arrigoni
Elisa Ricci
SSL
OffRL
34
16
0
23 Jul 2022
Deep Multimodal Guidance for Medical Image Classification
Mayur Mallya
Ghassan Hamarneh
30
13
0
10 Mar 2022
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors
Wen Wu
C. Zhang
Xixin Wu
P. Woodland
48
14
0
08 Mar 2022
Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
28
36
0
16 Feb 2022
Keyword localisation in untranscribed speech using visually grounded speech models
Kayode Olaleye
Dan Oneaţă
Herman Kamper
24
7
0
02 Feb 2022
Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
R Gnana Praveen
Eric Granger
P. Cardinal
CVBM
23
40
0
09 Nov 2021
TransFER: Learning Relation-aware Facial Expression Representations with Transformers
Fanglei Xue
Qiangchang Wang
G. Guo
ViT
39
183
0
25 Aug 2021
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation
Sarala Padi
S. O. Sadjadi
Dinesh Manocha
Ram D. Sriram
16
34
0
05 Aug 2021
Dive into Ambiguity: Latent Distribution Mining and Pairwise Uncertainty Estimation for Facial Expression Recognition
Jiahui She
Yibo Hu
Hailin Shi
Jun Wang
Qiu Shen
Tao Mei
25
186
0
01 Apr 2021
Speech Emotion Recognition using Semantic Information
Panagiotis Tzirakis
Anh-Tuan Nguyen
S. Zafeiriou
Björn W. Schuller
15
19
0
04 Mar 2021
Disentanglement for audio-visual emotion recognition using multitask setup
Raghuveer Peri
Srinivas Parthasarathy
Charles Bradshaw
Shiva Sundaram
23
11
0
11 Feb 2021
asya: Mindful verbal communication using deep learning
Ē. Urtāns
Ariel Tabaks
VLM
28
1
0
20 Aug 2020
Dynamic Emotion Modeling with Learnable Graphs and Graph Inception Network
A. Shirian
S. Tripathi
T. Guha
13
7
0
06 Aug 2020
Knowledge Distillation: A Survey
Jianping Gou
B. Yu
Stephen J. Maybank
Dacheng Tao
VLM
19
2,837
0
09 Jun 2020
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
M. S. Saeed
Shah Nawaz
Pietro Morerio
Arif Mahmood
I. Gallo
Muhammad Haroon Yousaf
Alessio Del Bue
CVBM
26
25
0
28 Apr 2020
Suppressing Uncertainties for Large-Scale Facial Expression Recognition
Kai Wang
Xiaojiang Peng
Jianfei Yang
Shijian Lu
Yu Qiao
8
482
0
24 Feb 2020
Disentangled Speech Embeddings using Cross-modal Self-supervision
Arsha Nagrani
Joon Son Chung
Samuel Albanie
Andrew Zisserman
SSL
21
88
0
20 Feb 2020
An empirical analysis of information encoded in disentangled neural speaker representations
Raghuveer Peri
Haoqi Li
Krishna Somandepalli
Arindam Jati
Shrikanth Narayanan
DRL
21
13
0
10 Feb 2020
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion Recognition
Didan Deng
Zhaokang Chen
Yuqian Zhou
Bertram Shi
17
45
0
21 Nov 2019
STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits
Uttaran Bhattacharya
Trisha Mittal
Rohan Chandra
Tanmay Randhavane
Aniket Bera
Dinesh Manocha
CVBM
23
100
0
28 Oct 2019
Who Do I Sound Like? Showcasing Speaker Recognition Technology by YouTube Voice Search
R. Krishnan
Bilal Soomro
Mahesh Subedar
Ville Hautamaki
Tomi Kinnunen
19
5
0
08 Nov 2018
Learnable PINs: Cross-Modal Embeddings for Person Identity
Arsha Nagrani
Samuel Albanie
Andrew Zisserman
SSL
26
140
0
02 May 2018