2008.06682
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
15 August 2020
Shamane Siriwardhana
Andrew Reis
Rivindu Weerasekera
Suranga Nanayakkara
Papers citing
"Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition"
25 / 25 papers shown
"Yeah Right!" -- Do LLMs Exhibit Multimodal Feature Transfer?
Benjamin Z. Reichman
Kartik Talamadupula
77
0
0
07 Jan 2025
VISTANet: VIsual Spoken Textual Additive Net for Interpretable Multimodal Emotion Recognition
Puneet Kumar
Sarthak Malik
Balasubramanian Raman
Xiaobai Li
106
2
0
24 Aug 2022
Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Zhiyun Lu
Liangliang Cao
Yu Zhang
Chung-Cheng Chiu
James Fan
39
72
0
21 Nov 2019
Learning Relationships between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis
Zhongkai Sun
P. Sarma
W. Sethares
Yingyu Liang
48
320
0
13 Nov 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
62
330
0
10 Nov 2019
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
69
147
0
10 Nov 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
138
666
0
12 Oct 2019
DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation
Deepanway Ghosal
Navonil Majumder
Soujanya Poria
Niyati Chhaya
Alexander Gelbukh
77
512
0
30 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
217
3,667
0
06 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
524
24,351
0
26 Jul 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai
Shaojie Bai
Paul Pu Liang
J. Zico Kolter
Louis-Philippe Morency
Ruslan Salakhutdinov
72
1,296
0
01 Jun 2019
wav2vec: Unsupervised Pre-training for Speech Recognition
Steffen Schneider
Alexei Baevski
R. Collobert
Michael Auli
SSL
68
418
0
11 Apr 2019
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Santiago Pascual
Mirco Ravanelli
Joan Serrà
Antonio Bonafonte
Yoshua Bengio
SSL
107
251
0
06 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
95
3,147
0
01 Apr 2019
Revisiting Self-Supervised Visual Representation Learning
Alexander Kolesnikov
Xiaohua Zhai
Lucas Beyer
SSL
129
716
0
25 Jan 2019
Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations
Francesco Locatello
Stefan Bauer
Mario Lucic
Gunnar Rätsch
Sylvain Gelly
Bernhard Schölkopf
Olivier Bachem
OOD
113
1,466
0
29 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.5K
94,511
0
11 Oct 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
290
10,253
0
10 Jul 2018
Learning Factorized Multimodal Representations
Yao-Hung Hubert Tsai
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
Ruslan Salakhutdinov
DRL
98
407
0
16 Jun 2018
Adversarial Auto-encoders for Speech Based Emotion Recognition
Saurabh Sahu
Rahul Gupta
Ganesh Sivaraman
Wael AbdAlmageed
C. Espy-Wilson
GAN
50
66
0
06 Jun 2018
Multi-attention Recurrent Network for Human Communication Comprehension
Amir Zadeh
Paul Pu Liang
Soujanya Poria
Prateek Vij
Min Zhang
Louis-Philippe Morency
77
484
0
03 Feb 2018
Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study
S. Latif
R. Rana
Junaid Qadir
J. Epps
SSL
DRL
62
101
0
23 Dec 2017
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
208
4,989
0
02 Nov 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
640
130,942
0
12 Jun 2017
MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos
Amir Zadeh
Rowan Zellers
Eli Pincus
Louis-Philippe Morency
76
453
0
20 Jun 2016