Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.10687
Cited By
v1
v2
v3 (latest)
Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers
16 September 2024
Ruchik Mishra
Andrew Frye
M. M. Rayguru
Dan O. Popa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Personalized Speech Emotion Recognition in Human-Robot Interaction using Vision Transformers"
20 / 20 papers shown
Title
Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Muhamamd Haris Khan
Selamawit Asfaw
Dmitrii Iarchuk
Miguel Altamirano Cabrera
Luis Moreno
Issatay Tokmurziyev
Dzmitry Tsetserukou
93
2
0
12 Jan 2025
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
Adham Ibrahim
Shady Shehata
Ajinkya Kulkarni
Mukhtar Mohamed
Muhammad Abdul-Mageed
56
2
0
14 Jun 2024
Controlling Emotion in Text-to-Speech with Natural Language Prompts
Thomas Bott
Florian Lux
Ngoc Thang Vu
60
9
0
10 Jun 2024
Layer-Wise Analysis of Self-Supervised Acoustic Word Embeddings: A Study on Speech Emotion Recognition
Alexandra Saliba
Yuanchao Li
Ramon Sanabria
Catherine Lai
84
10
0
04 Feb 2024
Vesper: A Compact and Effective Pretrained Model for Speech Emotion Recognition
Weidong Chen
Xiaofen Xing
Peihao Chen
Xiangmin Xu
VLM
69
39
0
20 Jul 2023
Social Impressions of the NAO Robot and its Impact on Physiology
Ruchik Mishra
K. Welch
61
2
0
03 Jul 2023
SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Kai-Wei Chang
Wei-Cheng Tseng
Shang-Wen Li
Hung-yi Lee
78
23
0
31 Mar 2022
SepTr: Separable Transformer for Audio Spectrogram Processing
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Fahad Shahbaz Khan
ViT
68
31
0
17 Mar 2022
Self-attention fusion for audiovisual emotion recognition with incomplete data
K. Chumachenko
Alexandros Iosifidis
Moncef Gabbouj
118
40
0
26 Jan 2022
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
292
2,845
0
15 Jun 2021
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings
L. Pepino
Pablo Riera
Luciana Ferrer
76
365
0
08 Apr 2021
Self-paced ensemble learning for speech and audio classification
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
157
23
0
22 Mar 2021
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
79
192
0
28 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
682
41,483
0
22 Oct 2020
Non-linear Neurons with Human-like Apical Dendrite Activations
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
Nicolae-Cătălin Ristea
N. Sebe
81
21
0
02 Feb 2020
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Gautam Naik
Min Zhang
Rada Mihalcea
123
1,077
0
05 Oct 2018
On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
57
54
0
06 Apr 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
805
132,725
0
12 Jun 2017
Prototypical Networks for Few-shot Learning
Jake C. Snell
Kevin Swersky
R. Zemel
305
8,154
0
15 Mar 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
833
11,952
0
09 Mar 2017
1