ModDrop: adaptive multi-modal gesture recognition

31 December 2014
Natalia Neverova
Christian Wolf
Graham W. Taylor
Florian Nebout

Papers citing "ModDrop: adaptive multi-modal gesture recognition"

33 / 33 papers shown
TACFN: Transformer-based Adaptive Cross-modal Fusion Network for Multimodal Emotion Recognition
Feng Liu
Ziwang Fu
Yansen Wang
Qijian Zheng
40
4
0
10 May 2025
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition
Andrew Rouditchenko
Saurabhchand Bhati
Samuel Thomas
Hilde Kuehne
Rogerio Feris
116
1
0
03 Feb 2025
Resource-Efficient Federated Multimodal Learning via Layer-wise and Progressive Training
Ye Lin Tun
Chu Myaet Thwal
Minh N. H. Nguyen
Choong Seon Hong
48
0
0
22 Jul 2024
An Attentional Recurrent Neural Network for Occlusion-Aware Proactive Anomaly Detection in Field Robot Navigation
Jihun Han
Tianchen Ji
Yoonsang Lee
Katherine Driggs-Campbell
24
3
0
28 Sep 2023
Multimodal Distillation for Egocentric Action Recognition
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
30
23
0
14 Jul 2023
Efficient Multimodal Neural Networks for Trigger-less Voice Assistants
Sai Srujana Buddi
U. Sarawgi
Tashweena Heeramun
Karan Sawnhey
Ed Yanosik
Saravana Rathinam
Saurabh N. Adya
33
5
0
20 May 2023
Correlation-Driven Multi-Level Multimodal Learning for Anomaly Detection on Multiple Energy Sources
Taehee Kim
Yuxing Tang
21
0
0
01 May 2023
Features Fusion Framework for Multimodal Irregular Time-series Events
Peiwang Tang
Xianchao Zhang
AI4TS
26
2
0
05 Sep 2022
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality
Wei-Ning Hsu
Bowen Shi
SSL
VLM
27
41
0
14 Jul 2022
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
Saghir Alfasly
Jian Lu
C. Xu
Yuru Zou
42
18
0
06 Mar 2022
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
Bowen Shi
Wei-Ning Hsu
Kushal Lakhotia
Abdel-rahman Mohamed
SSL
46
305
0
05 Jan 2022
Gesture Recognition with a Skeleton-Based Keyframe Selection Module
Yunsoo Kim
Hyun Myung
SLR
27
1
0
03 Dec 2021
Evaluation of an Audio-Video Multimodal Deepfake Dataset using Unimodal and Multimodal Detectors
Hasam Khalid
Minhan Kim
Shahroz Tariq
Simon S. Woo
23
82
0
07 Sep 2021
Stochastic Transformer Networks with Linear Competing Units: Application to end-to-end SL Translation
Andreas Voskou
Konstantinos P. Panousis
D. Kosmopoulos
Dimitris N. Metaxas
S. Chatzis
SLR
36
43
0
01 Sep 2021
Multi-modal Residual Perceptron Network for Audio-Video Emotion Recognition
Xin Chang
W. Skarbek
27
19
0
21 Jul 2021
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
Zhen Xu
David R. So
Andrew M. Dai
Mamba
58
51
0
03 Feb 2021
A Comprehensive Study on Deep Learning-based Methods for Sign Language Recognition
Nikolas Adaloglou
Theocharis Chatzis
Ilias Papastratis
Andreas Stergioulas
Georgios Th. Papadopoulos
Vassia Zacharopoulou
George J. Xydopoulos
Klimnis Atzakas
D. Papazachariou
P. Daras
34
37
0
24 Jul 2020
Learning to plan with uncertain topological maps
E. Beeching
J. Dibangoye
Olivier Simonin
Christian Wolf
19
40
0
10 Jul 2020
Evaluation Of Hidden Markov Models Using Deep CNN Features In Isolated Sign Recognition
Anil Osman Tur
H. Keles
25
12
0
19 Jun 2020
Towards Robust Pattern Recognition: A Review
Xu-Yao Zhang
Cheng-Lin Liu
C. Suen
OOD
HAI
19
102
0
12 Jun 2020
Modality Dropout for Improved Performance-driven Talking Faces
Ahmed Hussen Abdelaziz
B. Theobald
Paul Dixon
Reinhard Knothe
N. Apostoloff
Sachin Kajareker
24
36
0
27 May 2020
Deep-Aligned Convolutional Neural Network for Skeleton-based Action Recognition and Segmentation
Babak Hosseini
Romain Montagne
Barbara Hammer
29
22
0
12 Nov 2019
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Wang
Hongzhi Li
29
38
0
11 Oct 2019
Dynamic Gesture Recognition by Using CNNs and Star RGB: a Temporal Information Condensation
Clebeson Canuto dos Santos
J. L. A. Samatelo
R. Vassallo
9
54
0
10 Apr 2019
Dense Multimodal Fusion for Hierarchically Joint Representation
Di Hu
Feiping Nie
Xuelong Li
32
43
0
08 Oct 2018
Blockchain as a Service: A Decentralized and Secure Computing Paradigm
G. Mendis
Yifu Wu
Jin Wei
Moein Sabounchi
Rigoberto Roche'
21
21
0
05 Jul 2018
Deep Multimodal Subspace Clustering Networks
Mahdi Abavisani
Vishal M. Patel
28
163
0
17 Apr 2018
Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points
Fabien Baradel
Christian Wolf
J. Mille
Graham W. Taylor
35
150
0
22 Feb 2018
Explaining First Impressions: Modeling, Recognizing, and Explaining Apparent Personality from Videos
Hugo Jair Escalante
Heysem Kaya
A. A. Salah
Sergio Escalera
Yağmur Güçlütürk
...
Furkan Gürpinar
Achmadnoer Sukma Wicaksana
Cynthia C. S. Liem
Marcel van Gerven
R. Lier
20
61
0
02 Feb 2018
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
15
2,865
0
26 May 2017
Deep Multimodal Representation Learning from Temporal Data
Xitong Yang
Palghat Ramesh
Radha Chitta
S. Madhvanath
Edgar A. Bernal
Jiebo Luo
AI4TS
22
94
0
11 Apr 2017
ChaLearn Looking at People: A Review of Events and Resources
Sergio Escalera
Xavier Baro
Hugo Jair Escalante
Isabelle M Guyon
30
40
0
10 Jan 2017
EmoNets: Multimodal deep learning approaches for emotion recognition in video
Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Çağlar Gülçehre
Vincent Michalski
...
Aaron Courville
Pascal Vincent
Roland Memisevic
C. Pal
Yoshua Bengio
138
401
0
05 Mar 2015