ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11441
  4. Cited By
Multi-Modal Learning for AU Detection Based on Multi-Head Fused
  Transformers

Multi-Modal Learning for AU Detection Based on Multi-Head Fused Transformers

22 March 2022
Xiang Zhang
L. Yin
    ViT
ArXivPDFHTML

Papers citing "Multi-Modal Learning for AU Detection Based on Multi-Head Fused Transformers"

28 / 28 papers shown
Title
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
MAVEN: Multi-modal Attention for Valence-Arousal Emotion Network
Vrushank Ahire
Kunal Shah
Mudasir Nazir Khan
Nikhil Pakhale
L. Sookha
M. A. Ganaie
Abhinav Dhall
111
0
0
16 Mar 2025
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning
A. A. Salah
Itir Onal Ertugrul
CVBM
114
4
0
15 Jul 2024
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Aditya Prakash
Kashyap Chitta
Andreas Geiger
ViT
103
525
0
19 Apr 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
637
41,003
0
22 Oct 2020
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
216
5,073
0
08 Oct 2020
Learning Texture Transformer Network for Image Super-Resolution
Learning Texture Transformer Network for Image Super-Resolution
Fuzhi Yang
Huan Yang
Jianlong Fu
Hongtao Lu
B. Guo
SupR
ViT
74
722
0
07 Jun 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
743
41,932
0
28 May 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
385
13,035
0
26 May 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
419
20,127
0
23 Oct 2019
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression
  with AI, and Cross-Cultural Affect Recognition
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition
Fabien Ringeval
Björn Schuller
Michel Valstar
N. Cummins
R. Cowie
...
Ziping Zhao
Adria Mallol-Ragolta
Zhao Ren
M. Soleymani
Maja Pantic
55
299
0
10 Jul 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai
Shaojie Bai
Paul Pu Liang
J. Zico Kolter
Louis-Philippe Morency
Ruslan Salakhutdinov
76
1,301
0
01 Jun 2019
Semantic Relationships Guided Representation Learning for Facial Action
  Unit Recognition
Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition
Guanbin Li
Xin Zhu
Yirui Zeng
Qing Wang
Liang Lin
CVBM
44
155
0
22 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
287
5,732
0
26 Mar 2019
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition
  with Multimodal Training
Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
Mahdi Abavisani
Hamid Reza Vaezi Joze
Vishal M. Patel
63
131
0
14 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,770
0
11 Oct 2018
Multimodal Trajectory Predictions for Autonomous Driving using Deep
  Convolutional Networks
Multimodal Trajectory Predictions for Autonomous Driving using Deep Convolutional Networks
Henggang Cui
Vladan Radosavljevic
Fang-Chieh Chou
Tsung-Han Lin
Thi Nguyen
Tzu-Kuo Huang
J. Schneider
Nemanja Djuric
58
613
0
18 Sep 2018
Weakly-Supervised Convolutional Neural Networks for Multimodal Image
  Registration
Weakly-Supervised Convolutional Neural Networks for Multimodal Image Registration
Yipeng Hu
Marc Modat
Eli Gibson
Wenqi Li
N. Ghavami
...
M. Emberton
Sébastien Ourselin
J. A. Noble
D. Barratt
Tom Vercauteren
78
382
0
09 Jul 2018
Taskonomy: Disentangling Task Transfer Learning
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
118
1,218
0
23 Apr 2018
Multimodal Unsupervised Image-to-Image Translation
Multimodal Unsupervised Image-to-Image Translation
Xun Huang
Ming-Yuan Liu
Serge J. Belongie
Jan Kautz
130
2,482
0
12 Apr 2018
Deep Structure Inference Network for Facial Action Unit Recognition
Deep Structure Inference Network for Facial Action Unit Recognition
C. Corneanu
Meysam Madadi
Sergio Escalera
CVBM
48
128
0
15 Mar 2018
Deep Adaptive Attention for Joint Facial Action Unit Detection and Face
  Alignment
Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment
Zhiwen Shao
Zhilei Liu
Jianfei Cai
Lizhuang Ma
CVBM
90
175
0
15 Mar 2018
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
687
131,526
0
12 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
91
2,928
0
26 May 2017
EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial
  Action Unit Detection
EAC-Net: A Region-based Deep Enhancing and Cropping Approach for Facial Action Unit Detection
Wei Li
Farnaz Abtahi
Zhigang Zhu
L. Yin
39
124
0
09 Feb 2017
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
410
10,482
0
21 Jul 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
193,878
0
10 Dec 2015
EmoNets: Multimodal deep learning approaches for emotion recognition in
  video
EmoNets: Multimodal deep learning approaches for emotion recognition in video
Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Çağlar Gülçehre
Vincent Michalski
...
Aaron Courville
Pascal Vincent
Roland Memisevic
C. Pal
Yoshua Bengio
186
401
0
05 Mar 2015
ModDrop: adaptive multi-modal gesture recognition
ModDrop: adaptive multi-modal gesture recognition
Natalia Neverova
Christian Wolf
Graham W. Taylor
Florian Nebout
89
295
0
31 Dec 2014
1