MARLIN: Masked Autoencoder for facial video Representation LearnINg

12 November 2022
Zhixi Cai
Shreya Ghosh
Kalin Stefanov
Abhinav Dhall
Jianfei Cai
Hamid Rezatofighi
Reza Haffari
Munawar Hayat
    ViT
    CVBM

Papers citing "MARLIN: Masked Autoencoder for facial video Representation LearnINg"

39 / 39 papers shown
Title
PSG-MAE: Robust Multitask Sleep Event Monitoring using Multichannel PSG Reconstruction and Inter-channel Contrastive Learning
Yifei Wang
Qi Liu
Fuli Min
Honghao Wang
22
0
0
17 Apr 2025
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Ashutosh Chaubey
Xulang Guan
Mohammad Soleymani
CVBM
MLLM
VLM
77
0
0
09 Apr 2025
PE-CLIP: A Parameter-Efficient Fine-Tuning of Vision Language Models for Dynamic Facial Expression Recognition
Ibtissam Saadi
Abdenour Hadid
Douglas W. Cunningham
Abdelmalik Taleb-Ahmed
Y. E. Hillali
VLM
50
0
0
21 Mar 2025
Experimenting with Affective Computing Models in Video Interviews with Spanish-speaking Older Adults
Josep Lopez Camunas
Cristina Bustos
Yanjun Zhu
Raquel Ros
Àgata Lapedriza
54
0
0
28 Jan 2025
MVP: Multimodal Emotion Recognition based on Video and Physiological Signals
Valeriya Strizhkova
Hadi Kachmar
Hava Chaptoukaev
Raphael Kalandadze
Natia Kukhilava
...
Maria A. Zuluaga
Michal Balazia
A. Dantcheva
François Brémond
Laura M. Ferrari
41
0
0
06 Jan 2025
Spatio-Temporal Fuzzy-oriented Multi-Modal Meta-Learning for Fine-grained Emotion Recognition
Wenwen Qiang
Yuxuan Yang
Jingyao Wang
Changwen Zheng
76
0
0
18 Dec 2024
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook
Florinel-Alin Croitoru
Andrei Iulian Hiji
Vlad Hondru
Nicolae-Cătălin Ristea
Paul Irofti
Marius Popescu
Cristian Rusu
Radu Tudor Ionescu
F. Khan
Mubarak Shah
89
3
0
29 Nov 2024
Progressive Representation Learning for Real-Time UAV Tracking
Changhong Fu
Xiang Lei
Haobo Zuo
L. Yao
Guangze Zheng
Jia-Yu Pan
AI4TS
37
4
0
25 Sep 2024
A Noval Feature via Color Quantisation for Fake Audio Detection
Zhiyong Wang
Xiaopeng Wang
Yuankun Xie
Ruibo Fu
Zhengqi Wen
...
Guanjun Li
Xin Qi
Yi Lu
Xuefei Liu
Yongwei Li
28
1
0
20 Aug 2024
Masked Image Modeling: A Survey
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
N. Sebe
69
6
0
13 Aug 2024
Representation Learning and Identity Adversarial Training for Facial Behavior Understanding
Mang Ning
A. A. Salah
Itir Onal Ertugrul
CVBM
87
4
0
15 Jul 2024
SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale
Shester Gueuwou
Xiaodan Du
Greg Shakhnarovich
Karen Livescu
SLR
34
3
0
11 Jun 2024
Evolving from Single-modal to Multi-modal Facial Deepfake Detection: Progress and Challenges
Ping Liu
Qiqi Tao
Joey Tianyi Zhou
50
0
0
11 Jun 2024
AVFF: Audio-Visual Feature Fusion for Video Deepfake Detection
Trevine Oorloff
Surya Koppisetti
Nicolò Bonettini
Divyaraj Solanki
Ben Colman
Yaser Yacoob
Ali Shahriyari
Gaurav Bharaj
37
21
0
05 Jun 2024
Task-adaptive Q-Face
Haomiao Sun
Mingjie He
Shiguang Shan
Hu Han
Xilin Chen
CVBM
43
4
0
15 May 2024
A Timely Survey on Vision Transformer for Deepfake Detection
Zhikan Wang
Zhongyao Cheng
Jiajie Xiong
Xun Xu
Tianrui Li
B. Veeravalli
Xulei Yang
34
5
0
14 May 2024
Real, fake and synthetic faces -- does the coin have three sides?
Shahzeb Naeem
Ramzi Al-Sharawi
Muhammad Riyyan Khan
Usman Tariq
Abhinav Dhall
H. Al-Nashash
61
1
0
02 Apr 2024
Self-Supervised Facial Representation Learning with Facial Region Awareness
Zheng Gao
Ioannis Patras
SSL
43
10
0
04 Mar 2024
LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection
Dat Nguyen
Nesryne Mejri
I. Singh
Polina Kuleshova
Marcella Astrid
Anis Kacem
Enjie Ghorbel
Djamila Aouada
30
25
0
24 Jan 2024
Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Yufeng Yin
Ishwarya Ananthabhotla
V. Ithapu
Stavros Petridis
Yu-Hsiang Wu
Christi Miller
CVBM
34
3
0
17 Jan 2024
From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos
Yin Chen
Jia Li
Shiguang Shan
Meng Wang
Richang Hong
46
32
0
09 Dec 2023
AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Zhixi Cai
Shreya Ghosh
Aman Pankaj Adatia
Munawar Hayat
Abhinav Dhall
Kalin Stefanov
21
27
0
26 Nov 2023
ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation
Xing Di
Yiyu Zheng
Xiaoming Liu
Yu Cheng
18
3
0
03 Nov 2023
LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
Di Chang
Yufeng Yin
Zongjia Li
Minh Tran
M. Soleymani
CVBM
52
12
0
18 Aug 2023
Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
Zhixi Cai
Shreya Ghosh
Abhinav Dhall
Tom Gedeon
Kalin Stefanov
Munawar Hayat
83
22
0
03 May 2023
Do I Have Your Attention: A Large Scale Engagement Prediction Dataset and Baselines
Monisha Singh
Ximi Hoque
Donghuo Zeng
Yanan Wang
K. Ikeda
Abhinav Dhall
18
16
0
01 Feb 2023
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
143
163
0
06 Dec 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,443
0
11 Nov 2021
Unsupervised Multimodal Language Representations using Convolutional Autoencoders
Panagiotis Koromilas
Theodoros Giannakopoulos
SSL
25
10
0
06 Oct 2021
Intriguing Properties of Vision Transformers
Muzammal Naseer
Kanchana Ranasinghe
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
ViT
256
621
0
21 May 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
245
484
0
20 Apr 2021
M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection
Junke Wang
Zuxuan Wu
Wenhao Ouyang
Xintong Han
Jingjing Chen
Ser-Nam Lim
Yu-Gang Jiang
ViT
107
257
0
20 Apr 2021
FaceX-Zoo: A PyTorch Toolbox for Face Recognition
Jun Wang
Yinglu Liu
Yibo Hu
Hailin Shi
Tao Mei
CVBM
40
101
0
12 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
F. Khan
M. Shah
ViT
227
2,430
0
04 Jan 2021
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
224
2,234
0
14 Jun 2018
Deep Facial Expression Recognition: A Survey
Shan Li
Weihong Deng
139
1,280
0
23 Apr 2018
Recasting Residual-based Local Descriptors as Convolutional Neural Networks: an Application to Image Forgery Detection
D. Cozzolino
Giovanni Poggi
L. Verdoliva
101
324
0
14 Mar 2017
Xception: Deep Learning with Depthwise Separable Convolutions
François Chollet
MDE
BDL
PINN
206
14,368
0
07 Oct 2016