ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.05311
  4. Cited By
MMAD: Multi-label Micro-Action Detection in Videos
v1v2 (latest)

MMAD: Multi-label Micro-Action Detection in Videos

7 July 2024
Kun Li
Pengyu Liu
Pengyu Liu
Guoliang Chen
Zhiliang Wu
Hehe Fan
Meng Wang
ArXiv (abs)PDFHTML

Papers citing "MMAD: Multi-label Micro-Action Detection in Videos"

47 / 47 papers shown
Title
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection
Shuming Liu
Chen Zhao
Fatimah Zohra
Mattia Soldan
Alejandro Pardo
...
Juan Carlos León Alcázar
A. Cioppa
Silvio Giancola
Carlos Hinojosa
Bernard Ghanem
112
3
0
27 Feb 2025
Prototype Learning for Micro-gesture Classification
Prototype Learning for Micro-gesture Classification
Guoliang Chen
Fei Wang
Kun Li
Zhiliang Wu
Hehe Fan
Yi Yang
Ming Wang
Dan Guo
105
5
0
06 Aug 2024
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Le Yang
Ziwei Zheng
Yizeng Han
Hao-Ran Cheng
Shiji Song
Gao Huang
Fan Li
104
11
0
03 Jul 2024
Benchmarking Micro-action Recognition: Dataset, Methods, and
  Applications
Benchmarking Micro-action Recognition: Dataset, Methods, and Applications
Dan Guo
Kun Li
Bin Hu
Yan Zhang
Meng Wang
123
44
0
08 Mar 2024
End-to-End Temporal Action Detection with 1B Parameters Across 1000
  Frames
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
Shuming Liu
Chen-Da Liu-Zhang
Chen Zhao
Guohao Li
121
29
0
28 Nov 2023
Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus
  Speech Emotion Recognition
Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition
Jiaxin Ye
Yujie Wei
Xin-Cheng Wen
Chenglong Ma
Zhizhong Huang
Kunhong Liu
Hongming Shan
114
2
0
04 Aug 2023
Data Augmentation for Human Behavior Analysis in Multi-Person
  Conversations
Data Augmentation for Human Behavior Analysis in Multi-Person Conversations
Kun Li
Dan Guo
Guoliang Chen
Feiyang Liu
Meng Wang
ViT
55
11
0
03 Aug 2023
Joint Skeletal and Semantic Embedding Loss for Micro-gesture
  Classification
Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification
Kun Li
Dan Guo
Guoliang Chen
Xin-lin Peng
Meng Wang
78
11
0
20 Jul 2023
TemporalMaxer: Maximize Temporal Context with only Max Pooling for
  Temporal Action Localization
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
Tuan N. Tang
Kwonyoung Kim
Kwanghoon Sohn
111
30
0
16 Mar 2023
TriDet: Temporal Action Detection with Relative Boundary Modeling
TriDet: Temporal Action Detection with Relative Boundary Modeling
Ding Shi
Yujie Zhong
Qiong Cao
Lin Ma
Jia Li
Dacheng Tao
ViT
116
134
0
13 Mar 2023
Uncertain Facial Expression Recognition via Multi-task Assisted
  Correction
Uncertain Facial Expression Recognition via Multi-task Assisted Correction
Yang Liu
Xingming Zhang
Janne Kauttonen
Guoying Zhao
CVBM
98
22
0
14 Dec 2022
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal
  Action Localization
Re^2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
Chen Zhao
Shuming Liu
K. Mangalam
Guohao Li
97
17
0
25 Nov 2022
PointTAD: Multi-Label Temporal Action Detection with Learnable Query
  Points
PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
Jing Tan
Xiaotong Zhao
Xintian Shi
Bingyi Kang
Limin Wang
118
26
0
20 Oct 2022
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action
  Recognition
Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition
Tianjiao Li
Lin Geng Foo
Qiuhong Ke
Hossein Rahmani
Anran Wang
Jinghua Wang
Jing Liu
81
23
0
03 Sep 2022
Bodily Behaviors in Social Interaction: Novel Annotations and
  State-of-the-Art Evaluation
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
Michal Balazia
Philippe Muller
Ákos Levente Tánczos
A. V. Liechtenstein
Franccois Brémond
85
22
0
26 Jul 2022
An Empirical Study of End-to-End Temporal Action Detection
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
94
60
0
06 Apr 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for
  Self-Supervised Video Pre-Training
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
252
1,222
0
23 Mar 2022
ActionFormer: Localizing Moments of Actions with Transformers
ActionFormer: Localizing Moments of Actions with Transformers
Chen-Da Liu-Zhang
Jianxin Wu
Yin Li
ViT
126
342
0
16 Feb 2022
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
Rui Dai
Srijan Das
Kumara Kahatapitiya
Michael S. Ryoo
Francois Bremond
ViT
107
73
0
07 Dec 2021
Learning from Temporal Gradient for Semi-supervised Action Recognition
Learning from Temporal Gradient for Semi-supervised Action Recognition
Junfei Xiao
Longlong Jing
Lin Zhang
Ju He
Qi She
Zongwei Zhou
Alan Yuille
Yingwei Li
89
53
0
25 Nov 2021
Few-shot Learning in Emotion Recognition of Spontaneous Speech Using a
  Siamese Neural Network with Adaptive Sample Pair Formation
Few-shot Learning in Emotion Recognition of Spontaneous Speech Using a Siamese Neural Network with Adaptive Sample Pair Formation
Kexin Feng
Theodora Chaspari
63
26
0
07 Sep 2021
RGB Stream Is Enough for Temporal Action Detection
RGB Stream Is Enough for Temporal Action Detection
Chenhao Wang
Hongxiang Cai
Yuxin Zou
Yichao Xiong
76
25
0
09 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding
  and Emotion Analysis
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Xin Liu
Henglin Shi
Haoyu Chen
Zitong Yu
Xiaobai Li
Guoying Zhao
86
83
0
01 Jul 2021
Video Swin Transformer
Video Swin Transformer
Ze Liu
Jia Ning
Yue Cao
Yixuan Wei
Zheng Zhang
Stephen Lin
Han Hu
ViT
127
1,503
0
24 Jun 2021
End-to-end Temporal Action Detection with Transformer
End-to-end Temporal Action Detection with Transformer
Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
S. Bai
X. Bai
ViT
124
234
0
18 Jun 2021
FineAction: A Fine-Grained Video Dataset for Temporal Action
  Localization
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
Yi Liu
Limin Wang
Yali Wang
Xiao Ma
Yu Qiao
102
62
0
24 May 2021
Learning Salient Boundary Feature for Anchor-free Temporal Action
  Localization
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
Chuming Lin
C. Xu
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Yanwei Fu
91
256
0
24 Mar 2021
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT3DVPINN
530
13,239
0
26 May 2020
FineGym: A Hierarchical Video Dataset for Fine-grained Action
  Understanding
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
Dian Shao
Yue Zhao
Bo Dai
Dahua Lin
83
331
0
14 Apr 2020
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action
  Recognition
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition
Ziyu Liu
Hongwen Zhang
Zhenghao Chen
Zhiyong Wang
Wanli Ouyang
156
838
0
31 Mar 2020
Emotion Recognition From Gait Analyses: Current Research and Future
  Directions
Emotion Recognition From Gait Analyses: Current Research and Future Directions
Shihao Xu
Jing Fang
Xiping Hu
Edith C. H. Ngai
Wei Wang
Yi Guo
Victor C. M. Leung
CVBM
66
35
0
13 Mar 2020
G-TAD: Sub-Graph Localization for Temporal Action Detection
G-TAD: Sub-Graph Localization for Temporal Action Detection
Mengmeng Xu
Chen Zhao
D. Rojas
Ali K. Thabet
Guohao Li
149
438
0
26 Nov 2019
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
Fast Learning of Temporal Action Proposal via Dense Boundary Generator
Chuming Lin
Jian Li
Yabiao Wang
Ying Tai
Donghao Luo
Zhipeng Cui
Chengjie Wang
Jilin Li
Feiyue Huang
Rongrong Ji
92
215
0
11 Nov 2019
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
Tianwei Lin
Xiao-Chang Liu
Xin Li
Errui Ding
Shilei Wen
168
610
0
23 Jul 2019
A Short Note on the Kinetics-700 Human Action Dataset
A Short Note on the Kinetics-700 Human Action Dataset
João Carreira
Eric Noland
Chloe Hillier
Andrew Zisserman
102
458
0
15 Jul 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
246
4,558
0
02 Feb 2019
SlowFast Networks for Video Recognition
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
221
3,302
0
10 Dec 2018
TSM: Temporal Shift Module for Efficient Video Understanding
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
146
1,699
0
20 Nov 2018
Diagnosing Error in Temporal Action Detectors
Diagnosing Error in Temporal Action Detectors
Humam Alwassel
Fabian Caba Heilbron
Victor Escorcia
Guohao Li
181
106
0
27 Jul 2018
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
Tianwei Lin
Xu Zhao
Haisheng Su
Chongjing Wang
Ming Yang
228
708
0
08 Jun 2018
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action
  Recognition
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Sijie Yan
Yuanjun Xiong
Dahua Lin
GNN
271
4,218
0
23 Jan 2018
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
406
8,072
0
22 May 2017
Temporal Segment Networks for Action Recognition in Videos
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
133
817
0
08 May 2017
Gaussian Error Linear Units (GELUs)
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
196
5,074
0
27 Jun 2016
Every Moment Counts: Dense Detailed Labeling of Actions in Complex
  Videos
Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
Serena Yeung
Olga Russakovsky
Ning Jin
Mykhaylo Andriluka
Greg Mori
Li Fei-Fei
VLM
115
441
0
21 Jul 2015
Recognizing Fine-Grained and Composite Activities using Hand-Centric
  Features and Script Data
Recognizing Fine-Grained and Composite Activities using Hand-Centric Features and Script Data
Marcus Rohrbach
Anna Rohrbach
Michaela Regneri
S. Amin
Mykhaylo Andriluka
Manfred Pinkal
Bernt Schiele
111
179
0
23 Feb 2015
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIPVGen
236
6,190
0
03 Dec 2012
1