2007.03056
VPN: Learning Video-Pose Embedding for Activities of Daily Living
6 July 2020
Srijan Das
Saurav Sharma
Rui Dai
F. Brémond
Monique Thonnat
ViT
Papers citing "VPN: Learning Video-Pose Embedding for Activities of Daily Living"
18 / 18 papers shown
Title
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?
Jianyang Xie
Yitian Zhao
Y. Meng
He Zhao
Anh Nguyen
Yalin Zheng
14
0
0
15 May 2025
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
Ahmed Abdelkawy
Asem A. Ali
Aly A. Farag
3DPC
26
0
0
10 Aug 2024
On the Utility of 3D Hand Poses for Action Recognition
Md Salman Shamil
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
40
5
0
14 Mar 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
DVANet: Disentangling View and Action Features for Multi-View Action Recognition
Nyle Siddiqui
Praveen Tirupattur
Mubarak Shah
ViT
29
18
0
10 Dec 2023
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Muhammad Adi Nugroho
Changick Kim
30
0
0
21 Nov 2023
Vision-Based Human Pose Estimation via Deep Learning: A Survey
Gongjin Lan
Yuehua Wu
Fei Hu
Qi Hao
3DH
36
44
0
26 Aug 2023
Understanding Policy and Technical Aspects of AI-Enabled Smart Video Surveillance to Address Public Safety
B. R. Ardabili
Armin Danesh Pazho
Ghazal Alinezhad Noghre
Christopher Neff
Sai Datta Bhaskararayuni
Arun K. Ravindran
Shannon Reid
Hamed Tabkhi
19
23
0
08 Feb 2023
Cross-Modal Learning with 3D Deformable Attention for Action Recognition
Sangwon Kim
Dasom Ahn
ByoungChul Ko
ViT
3DPC
35
24
0
12 Dec 2022
STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn
Sangwon Kim
H. Hong
ByoungChul Ko
ViT
28
97
0
14 Oct 2022
Modality Mixer for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Yeonju Park
Muhammad Adi Nugroho
Changick Kim
26
10
0
24 Aug 2022
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization
Zdravko Marinov
Alina Roitberg
David Schneider
Rainer Stiefelhagen
24
4
0
19 Aug 2022
Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living
Zdravko Marinov
David Schneider
Alina Roitberg
Rainer Stiefelhagen
VGen
32
2
0
03 Aug 2022
Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework
Mehwish Ghafoor
Arif Mahmood
3DH
21
17
0
08 Mar 2022
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
37
17
0
07 Dec 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
21
47
0
19 Jul 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
F. Brémond
ViT
43
66
0
17 May 2021
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos
Di Yang
Rui Dai
Yaohui Wang
Rupayan Mallick
Luca Minciullo
Gianpiero Francesca
F. Brémond
31
16
0
10 Nov 2020