Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.04331
Cited By
Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition
8 November 2022
Hyeongju Choi
Apoorva Beedu
H. Haresamudram
Irfan Essa
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition"
19 / 19 papers shown
Title
Contrastive Learning with Cross-Modal Knowledge Mining for Multimodal Human Activity Recognition
Razvan Brinzea
Bulat Khaertdinov
S. Asteriadis
SSL
HAI
89
13
0
20 May 2022
PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation with Photometrically Challenging Objects
Pengyuan Wang
Hyunjun Jung
Yitong Li
Siyuan Shen
Rahul Parthasarathy Srikanth
Lorenzo Garattoni
Sven Meier
Nassir Navab
Benjamin Busam
44
48
0
18 May 2022
Omnivore: A Single Model for Many Visual Modalities
Rohit Girdhar
Mannat Singh
Nikhil Ravi
Laurens van der Maaten
Armand Joulin
Ishan Misra
271
237
0
20 Jan 2022
Cross-modal Knowledge Distillation for Vision-to-Sensor Action Recognition
Jianyuan Ni
Raunak Sarbajna
Yang Liu
A. Ngu
Yan Yan
HAI
153
37
0
08 Oct 2021
Contrastive Predictive Coding for Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
93
122
0
09 Dec 2020
HAMLET: A Hierarchical Multimodal Attention-based Human Activity Recognition Algorithm
Md. Mofijul Islam
Tariq Iqbal
51
81
0
03 Aug 2020
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation
Huaishao Luo
Lei Ji
Botian Shi
Haoyang Huang
Nan Duan
Tianrui Li
Jason Li
Xilin Chen
Ming Zhou
VLM
109
438
0
15 Feb 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
568
42,677
0
03 Dec 2019
DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
Chen Wang
Danfei Xu
Yuke Zhu
Roberto Martín-Martín
Cewu Lu
Li Fei-Fei
Silvio Savarese
MDE
109
958
0
15 Jan 2019
On Attention Models for Human Activity Recognition
Vishvak Murahari
T. Plötz
HAI
58
145
0
19 May 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
155
1,333
0
13 Dec 2017
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates
Jun Liu
Amir Shahroudy
Dong Xu
Alex C. Kot
G. Wang
78
456
0
26 Jun 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
811
132,725
0
12 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
240
8,045
0
22 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
270
3,817
0
19 May 2017
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
157
1,004
0
26 Nov 2016
Deep, Convolutional, and Recurrent Models for Human Activity Recognition using Wearables
Nils Y. Hammerla
Shane Halloran
T. Plötz
HAI
BDL
62
890
0
29 Apr 2016
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
238
5,512
0
03 May 2015
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
173
6,057
0
17 Nov 2014
1