ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.07669
  4. Cited By
Real-time Action Recognition with Enhanced Motion Vector CNNs

Real-time Action Recognition with Enhanced Motion Vector CNNs

26 April 2016
Bowen Zhang
Limin Wang
Zhe Wang
Yu Qiao
Hanli Wang
ArXiv (abs)PDFHTML

Papers citing "Real-time Action Recognition with Enhanced Motion Vector CNNs"

50 / 136 papers shown
Title
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Real-Time Manipulation Action Recognition with a Factorized Graph Sequence Encoder
Enes Erdogan
E. Aksoy
Sanem Sariel
113
0
0
15 Mar 2025
EdgeOAR: Real-time Online Action Recognition On Edge Devices
EdgeOAR: Real-time Online Action Recognition On Edge Devices
Wei Luo
Deyu Zhang
Ying Tang
Fan Wu
Yaoxue Zhang
104
0
0
02 Dec 2024
Flash-VStream: Memory-Based Real-Time Understanding for Long Video
  Streams
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
Haoji Zhang
Yiqin Wang
Yansong Tang
Yong-Jin Liu
Jiashi Feng
Jifeng Dai
Xiaojie Jin
98
45
0
12 Jun 2024
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a
  Hybrid Model
RNNs, CNNs and Transformers in Human Action Recognition: A Survey and a Hybrid Model
Khaled Alomar
Halil Ibrahim Aysel
Xiaohao Cai
MedImViT
81
9
0
02 Jun 2024
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data
Shufan Li
Harkanwar Singh
Aditya Grover
Mamba
173
64
0
08 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled
  Visual-Motional Tokenization
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Di Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
111
51
0
05 Feb 2024
Taylor Videos for Action Recognition
Taylor Videos for Action Recognition
Lei Wang
Xiuyuan Yuan
Tom Gedeon
Liang Zheng
76
8
0
05 Feb 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
161
1
0
15 Jan 2024
Generating Action-conditioned Prompts for Open-vocabulary Video Action
  Recognition
Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition
Chengyou Jia
Minnan Luo
Xiaojun Chang
Zhuohang Dang
Mingfei Han
Mengmeng Wang
Guangwen Dai
Sizhe Dang
Jingdong Wang
VLM
84
7
0
04 Dec 2023
F4D: Factorized 4D Convolutional Neural Network for Efficient
  Video-level Representation Learning
F4D: Factorized 4D Convolutional Neural Network for Efficient Video-level Representation Learning
Mohammad Al-Saad
Lakshmish Ramaswamy
S. Bhandarkar
AI4TS
38
1
0
28 Nov 2023
Training a Large Video Model on a Single Machine in a Day
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
94
17
0
28 Sep 2023
Local Compressed Video Stream Learning for Generic Event Boundary
  Detection
Local Compressed Video Stream Learning for Generic Event Boundary Detection
Libo Zhang
Xin Gu
Congcong Li
Tiejian Luo
Hengrui Fan
68
4
0
27 Sep 2023
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for
  Live Video Analytics with Cross-Camera Collaboration
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for Live Video Analytics with Cross-Camera Collaboration
Duo Wu
Dayou Zhang
Miao Zhang
Ruoyu Zhang
Fang Wang
Shuguang Cui
65
9
0
19 Aug 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
96
11
0
18 Jul 2023
SpotEM: Efficient Video Search for Episodic Memory
SpotEM: Efficient Video Search for Episodic Memory
Santhosh Kumar Ramakrishnan
Ziad Al-Halah
Kristen Grauman
VLM
98
9
0
28 Jun 2023
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Vasileios Magoulianitis
A. Psaltis
3DH3DPC
120
0
0
23 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGenDiffM
121
341
0
03 Jun 2023
CNN-Based Action Recognition and Pose Estimation for Classifying Animal
  Behavior from Videos: A Survey
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey
Michael Perez
Corey Toler-Franklin
MedIm
75
15
0
15 Jan 2023
PatchBlender: A Motion Prior for Video Transformers
PatchBlender: A Motion Prior for Video Transformers
Gabriele Prato
Yale Song
Janarthanan Rajendran
R. Devon Hjelm
Neel Joshi
Sarath Chandar
ViT
51
0
0
11 Nov 2022
Real-time Online Video Detection with Temporal Smoothing Transformers
Real-time Online Video Detection with Temporal Smoothing Transformers
Yue Zhao
Philipp Krahenbuhl
ViT
105
63
0
19 Sep 2022
Multi-Attention Network for Compressed Video Referring Object
  Segmentation
Multi-Attention Network for Compressed Video Referring Object Segmentation
Weidong Chen
Dexiang Hong
Yuankai Qi
Zhenjun Han
Shuhui Wang
Laiyun Qing
Qingming Huang
Guorong Li
VOS
55
40
0
26 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive
  Spatiotemporal Sampling
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
108
2
0
12 Jul 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for
  Skeleton-Based Action Recognition
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
79
268
0
27 Jun 2022
Scalable Temporal Localization of Sensitive Activities in Movies and TV
  Episodes
Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Xiang Hao
Jingxiang Chen
Shixing Chen
Ahmed Saad
Raffay Hamid
AI4TS
97
0
0
16 Jun 2022
Representation Learning for Compressed Video Action Recognition via
  Attentive Cross-modal Interaction with Motion Enhancement
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
Bing Li
Jiaxin Chen
Dongming Zhang
Xiuguo Bao
Di Huang
34
15
0
07 May 2022
Deformable Video Transformer
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
98
28
0
31 Mar 2022
End-to-End Compressed Video Representation Learning for Generic Event
  Boundary Detection
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Congcong Li
Xinyao Wang
Longyin Wen
Dexiang Hong
Tiejian Luo
Libo Zhang
63
17
0
29 Mar 2022
A Coding Framework and Benchmark towards Compressed Video Understanding
A Coding Framework and Benchmark towards Compressed Video Understanding
Yuan Tian
Guo Lu
Yichao Yan
Guangtao Zhai
Lixing Chen
Zhiyong Gao
70
25
0
06 Feb 2022
Capturing Temporal Information in a Single Frame: Channel Sampling
  Strategies for Action Recognition
Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition
Kiyoon Kim
Shreyank N. Gowda
Oisin Mac Aodha
Laura Sevilla-Lara
97
10
0
25 Jan 2022
Condensing a Sequence to One Informative Frame for Video Recognition
Condensing a Sequence to One Informative Frame for Video Recognition
Zhaofan Qiu
Ting Yao
Y. Shu
Chong-Wah Ngo
Tao Mei
139
9
0
11 Jan 2022
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
Zhaoqilin Yang
Gaoyun An
64
5
0
05 Dec 2021
KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal
  Action Localization
KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action Localization
Kalana Abeywardena
Shechem Sumanthiran
Sakuna Jayasundara
Sachira Karunasena
Ranga Rodrigo
P. Jayasekara
3DPC
83
1
0
05 Nov 2021
TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial
  Decoding
TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Zhengwei Wang
Qi She
A. Smolic
72
9
0
17 Oct 2021
Egocentric View Hand Action Recognition by Leveraging Hand Surface and
  Hand Grasp Type
Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type
Sangpil Kim
Jihyun Bae
Hyung-Gun Chi
Sunghee Hong
Byoung Soo Koh
K. Ramani
EgoV
39
0
0
08 Sep 2021
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions
  and U-GRUs for skeletal pedestrian crossing prediction
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
92
24
0
02 Sep 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action
  Recognition
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
101
78
0
20 Aug 2021
REGINA - Reasoning Graph Convolutional Networks in Human Action
  Recognition
REGINA - Reasoning Graph Convolutional Networks in Human Action Recognition
Bruno Degardin
Vasco Lopes
Hugo Proencca
3DHGNN
55
10
0
14 May 2021
HCMS: Hierarchical and Conditional Modality Selection for Efficient
  Video Recognition
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng
Zuxuan Wu
Hengduo Li
Jingjing Chen
Yu-Gang Jiang
71
4
0
20 Apr 2021
Self-supervised Video Representation Learning by Context and Motion
  Decoupling
Self-supervised Video Representation Learning by Context and Motion Decoupling
Lianghua Huang
Yu Liu
Bin Wang
Pan Pan
Yinghui Xu
Rong Jin
SSL
105
51
0
02 Apr 2021
Adaptive Configuration of In Situ Lossy Compression for Cosmology
  Simulations via Fine-Grained Rate-Quality Modeling
Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling
Sian Jin
Jesus Pulido
Pascal Grosset
Jiannan Tian
Dingwen Tao
J. Ahrens
78
23
0
01 Apr 2021
Unsupervised Motion Representation Enhanced Network for Action
  Recognition
Unsupervised Motion Representation Enhanced Network for Action Recognition
Xiaohang Yang
Lingtong Kong
Jie Yang
43
4
0
05 Mar 2021
Activity Graph Transformer for Temporal Action Localization
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal
Greg Mori
133
71
0
21 Jan 2021
Identity-aware Facial Expression Recognition in Compressed Video
Identity-aware Facial Expression Recognition in Compressed Video
Xiaofeng Liu
Linghao Jin
Xu Han
Jun Lu
J. You
Lingsheng Kong
CVBM
99
21
0
01 Jan 2021
Faster and Accurate Compressed Video Action Recognition Straight from
  the Frequency Domain
Faster and Accurate Compressed Video Action Recognition Straight from the Frequency Domain
Samuel Felipe dos Santos
Jurandy Almeida
53
16
0
26 Dec 2020
Human Action Recognition from Various Data Modalities: A Review
Human Action Recognition from Various Data Modalities: A Review
Zehua Sun
Qiuhong Ke
Hossein Rahmani
Mohammed Bennamoun
Gang Wang
Jun Liu
MU
170
534
0
22 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLMAI4TS
129
188
0
11 Dec 2020
Mutual Modality Learning for Video Action Classification
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
59
9
0
04 Nov 2020
Mutual Information Regularized Identity-aware Facial
  ExpressionRecognition in Compressed Video
Mutual Information Regularized Identity-aware Facial ExpressionRecognition in Compressed Video
Xiaofeng Liu
Linghao Jin
Xu Han
J. You
CVBM
84
26
0
20 Oct 2020
Deep Sequence Learning for Video Anticipation: From Discrete and
  Deterministic to Continuous and Stochastic
Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic
S. Aliakbarian
AI4TS
30
0
0
09 Oct 2020
PERF-Net: Pose Empowered RGB-Flow Net
PERF-Net: Pose Empowered RGB-Flow Net
Yinxiao Li
Zhichao Lu
Xuehan Xiong
Jonathan Huang
3DH
83
17
0
28 Sep 2020
123
Next