1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,289 papers shown
Title
Adaptive Hyper-Graph Convolution Network for Skeleton-based Human Action Recognition with Virtual Connections
Youwei Zhou
Tianyang Xu
Cong Wu
Xiaojun Wu
J. Kittler
3DH
179
0
0
22 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
189
1
0
18 Nov 2024
Weakly-Supervised Anomaly Detection in Surveillance Videos Based on Two-Stream I3D Convolution Network
Sareh Nejad
Anwar Haque
77
1
0
13 Nov 2024
Don't Look Twice: Faster Video Transformers with Run-Length Tokenization
Rohan Choudhury
Guanglei Zhu
Sihan Liu
Koichiro Niinuma
Kris M. Kitani
László A. Jeni
83
14
0
07 Nov 2024
Learning Video Representations without Natural Videos
Xueyang Yu
Xinlei Chen
Yossi Gandelsman
VGen
AI4TS
90
1
0
31 Oct 2024
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection
Fan Nie
Jiangqun Ni
Jian Zhang
Bin Zhang
Weizhe Zhang
DiffM
85
3
0
31 Oct 2024
SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity
Kaidi Wang
Jieru Zhao
Shuo Yang
Wenchao Ding
Minyi Guo
58
0
0
28 Oct 2024
That was not what I was aiming at! Differentiating human intent and outcome in a physically dynamic throwing task
Vidullan Surendran
Alan R. Wagner
33
0
0
26 Oct 2024
Enhancing SNN-based Spatio-Temporal Learning: A Benchmark Dataset and Cross-Modality Attention Model
Shibo Zhou
Bo Yang
Mengwen Yuan
Runhao Jiang
Rui Yan
Gang Pan
Huajin Tang
59
7
0
21 Oct 2024
LocoMotion: Learning Motion-Focused Video-Language Representations
Hazel Doughty
Fida Mohammad Thoker
Cees G. M. Snoek
115
2
0
15 Oct 2024
On-the-fly Modulation for Balanced Multimodal Learning
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
52
11
0
15 Oct 2024
The Ingredients for Robotic Diffusion Transformers
Sudeep Dasari
Oier Mees
Sebastian Zhao
Mohan Kumar Srirama
Sergey Levine
118
24
0
14 Oct 2024
Fourier-based Action Recognition for Wildlife Behavior Quantification with Event Cameras
Friedhelm Hamann
Suman Ghosh
Ignacio Juarez Martinez
Tom Hart
Alex Kacelnik
Guillermo Gallego
58
1
0
09 Oct 2024
Cefdet: Cognitive Effectiveness Network Based on Fuzzy Inference for Action Detection
Zhe Luo
Weina Fu
Shuai Liu
Saeed Anwar
Muhammad Saqib
Sambit Bakshi
Khan Muhammad
66
2
0
08 Oct 2024
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
Haibo Wang
Zhiyang Xu
Yu Cheng
Shizhe Diao
Yufan Zhou
Yixin Cao
Qifan Wang
Weifeng Ge
Lifu Huang
91
26
0
04 Oct 2024
LS-HAR: Language Supervised Human Action Recognition with Salient Fusion, Construction Sites as a Use-Case
Mohammad Mahdavian
Mohammad Loni
Mo Chen
68
0
0
02 Oct 2024
REST-HANDS: Rehabilitation with Egocentric Vision Using Smartglasses for Treatment of Hands after Surviving Stroke
Wiktor Mucha
Kentaro Tanaka
M. Kampel
82
0
0
30 Sep 2024
Egocentric zone-aware action recognition across environments
Simone Alberto Peirone
Gabriele Goletto
M. Planamente
A. Bottino
Barbara Caputo
Giuseppe Averta
EgoV
92
2
0
21 Sep 2024
BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow
EungGu Kang
Byeonghun Lee
Sunghoon Im
Kyong Hwan Jin
SupR
88
6
0
21 Sep 2024
Generating Event-oriented Attribution for Movies via Two-Stage Prefix-Enhanced Multimodal LLM
Yuanjie Lyu
Tong Xu
Zihan Niu
Bo Peng
Jing Ke
Enhong Chen
64
0
0
14 Sep 2024
2D bidirectional gated recurrent unit convolutional Neural networks for end-to-end violence detection In videos
Abdarahmane Traoré
M. Akhloufi
35
13
0
11 Sep 2024
Real-Time Human Action Recognition on Embedded Platforms
Ruiqi Wang
Zichen Wang
Peiqi Gao
Mingzhen Li
Jaehwan Jeong
Yihang Xu
Yejin Lee
Carolyn M. Baum
Lisa Connor
Chenyang Lu
96
3
0
09 Sep 2024
HMAFlow: Learning More Accurate Optical Flow via Hierarchical Motion Field Alignment
Dianbo Ma
Kousuke Imamura
Ziyan Gao
Xiangjie Wang
Satoshi Yamane
62
0
0
09 Sep 2024
Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment
Keyne Oei
Amr Gomaa
Anna Maria Feit
João Belo
101
0
0
06 Sep 2024
MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition
Mallika Garg
Debashis Ghosh
P. M. Pradhan
ViT
67
1
0
05 Sep 2024
Ig3D: Integrating 3D Face Representations in Facial Expression Inference
Lu Dong
Xiao Wang
S. Setlur
Venu Govindaraju
Ifeoma Nwogu
3DH
84
1
0
29 Aug 2024
MMASD+: A Novel Dataset for Privacy-Preserving Behavior Analysis of Children with Autism Spectrum Disorder
Pavan Uttej Ravva
Behdokht Kiafar
Pinar Kullu
Jicheng Li
Anjana Bhat
R. Barmaki
58
0
0
27 Aug 2024
Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification
Mahrukh Awan
Asmar Nadeem
Muhammad Junaid Awan
Armin Mustafa
Syed Sameed Husain
65
1
0
26 Aug 2024
HabitAction: A Video Dataset for Human Habitual Behavior Recognition
Hongwu Li
Zhenliang Zhang
Wei Wang
84
0
0
24 Aug 2024
TDS-CLIP: Temporal Difference Side Network for Efficient Video Action Recognition
Bin Wang
W. Li
Wenqian Wang
Mingliang Gao
Runmin Cong
Wei Emma Zhang
VLM
55
1
0
20 Aug 2024
Flatten: Video Action Recognition is an Image Classification task
Junlin Chen
Chengcheng Xu
Yangfan Xu
Jian Yang
Jun Yu Li
Zhiping Shi
70
1
0
17 Aug 2024
Weakly Supervised Video Anomaly Detection and Localization with Spatio-Temporal Prompts
Peng Wu
Xuerong Zhou
Guansong Pang
Zhiwei Yang
Qingsen Yan
Peng Wang
Yanning Zhang
143
13
0
12 Aug 2024
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
Jungpil Shin
Abu Saleh Musa Miah
Md. Humaun Kabir
M. Rahim
Abdullah Al Shiam
75
14
0
10 Aug 2024
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
Ahmed Abdelkawy
Asem A. Ali
Asem Ali
3DPC
111
0
0
10 Aug 2024
MU-MAE: Multimodal Masked Autoencoders-Based One-Shot Learning
Rex Liu
Xin Liu
100
2
0
08 Aug 2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation
Xin Liu
Chao Hao
Zitong Yu
Huanjing Yue
Jingyu Yang
65
1
0
05 Aug 2024
YOWOv3: An Efficient and Generalized Framework for Human Action Detection and Recognition
Duc Manh Nguyen Dang
Viet-Hang Duong
Jia Ching Wang
Nhan Bui Duc
59
3
0
05 Aug 2024
MPT-PAR: Mix-Parameters Transformer for Panoramic Activity Recognition
Wenqing Gan
Yaoyu Li
Jian Li
Zhangang Lin
ViT
85
1
0
01 Aug 2024
Segment Anything for Videos: A Systematic Survey
Chunhui Zhang
Yawen Cui
Weilin Lin
Guanjie Huang
Yan Rong
Li Liu
Shiguang Shan
VLM
86
8
0
31 Jul 2024
Is 3D Convolution with 5D Tensors Really Necessary for Video Analysis?
Habib Hajimolahoseini
Walid Ahmed
Austin Wen
Yang Liu
71
0
0
23 Jul 2024
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Mingze Xu
Mingfei Gao
Zhe Gan
Hong-You Chen
Zhengfeng Lai
Haiming Gang
Kai Kang
Afshin Dehghan
110
60
0
22 Jul 2024
Semi-Supervised Pipe Video Temporal Defect Interval Localization
Zhu Huang
Gang Pan
Chao Kang
Yaozhi Lv
57
0
0
21 Jul 2024
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
171
4
0
20 Jul 2024
MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images
Majedaldein Almahasneh
A. Paiement
Xianghua Xie
Jean Aboudarham
82
4
0
19 Jul 2024
Pose-guided multi-task video transformer for driver action recognition
Ricardo Pizarro
Roberto Valle
L. Bergasa
J. M. Buenaposada
Luis Baumela
ViT
68
0
0
18 Jul 2024
Improved Esophageal Varices Assessment from Non-Contrast CT Scans
Chunli Li
Xiaoming Zhang
Yuan Gao
Xiaoli Yin
Le Lu
Ling Zhang
Ke Yan
Yu Shi
82
0
0
18 Jul 2024
MaskVD: Region Masking for Efficient Video Object Detection
Sreetama Sarkar
Gourav Datta
Souvik Kundu
Kai Zheng
Chirayata Bhattacharyya
Peter A. Beerel
90
4
0
16 Jul 2024
Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding
Minghui Wu
Chenxu Zhao
Anyang Su
Donglin Di
Tianyu Fu
...
Min He
Ya Gao
Meng Ma
Kun Yan
Ping Wang
75
1
0
11 Jul 2024
Computer Vision for Clinical Gait Analysis: A Gait Abnormality Video Dataset
Rahm Ranjan
David Ahmedt-Aristizabal
M. Armin
Juno Kim
87
4
0
05 Jul 2024
Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Yijie Yang
Jinlu Zhang
Jiaxu Zhang
Zhigang Tu
81
5
0
26 Jun 2024