1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,289 papers shown
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos
Jiang Bian
Xuhong Li
Tao Wang
Qingzhong Wang
Jun Huang
Chen Liu
Jun Zhao
Feixiang Lu
Dejing Dou
Haoyi Xiong
65
11
0
26 Jul 2022
Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning
A. Syed
Eman A. Aldhahri
M. Iqbal
Abid Ali
Ammar Muthanna
Harun Jamil
F. Jamil
3DH
34
2
0
23 Jul 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
70
27
0
22 Jul 2022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng
Zizheng Pan
Mingfei Han
Xiaojun Chang
Bohan Zhuang
ViT
76
25
0
21 Jul 2022
GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning
Huseyin Coskun
Alireza Zareian
Joshua L. Moore
F. Tombari
Chen Wang
SSL
103
3
0
20 Jul 2022
A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation
R. Rahaman
Dipika Singhania
Alexandre Hoang Thiery
Angela Yao
96
2
0
20 Jul 2022
Is an Object-Centric Video Representation Beneficial for Transfer?
Chuhan Zhang
Ankush Gupta
Andrew Zisserman
ViT
149
27
0
20 Jul 2022
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
62
10
0
20 Jul 2022
Learning Sequence Representations by Non-local Recurrent Neural Memory
Wenjie Pei
Xin Feng
Canmiao Fu
Qi Cao
Guangming Lu
Yu-Wing Tai
AI4TS
79
1
0
20 Jul 2022
Time Is MattEr: Temporal Self-supervision for Video Transformers
Sukmin Yun
Jaehyung Kim
Dongyoon Han
Hwanjun Song
Jung-Woo Ha
Jinwoo Shin
ViT
73
12
0
19 Jul 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
114
21
0
17 Jul 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
74
22
0
15 Jul 2022
Is Appearance Free Action Recognition Possible?
Filip Ilic
Thomas Pock
Richard P. Wildes
57
15
0
13 Jul 2022
Robotic Detection of a Human-Comprehensible Gestural Language for Underwater Multi-Human-Robot Collaboration
Sadman Sakib Enan
Michael Fulton
Junaed Sattar
68
8
0
12 Jul 2022
Trusted Multi-Scale Classification Framework for Whole Slide Image
Ming Feng
Kele Xu
Na Wu
Weiquan Huang
Yan Bai
Changjian Wang
Huaimin Wang
69
6
0
12 Jul 2022
Efficient Human Vision Inspired Action Recognition using Adaptive Spatiotemporal Sampling
Khoi-Nguyen C. Mac
Minh Do
Minh Vo
TTA
108
2
0
12 Jul 2022
Pixel-level Correspondence for Self-Supervised Learning from Video
Yash Sharma
Yi Zhu
Chris Russell
Thomas Brox
SSL
49
4
0
08 Jul 2022
GraphVid: It Only Takes a Few Nodes to Understand a Video
Eitan Kosman
Dotan Di Castro
GNN
94
5
0
04 Jul 2022
OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Ye Liu
Lingfeng Qiao
Di Yin
Zhuoxuan Jiang
Xinghua Jiang
Deqiang Jiang
Bo Ren
54
7
0
04 Jul 2022
Automated Classification of General Movements in Infants Using a Two-stream Spatiotemporal Fusion Network
Yuki Hashimoto
Akira Furui
K. Shimatani
M. Casadio
P. Moretti
P. Morasso
Toshio Tsuji
32
3
0
04 Jul 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
125
10
0
30 Jun 2022
A Comprehensive Survey on Deep Gait Recognition: Algorithms, Datasets and Challenges
Chuanfu Shen
Shiqi Yu
Jilong Wang
George Q. Huang
Liang Wang
CVBM
130
60
0
28 Jun 2022
Programmatic Concept Learning for Human Motion Description and Synthesis
Sumith Kulal
Jiayuan Mao
A. Aiken
Jiajun Wu
116
8
0
27 Jun 2022
Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition
Zhan Chen
Sicheng Li
Bing Yang
Qinghan Li
Hong Liu
79
268
0
27 Jun 2022
VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Kashu Yamazaki
Sang Truong
Khoa T. Vo
Michael Kidd
Chase Rainwater
Khoa Luu
Ngan Le
VLM
CoGe
65
26
0
26 Jun 2022
Review on Social Behavior Analysis of Laboratory Animals: From Methodologies to Applications
Ziping Jiang
Paul L. Chazot
Richard Jiang
132
1
0
25 Jun 2022
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
S. H. Khorasgani
Yuxuan Chen
Florian Shkurti
SSL
114
24
0
25 Jun 2022
Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
Kun Xia
Le Wang
Sanping Zhou
Nanning Zheng
Wei Tang
97
38
0
23 Jun 2022
Motion Gait: Gait Recognition via Motion Excitation
Yunpeng Zhang
Zhengyou Wang
Shanna Zhuang
Hui Wang
CVBM
48
1
0
22 Jun 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
77
6
0
21 Jun 2022
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation
Shuaicheng Li
Feng Zhang
Ruiwei Zhao
Rui Feng
Kunlin Yang
Lin-Na Liu
Jun Hou
ViT
82
5
0
21 Jun 2022
One-stage Action Detection Transformer
Lijun Li
Lian Zhuo
Bangyin Zhang
ViT
58
0
0
21 Jun 2022
M&M Mix: A Multimodal Multiview Transformer Ensemble
Xuehan Xiong
Anurag Arnab
Arsha Nagrani
Cordelia Schmid
ViT
70
20
0
20 Jun 2022
Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Xiang Hao
Jingxiang Chen
Shixing Chen
Ahmed Saad
Raffay Hamid
AI4TS
109
0
0
16 Jun 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
120
99
0
16 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
67
47
0
14 Jun 2022
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks
Shanghua Gao
Zhong-Yu Li
Qi Han
Ming-Ming Cheng
Liang Wang
104
35
0
14 Jun 2022
Real-time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators
Indhumathi Kandaswamy
Saurabh Farkya
Z. Daniels
G. V. D. Wal
Aswin Raghavan
...
Jun Hu
M. Lomnitz
M. Isnardi
David C. Zhang
M. Piacentino
BDL
55
4
0
10 Jun 2022
Dual Windows Are Significant: Learning from Mediastinal Window and Focusing on Lung Window
Qiuli Wang
Xin Tan
Chen Liu
67
0
0
08 Jun 2022
Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking
Longlong Jing
Ruichi Yu
Henrik Kretzschmar
Kang Li
C. Qi
...
Yingwei Li
Yurong You
Han Deng
Congcong Li
Drago Anguelov
3DPC
MDE
86
18
0
08 Jun 2022
A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information
M. Kowal
Mennatullah Siam
Md. Amirul Islam
Neil D. B. Bruce
Richard P. Wildes
Konstantinos G. Derpanis
70
26
0
06 Jun 2022
3D Convolutional with Attention for Action Recognition
Labina Shrestha
Shikha Dubey
Farrukh Olimov
M. Rafique
M. Jeon
38
0
0
05 Jun 2022
Revisiting the "Video" in Video-Language Understanding
S. Buch
Cristobal Eyzaguirre
Adrien Gaidon
Jiajun Wu
L. Fei-Fei
Juan Carlos Niebles
102
166
0
03 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
95
57
0
02 Jun 2022
Dual-stream spatiotemporal networks with feature sharing for monitoring animals in the home cage
Ezechukwu I. Nwokedi
R. Bains
L. Bidaut
Xujiong Ye
Sara Wells
James M. Brown
74
2
0
01 Jun 2022
IFRNet: Intermediate Feature Refine Network for Efficient Frame Interpolation
Lingtong Kong
Boyuan Jiang
Donghao Luo
Wenqing Chu
Xiaoming Huang
Ying Tai
Chengjie Wang
Jie Yang
136
155
0
29 May 2022
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences
Hehe Fan
Xin Yu
Yuhang Ding
Yi Yang
Mohan Kankanhalli
3DPC
190
113
0
27 May 2022
Cross-Architecture Self-supervised Video Representation Learning
Sheng Guo
Zihua Xiong
Yujie Zhong
Limin Wang
Xiaobo Guo
Bing Han
Weilin Huang
SSL
AI4TS
136
25
0
26 May 2022
Learning Muti-expert Distribution Calibration for Long-tailed Video Classification
Yufan Hu
Junyu Gao
Changsheng Xu
54
5
0
22 May 2022
Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy
Zhiqi Bu
Jialin Mao
Shiyun Xu
198
51
0
21 May 2022