1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,289 papers shown
Title
Action parsing using context features
N. Mehrseresht
24
0
0
20 May 2022
PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection
Guang-Hui Shi
Ruifeng Li
Chaoxiang Ma
3DPC
112
145
0
16 May 2022
Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation
Qiuping Jiang
Jiawu Xu
Yudong Mao
Wei Zhou
Xiongkuo Min
Guangtao Zhai
61
4
0
12 May 2022
Past and Future Motion Guided Network for Audio Visual Event Localization
Ting-Yen Chen
Jianqin Yin
Jin Tang
58
2
0
08 May 2022
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
Bing Li
Jiaxin Chen
Dongming Zhang
Xiuguo Bao
Di Huang
51
15
0
07 May 2022
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Mingdong Yang
Guo Chen
Yin-Dong Zheng
Tong Lu
Limin Wang
91
48
0
05 May 2022
Deep Neural Network approaches for Analysing Videos of Music Performances
F. Liwicki
Richa Upadhyay
Prakash Chandra Chhipa
Killian Murphy
F. Visi
S. Östersjö
Marcus Liwicki
52
1
0
05 May 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLM
CLIP
OffRL
334
1,314
0
04 May 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
72
0
0
03 May 2022
Exposing Deepfake Face Forgeries with Guided Residuals
Zhiqing Guo
Gaobo Yang
Jiyou Chen
Xingming Sun
CVBM
63
28
0
02 May 2022
Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition
Yang Zhou
Zhanhao He
Ke Lu
Guanhong Wang
Gaoang Wang
CLL
SLR
104
2
0
01 May 2022
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
B. Mudassar
Sho Ko
Maojingjing Li
Priyabrata Saha
Saibal Mukhopadhyay
111
2
0
30 Apr 2022
Self-supervised Contrastive Learning for Audio-Visual Action Recognition
Yang Liu
Y. Tan
Haoyu Lan
SSL
79
7
0
28 Apr 2022
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
Guanhong Wang
Ke Lu
Yang Zhou
Zhanhao He
Gaoang Wang
SSL
76
3
0
27 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
117
0
25 Apr 2022
Estimation of Reliable Proposal Quality for Temporal Action Detection
Junshan Hu
Chaoxu Guo
Liansheng Zhuang
Biao Wang
T. Ge
Yuning Jiang
Houqiang Li
85
4
0
25 Apr 2022
Trusted Multi-View Classification with Dynamic Evidential Fusion
Zongbo Han
Changqing Zhang
Huazhu Fu
Qiufeng Wang
EDL
93
235
0
25 Apr 2022
Keypoint based Sign Language Translation without Glosses
Youngmin Kim
Minji Kwak
Dain Lee
Yeongeun Kim
Hyeongboo Baek
SLR
46
7
0
22 Apr 2022
THORN: Temporal Human-Object Relation Network for Action Recognition
Mohammed Guermal
Rui Dai
Francois Bremond
EgoV
66
3
0
20 Apr 2022
A Survey of Video-based Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Qing Yu
Tao Suo
Zhan Sun
Ka Li
Lihua Zhang
64
19
0
20 Apr 2022
ActAR: Actor-Driven Pose Embeddings for Video Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
189
4
0
19 Apr 2022
3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
A. Zemmari
J. Morlier
65
5
0
13 Apr 2022
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
Jinglin Xu
Yongming Rao
Xumin Yu
Guangyi Chen
Jie Zhou
Jiwen Lu
84
97
0
07 Apr 2022
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
94
60
0
06 Apr 2022
DBF: Dynamic Belief Fusion for Combining Multiple Object Detectors
Hyungtae Lee
H. Kwon
FedML
36
18
0
06 Apr 2022
Detector-Free Weakly Supervised Group Activity Recognition
Dongkeun Kim
Jin S. Lee
Minsu Cho
Suha Kwak
ViT
77
44
0
05 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
138
106
0
04 Apr 2022
TALLFormer: Temporal Action Localization with a Long-memory Transformer
Feng Cheng
Gedas Bertasius
ViT
120
94
0
04 Apr 2022
A-ACT: Action Anticipation through Cycle Transformations
Akash Gupta
Jingen Liu
Liefeng Bo
Amit K. Roy-Chowdhury
Tao Mei
93
6
0
02 Apr 2022
Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition
Ryota Hashiguchi
Toru Tamaki
45
6
0
01 Apr 2022
DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow
Zihua Zheng
Ni Nie
Zhi Ling
Pengfei Xiong
Jiangyu Liu
Hongya Wang
Jiankun Li
MDE
70
46
0
01 Apr 2022
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition
Jun Kimata
Tomoya Nitta
Toru Tamaki
103
10
0
01 Apr 2022
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
98
28
0
31 Mar 2022
Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
Feng Cheng
Ming Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Li
Wei Xia
63
17
0
31 Mar 2022
StyleFool: Fooling Video Classification Systems via Style Transfer
Yu Cao
Xi Xiao
Ruoxi Sun
Derui Wang
Minhui Xue
Sheng Wen
AAML
129
26
0
30 Mar 2022
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
Shi Pu
Kaili Zhao
Mao Zheng
VLM
63
20
0
29 Mar 2022
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Congcong Li
Xinyao Wang
Longyin Wen
Dexiang Hong
Tiejian Luo
Libo Zhang
75
17
0
29 Mar 2022
Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning
Minghao Chen
Fangyun Wei
Chong Li
Deng Cai
AI4TS
105
35
0
28 Mar 2022
End-to-End Active Speaker Detection
Juan Carlos León Alcázar
M. Cordes
Chen Zhao
Guohao Li
107
28
0
27 Mar 2022
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
Muheng Li
Lei Chen
Yueqi Duan
Zhilan Hu
Jianjiang Feng
Jie Zhou
Jiwen Lu
79
76
0
26 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
252
1,222
0
23 Mar 2022
Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)
Yu Huang
Junyang Lin
Chang Zhou
Hongxia Yang
Longbo Huang
68
97
0
23 Mar 2022
Global Matching with Overlapping Attention for Optical Flow Estimation
Shiyu Zhao
Long Zhao
Zhixing Zhang
Enyu Zhou
Dimitris N. Metaxas
3DPC
131
79
0
21 Mar 2022
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video
Wenqi Jia
Miao Liu
James M. Rehg
EgoV
79
14
0
21 Mar 2022
No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces
Jianqi Zhong
Kaichen Zhou
Qingyong Hu
Bing Wang
Niki Trigoni
Andrew Markham
3DPC
101
23
0
21 Mar 2022
FAR: Fourier Aerial Video Recognition
D. Kothandaraman
Tianrui Guan
Xijun Wang
Sean Hu
Ming-Shun Lin
Tianyi Zhou
77
13
0
21 Mar 2022
PressureVision: Estimating Hand Pressure from a Single RGB Image
Patrick Grady
Chengcheng Tang
Samarth Brahmbhatt
Christopher D. Twigg
Chengde Wan
James Hays
Charles C. Kemp
3DH
83
20
0
19 Mar 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
132
51
0
19 Mar 2022
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
Khoa T. Vo
Kashu Yamazaki
Sang Truong
M. Tran
Akihiro Sugimoto
Ngan Le
EgoV
71
9
0
16 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
85
24
0
16 Mar 2022