Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1406.2199
Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,275 papers shown
Title
Dual-path Adaptation from Image to Video Transformers
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
23
37
0
17 Mar 2023
Video Action Recognition with Attentive Semantic Units
Yifei Chen
Dapeng Chen
Ruijin Liu
Hao Li
Wei Peng
21
11
0
17 Mar 2023
Activity Recognition From Newborn Resuscitation Videos
Øyvind Meinich-Bache
Simon Lennart Austnes
K. Engan
Ivar Austvoll
T. Eftestøl
H. Myklebust
S. Kusulla
H. Kidanto
H. Ersdal
13
19
0
14 Mar 2023
Towards Diverse Temporal Grounding under Single Positive Labels
Hao Zhou
Chongyang Zhang
Yanjun Chen
Chuanping Hu
26
1
0
12 Mar 2023
Exploring Recurrent Long-term Temporal Fusion for Multi-view 3D Perception
Chunrui Han
Jinrong Yang
Jian‐Yuan Sun
Zheng Ge
Runpei Dong
Hongyu Zhou
Weixin Mao
Yuang Peng
Xiangyu Zhang
58
58
0
10 Mar 2023
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Yimeng Zhang
Xin Chen
Jinghan Jia
Sijia Liu
Ke Ding
23
25
0
09 Mar 2023
DiM: Distilling Dataset into Generative Model
Kai Wang
Jianyang Gu
Daquan Zhou
Zheng Hua Zhu
Wei Jiang
Yang You
DD
61
41
0
08 Mar 2023
Depression Detection Using Digital Traces on Social Media: A Knowledge-aware Deep Learning Approach
Wenli Zhang
Jiaheng Xie
Zhuocheng Zhang
Xiang Liu
29
9
0
06 Mar 2023
Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator
Qing-Huang Song
Yang Zhou
Mengjie Hu
Chun Liu
25
4
0
06 Mar 2023
Texture-Based Input Feature Selection for Action Recognition
Yalong Jiang
27
0
0
28 Feb 2023
Temporal Coherent Test-Time Optimization for Robust Video Classification
Chenyu Yi
Siyuan Yang
Yufei Wang
Haoliang Li
Yap-Peng Tan
Alex C. Kot
TTA
27
12
0
28 Feb 2023
Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks
Yuxuan Zhang
Qingzhong Wang
Jiang Bian
Yi Liu
Yanwu Xu
Dejing Dou
Haoyi Xiong
27
5
0
24 Feb 2023
Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning
S. A. Sheikh
Md. Sahidullah
F. Hirsch
Slim Ouni
29
16
0
21 Feb 2023
Medical Face Masks and Emotion Recognition from the Body: Insights from a Deep Learning Perspective
Nikolaos Kegkeroglou
P. Filntisis
Petros Maragos
CVBM
37
3
0
20 Feb 2023
Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer
N. H. Phong
B. Ribeiro
31
15
0
17 Feb 2023
Multimodal Subtask Graph Generation from Instructional Videos
Y. Jang
Sungryull Sohn
Lajanugen Logeswaran
Tiange Luo
Moontae Lee
Ho Hin Lee
28
10
0
17 Feb 2023
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
Minsu Kim
Hyungil Kim
Y. Ro
VLM
18
18
0
16 Feb 2023
Balanced Audiovisual Dataset for Imbalance Analysis
Wenke Xia
Xu Zhao
Xincheng Pang
Changqing Zhang
Di Hu
41
1
0
14 Feb 2023
CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection
C. Nwoye
Tong Yu
Saurav Sharma
Aditya Murali
Deepak Alapatt
...
Pietro Mascagni
B. Seeliger
Cristians Gonzalez
Didier Mutter
N. Padoy
32
17
0
13 Feb 2023
Revisiting Pre-training in Audio-Visual Learning
Ruoxuan Feng
Wenke Xia
Di Hu
39
1
0
07 Feb 2023
Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks
Leonard Hacker
Finn Bartels
Pierre-Etienne Martin
21
6
0
06 Feb 2023
Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms
Pierre-Etienne Martin
27
1
0
06 Feb 2023
Pyramid Self-attention Polymerization Learning for Semi-supervised Skeleton-based Action Recognition
Binqian Xu
Xiangbo Shu
35
42
0
05 Feb 2023
Action Capsules: Human Skeleton Action Recognition
Ali Farajzadeh Bavil
H. Damirchi
H. Taghirad
33
15
0
30 Jan 2023
Optical Flow Estimation in 360
∘
^\circ
∘
Videos: Dataset, Model and Application
Bin Duan
Keshav Bhandari
Gaowen Liu
Yan Yan
24
0
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
55
2
0
26 Jan 2023
Flow-guided Semi-supervised Video Object Segmentation
Yushan Zhang
Andreas Robinson
M. Magnusson
Michael Felsberg
VOS
24
1
0
25 Jan 2023
Zorro: the masked multimodal transformer
Adrià Recasens
Jason Lin
João Carreira
Drew Jaegle
Luyu Wang
...
Pauline Luc
Antoine Miech
Lucas Smaira
Ross Hemsley
Andrew Zisserman
39
20
0
23 Jan 2023
Improving Zero-Shot Action Recognition using Human Instruction with Text Description
Na Wu
Hiroshi Kera
K. Kawamoto
32
7
0
21 Jan 2023
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey
Michael Perez
Corey Toler-Franklin
MedIm
36
14
0
15 Jan 2023
Deep learning-based approaches for human motion decoding in smart walkers for rehabilitation
Carolina Gonçalves
J. Lopes
S. Moccia
Daniele Berardini
Lucia Migliorelli
C. Santos
19
10
0
13 Jan 2023
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou
Erik Chavez
S. Zafeiriou
ViT
29
50
0
12 Jan 2023
Triple-stream Deep Metric Learning of Great Ape Behavioural Actions
Otto Brookes
Majid Mirmehdi
H. Kühl
T. Burghardt
36
14
0
06 Jan 2023
EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding
Shuhan Tan
Tushar Nagarajan
Kristen Grauman
31
21
0
05 Jan 2023
Hierarchical Explanations for Video Action Recognition
Sadaf Gulshad
Teng Long
Nanne van Noord
FAtt
29
6
0
01 Jan 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
111
48
0
31 Dec 2022
Transformers in Action Recognition: A Review on Temporal Modeling
Elham Shabaninia
Hossein Nezamabadi-pour
Fatemeh Shafizadegan
ViT
29
8
0
29 Dec 2022
StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition
Xi Shen
Zhedong Zheng
Yi Yang
SLR
43
13
0
25 Dec 2022
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction
Chaofan Ling
Junpei Zhong
Wei-Hong Li
49
3
0
22 Dec 2022
Deep set conditioned latent representations for action recognition
Akash Singh
Tom De Schepper
Kevin Mets
P. Hellinckx
José Oramas
Steven Latré
BDL
27
2
0
21 Dec 2022
A Survey on Human Action Recognition
Zhou Shuchang
34
0
0
20 Dec 2022
Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization
Chen Ju
Kunhao Zheng
Jinxian Liu
Peisen Zhao
Ya Zhang
Jianlong Chang
Yanfeng Wang
Qi Tian
20
11
0
19 Dec 2022
Person Detection Using an Ultra Low-resolution Thermal Imager on a Low-cost MCU
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
Toon Goedemé
31
2
0
16 Dec 2022
Adversarially Robust Video Perception by Seeing Motion
Lingyu Zhang
Chengzhi Mao
Junfeng Yang
Carl Vondrick
VGen
AAML
49
2
0
13 Dec 2022
Contextual Explainable Video Representation: Human Perception-based Understanding
Khoa T. Vo
Kashu Yamazaki
Phong H. Nguyen
Pha Nguyen
Khoa Luu
Ngan Le
21
9
0
12 Dec 2022
Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition
Dawei Du
Ameya Shringi
A. Hoogs
Christopher Funk
21
2
0
12 Dec 2022
CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
Kevin Hyekang Joo
Khoa T. Vo
Kashu Yamazaki
Ngan Le
27
39
0
09 Dec 2022
Audiovisual Masked Autoencoders
Mariana-Iuliana Georgescu
Eduardo Fonseca
Radu Tudor Ionescu
Mario Lucic
Cordelia Schmid
Anurag Arnab
SSL
44
43
0
09 Dec 2022
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation
Jie Jiang
Zhimin Li
Jiangfeng Xiong
Rongwei Quan
Qinglin Lu
Wei Liu
38
2
0
09 Dec 2022
FLAG3D: A 3D Fitness Activity Dataset with Language Instruction
Yansong Tang
Jinpeng Liu
Aoyang Liu
B. Yang
Wen-Dao Dai
Yongming Rao
Jiwen Lu
Jie Zhou
Xiu Li
49
22
0
09 Dec 2022
Previous
1
2
3
...
6
7
8
...
44
45
46
Next