1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing
"Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,289 papers shown
Title
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering
Yi Cheng
Hehe Fan
Dongyun Lin
Ying Sun
Mohan S. Kankanhalli
J. Lim
91
5
0
25 Jul 2023
On the Connection between Pre-training Data Diversity and Fine-tuning Robustness
Vivek Ramanujan
Thao Nguyen
Sewoong Oh
Ludwig Schmidt
Ali Farhadi
OOD
46
26
0
24 Jul 2023
Multi-Modal Machine Learning for Assessing Gaming Skills in Online Streaming: A Case Study with CS:GO
Longxiang Zhang
Wenping Wang
119
1
0
23 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
96
11
0
18 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
118
21
0
13 Jul 2023
Transformer-based end-to-end classification of variable-length volumetric data
Marzieh Oghbaie
Teresa Araújo
T. Emre
U. Schmidt-Erfurth
Hrvoje Bogunović
ViT
MedIm
67
4
0
13 Jul 2023
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
Amrithaa Seshadri
Alessandra Russo
87
0
0
09 Jul 2023
Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition
Fei-Yu Guo
Li Zhu
Yiwang Wang
Jing Sun
ViT
78
8
0
05 Jul 2023
Look, Remember and Reason: Grounded reasoning in videos with language models
Apratim Bhattacharyya
Sunny Panchal
Mingu Lee
Reza Pourreza
Pulkit Madan
Roland Memisevic
LRM
107
7
0
30 Jun 2023
Deep Equilibrium Multimodal Fusion
Jinhong Ni
Yalong Bai
Wei Zhang
Ting Yao
Tao Mei
88
1
0
29 Jun 2023
Differentially Private Video Activity Recognition
Zelun Luo
Yuliang Zou
Yijin Yang
Zane Durante
De-An Huang
Zhiding Yu
Chaowei Xiao
L. Fei-Fei
Anima Anandkumar
PICV
85
5
0
27 Jun 2023
Spiking Two-Stream Methods with Unsupervised STDP-based Learning for Action Recognition
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
74
3
0
23 Jun 2023
Variance-Covariance Regularization Improves Representation Learning
Jiachen Zhu
Katrina Evtimova
Yubei Chen
Ravid Shwartz-Ziv
Yann LeCun
SSL
90
7
0
23 Jun 2023
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Vasileios Magoulianitis
A. Psaltis
3DH
3DPC
120
0
0
23 Jun 2023
A Reliable and Interpretable Framework of Multi-view Learning for Liver Fibrosis Staging
Zheyao Gao
Yuanye Liu
Fuping Wu
N. Shi
Yuxin Shi
Xiahai Zhuang
EDL
38
12
0
21 Jun 2023
Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos
Md Zahid Hasan
Jiajing Chen
Jiyang Wang
Mohammed Shaiqur Rahman
Ameya Joshi
Senem Velipasalar
Chinmay Hegde
Anuj Sharma
Soumik Sarkar
VLM
124
20
0
16 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
79
4
0
15 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
67
7
0
13 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Zihui Xue
Kristen Grauman
EgoV
116
40
0
08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
62
4
0
07 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
69
10
0
06 Jun 2023
Human-Object Interaction Prediction in Videos through Gaze Following
Zhifan Ni
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
103
13
0
06 Jun 2023
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
58
15
0
31 May 2023
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
Sanjoy Kundu
Shubham Trehan
Sathyanarayanan N. Aakur
LRM
LM&Ro
71
3
0
26 May 2023
Action Sensitivity Learning for Temporal Action Localization
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
129
24
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Thanh-Dat Truong
Khoa Luu
EgoV
146
12
0
25 May 2023
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated Interactions
Ali Ayub
Zachary De Francesco
Patrick Holthaus
Chrystopher L. Nehaniv
Kerstin Dautenhahn
CLL
HAI
107
7
0
22 May 2023
Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains
Kunyu Peng
Di Wen
David Schneider
Jiaming Zhang
Kailun Yang
M. Sarfraz
Rainer Stiefelhagen
Alina Roitberg
74
2
0
15 May 2023
Is end-to-end learning enough for fitness activity recognition?
Antoine Mercier
Guillaume Berger
Sunny Panchal
Florian Letsch
Cornelius Boehm
Nahua Kang
Ingo Bax
Roland Memisevic
59
2
0
14 May 2023
Lightweight Delivery Detection on Doorbell Cameras
Pirazh Khorramshahi
Zhe Wu
Tianchen Wang
Luke Deluccia
Hongcheng Wang
61
0
0
13 May 2023
Active Semantic Localization with Graph Neural Embedding
Mitsuki Yoshida
Kanji Tanaka
Ryo Yamamoto
Daiki Iwata
43
1
0
10 May 2023
Group Activity Recognition via Dynamic Composition and Interaction
Youliang Zhang
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
65
0
0
09 May 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
78
0
0
07 May 2023
ItoV: Efficiently Adapting Deep Learning-based Image Watermarking to Video Watermarking
Guanhui Ye
Jiashi Gao
Yuchen Wang
Liyan Song
Xue-Ming Wei
69
4
0
04 May 2023
Weakly-supervised Micro- and Macro-expression Spotting Based on Multi-level Consistency
Wang-Wang Yu
Kai-Fu Yang
Hong-Mei Yan
Yong-Jie Li
70
2
0
04 May 2023
Local and Global Contextual Features Fusion for Pedestrian Intention Prediction
Mohsen Azarmi
Mahdi Rezaei
Tanveer Hussain
Chenghao Qian
88
8
0
01 May 2023
Physical Adversarial Attacks for Surveillance: A Survey
Kien Nguyen Thanh
Tharindu Fernando
Clinton Fookes
Sridha Sridharan
AAML
101
8
0
01 May 2023
Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint
Guozhang Li
De Cheng
Xinpeng Ding
N. Wang
Jie Li
Xinbo Gao
77
7
0
25 Apr 2023
MRSN: Multi-Relation Support Network for Video Action Detection
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
135
8
0
24 Apr 2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition
S. Tu
Qi Dai
Zuxuan Wu
Zhi-Qi Cheng
Hang-Rui Hu
Yu-Gang Jiang
109
37
0
20 Apr 2023
Search-Map-Search: A Frame Selection Paradigm for Action Recognition
Mingjun Zhao
Yu
Xiaoli Wang
Lei Yang
Di Niu
61
6
0
20 Apr 2023
Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis
Mindi Ruan
Xiang Yu
Naifeng Zhang
Chuanbo Hu
Shuo Wang
Xin Li
94
8
0
20 Apr 2023
Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization
Siyuan Yang
Jun Liu
Shijian Lu
Er Meng Hwa
Yongjian Hu
Alex C. Kot
3DPC
3DH
72
18
0
18 Apr 2023
Multimodal Short Video Rumor Detection System Based on Contrastive Learning
Yuxing Yang
Junhao Zhao
Siyi Wang
Xiangyu Min
Peifeng Wang
Haizhou Wang
36
2
0
17 Apr 2023
Unsupervised Learning Optical Flow in Multi-frame Dynamic Environment Using Temporal Dynamic Modeling
Zitang Sun
Shin'ya Nishida
Zhengbo Luo
45
1
0
14 Apr 2023
Explaining, Analyzing, and Probing Representations of Self-Supervised Learning Models for Sensor-based Human Activity Recognition
Bulat Khaertdinov
S. Asteriadis
55
3
0
14 Apr 2023
PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition
Ruiqi Xian
Xijun Wang
D. Kothandaraman
Tianyi Zhou
58
7
0
14 Apr 2023
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos
Qi Zhao
M. Salman Asif
Zhan Ma
76
34
0
13 Apr 2023
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
Jan Held
A. Cioppa
Silvio Giancola
Abdullah Hamdi
Guohao Li
Marc Van Droogenbroeck
78
32
0
10 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
111
34
0
03 Apr 2023