1604.06573
Convolutional Two-Stream Network Fusion for Video Action Recognition
22 April 2016
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
Papers citing
"Convolutional Two-Stream Network Fusion for Video Action Recognition"
50 / 853 papers shown
Title
Classification of Tennis Actions Using Deep Learning
Emil Hovad
Therese Hougaard-Jensen
L. H. Clemmensen
21
5
0
04 Feb 2024
Active Generation Network of Human Skeleton for Action Recognition
Longzhao Liu
Xin Wang
Fangming Li
Jiayu Chen
GAN
48
1
0
30 Jan 2024
One-Shot Multi-Rate Pruning of Graph Convolutional Networks
H. Sahbi
33
0
0
29 Dec 2023
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Ping Luo
Jiebo Luo
Chenliang Xu
VLM
59
84
0
29 Dec 2023
Appearance-based Refinement for Object-Centric Motion Segmentation
Junyu Xie
Weidi Xie
Andrew Zisserman
VOS
38
3
0
18 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
32
37
0
07 Dec 2023
Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living
Dominick Reilly
Srijan Das
ViT
38
17
0
30 Nov 2023
CAST: Cross-Attention in Space and Time for Video Action Recognition
Dongho Lee
Jongseo Lee
Jinwoo Choi
EgoV
35
12
0
30 Nov 2023
Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation
A. Dasgupta
C. V. Jawahar
Karteek Alahari
TTA
VLM
24
10
0
30 Nov 2023
GeoDeformer: Geometric Deformable Transformer for Action Recognition
Jinhui Ye
Jiaming Zhou
Hui Xiong
Junwei Liang
ViT
23
1
0
29 Nov 2023
Generative Hierarchical Temporal Transformer for Hand Action Recognition and Motion Prediction
Yilin Wen
Hao Pan
Takehiko Ohkawa
Lei Yang
Jia Pan
Yoichi Sato
Taku Komura
Wenping Wang
44
0
0
29 Nov 2023
Object-based (yet Class-agnostic) Video Domain Adaptation
Dantong Niu
Amir Bar
Roei Herzig
Trevor Darrell
Anna Rohrbach
43
1
0
29 Nov 2023
Riemannian Self-Attention Mechanism for SPD Networks
Rui Wang
Xiao-Jun Wu
Hui Li
Josef Kittler
26
1
0
28 Nov 2023
Modality Mixer Exploiting Complementary Information for Multi-modal Action Recognition
Sumin Lee
Sangmin Woo
Muhammad Adi Nugroho
Changick Kim
30
0
0
21 Nov 2023
On the Behavior of Audio-Visual Fusion Architectures in Identity Verification Tasks
Daniel Claborne
Eric Slyman
Karl Pazdernik
20
0
0
09 Nov 2023
P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification
Abid Ali
Ashish Marisetty
François Brémond
35
6
0
04 Nov 2023
Beyond still images: Temporal features and input variance resilience
AmirHosein Fadaei
M. Dehaqani
43
0
0
01 Nov 2023
Flow Dynamics Correction for Action Recognition
Lei Wang
Piotr Koniusz
21
10
0
16 Oct 2023
What Makes for Robust Multi-Modal Models in the Face of Missing Modalities?
Siting Li
Chenzhuang Du
Yue Zhao
Yu Huang
Hang Zhao
24
4
0
10 Oct 2023
Improving Discriminative Multi-Modal Learning with Large-Scale Pre-Trained Models
Chenzhuang Du
Yue Zhao
Chonghua Liao
Jiacheng You
Jie Fu
Hang Zhao
47
2
0
08 Oct 2023
A Spatio-Temporal Attention-Based Method for Detecting Student Classroom Behaviors
Fan Yang
35
2
0
04 Oct 2023
Local Compressed Video Stream Learning for Generic Event Boundary Detection
Libo Zhang
Xin Gu
Congcong Li
Tiejian Luo
Hengrui Fan
23
3
0
27 Sep 2023
Selective Volume Mixup for Video Action Recognition
Yi Tan
Zhaofan Qiu
Y. Hao
Ting Yao
Xiangnan He
Tao Mei
ViT
35
2
0
18 Sep 2023
TransNet: A Transfer Learning-Based Network for Human Action Recognition
Khaled Alomar
Xiaohao Cai
40
1
0
13 Sep 2023
Evaluation of Key Spatiotemporal Learners for Print Track Anomaly Classification Using Melt Pool Image Streams
Lynn Cherif
Mutahar Safdar
Guy Lamouche
P. Wanjara
P. Paul
G. Wood
Max Zimmermann
F. Hannesen
Yao Zhao
26
1
0
28 Aug 2023
LAC: Latent Action Composition for Skeleton-based Action Segmentation
Di Yang
Yaohui Wang
A. Dantcheva
Quan Kong
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
42
9
0
28 Aug 2023
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition
Xi Li
Songhe Wang
Rui Huang
Mahanth K. Gowda
G. Kesidis
AAML
41
6
0
21 Aug 2023
Unlimited Knowledge Distillation for Action Recognition in the Dark
Ruibing Jin
Guosheng Lin
Min-man Wu
Jie Lin
Zhengguo Li
Xiaoli Li
Zhenghua Chen
16
1
0
18 Aug 2023
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
Giacomo Zara
Alessandro Conti
Subhankar Roy
Stéphane Lathuilière
Paolo Rota
Elisa Ricci
33
11
0
17 Aug 2023
JEDI: Joint Expert Distillation in a Semi-Supervised Multi-Dataset Student-Teacher Scenario for Video Action Recognition
L. Bicsi
B. Alexe
Radu Tudor Ionescu
Marius Leordeanu
22
2
0
09 Aug 2023
SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos
S. Bose
Saikat Sarkar
A. Chakrabarti
34
1
0
15 Jul 2023
Miniaturized Graph Convolutional Networks with Topologically Consistent Pruning
H. Sahbi
28
0
0
30 Jun 2023
Spiking Two-Stream Methods with Unsupervised STDP-based Learning for Action Recognition
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
35
3
0
23 Jun 2023
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Vasileios Magoulianitis
A. Psaltis
3DH
3DPC
30
0
0
23 Jun 2023
A Reliable and Interpretable Framework of Multi-view Learning for Liver Fibrosis Staging
Zheyao Gao
Yuanye Liu
Fuping Wu
N. Shi
Yuxin Shi
Xiahai Zhuang
EDL
24
11
0
21 Jun 2023
Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos
Md Zahid Hasan
Jiajing Chen
Jiyang Wang
Mohammed Shaiqur Rahman
Ameya Joshi
Senem Velipasalar
C. Hegde
Anuj Sharma
S. Sarkar
VLM
52
18
0
16 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Aman Chadha
Srijan Das
ViT
33
4
0
15 Jun 2023
MMASD: A Multimodal Dataset for Autism Intervention Analysis
Jicheng Li
Vuthea Chheang
Pinar Kullu
Eli Brignac
Zhang Guo
Kenneth Barner
Anjana Bhat
R. Barmaki
20
12
0
14 Jun 2023
Boosting Breast Ultrasound Video Classification by the Guidance of Keyframe Feature Centers
AnLan Sun
Zhao Zhang
Meng Lei
Yuting Dai
Dong Wang
Liwei Wang
34
5
0
12 Jun 2023
A Multi-Modal Transformer Network for Action Detection
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
43
15
0
31 May 2023
Budget-Aware Graph Convolutional Network Design using Probabilistic Magnitude Pruning
H. Sahbi
21
0
0
30 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Thanh-Dat Truong
Khoa Luu
EgoV
46
10
0
25 May 2023
Learning Higher-order Object Interactions for Keypoint-based Video Understanding
Yi Huang
Asim Kadav
Farley Lai
Deep Patel
H. Graf
20
1
0
16 May 2023
Is end-to-end learning enough for fitness activity recognition?
Antoine Mercier
Guillaume Berger
Sunny Panchal
Florian Letsch
Cornelius Boehm
Nahua Kang
Ingo Bax
Roland Memisevic
28
2
0
14 May 2023
MMG-Ego4D: Multi-Modal Generalization in Egocentric Action Recognition
Xinyu Gong
S. Mohan
Naina Dhingra
Jean-Charles Bazin
Yilei Li
Zhangyang Wang
Rakesh Ranjan
EgoV
56
18
0
12 May 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
34
0
0
07 May 2023
On Uni-Modal Feature Learning in Supervised Multi-Modal Learning
Chenzhuang Du
Jiaye Teng
Tingle Li
Yichen Liu
Tianyuan Yuan
Yue Wang
Yang Yuan
Hang Zhao
89
40
0
02 May 2023
Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint
Guozhang Li
De Cheng
Xinpeng Ding
N. Wang
Jie Li
Xinbo Gao
25
6
0
25 Apr 2023
Search-Map-Search: A Frame Selection Paradigm for Action Recognition
Mingjun Zhao
Yu
Xiaoli Wang
Lei Yang
Di Niu
26
5
0
20 Apr 2023
Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis
Mindi Ruan
Xiang Yu
Naifeng Zhang
Chuanbo Hu
Shuo Wang
Xin Li
36
8
0
20 Apr 2023