Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1604.06573
Cited By
Convolutional Two-Stream Network Fusion for Video Action Recognition
22 April 2016
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Convolutional Two-Stream Network Fusion for Video Action Recognition"
50 / 853 papers shown
Title
Object-Region Video Transformers
Roei Herzig
Elad Ben-Avraham
K. Mangalam
Amir Bar
Gal Chechik
Anna Rohrbach
Trevor Darrell
Amir Globerson
ViT
30
82
0
13 Oct 2021
Early Melanoma Diagnosis with Sequential Dermoscopic Images
Zhen Yu
Jennifer Nguyen
Toàn D. Nguyên
J. Kelly
C. Mclean
Paul Bonnington
Lei Zhang
Victoria Mar
Z. Ge
27
41
0
12 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
46
12
0
12 Oct 2021
Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition
M. C. Leong
Hui Li Tan
Haosong Zhang
Liyuan Li
Feng Lin
J. Lim
40
10
0
12 Oct 2021
A Multi-viewpoint Outdoor Dataset for Human Action Recognition
Asanka G. Perera
Yee Wei Law
T. Ogunwa
J. Chahl
20
40
0
07 Oct 2021
Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction
Rishubh Parihar
Gaurav Ramola
Ranajit Saha
Raviprasad Kini
Aniket Rege
S. Velusamy
36
1
0
03 Oct 2021
Turning old models fashion again: Recycling classical CNN networks using the Lattice Transformation
Ana Paula G. S. de Almeida
Flávio de Barros Vidal
11
0
0
28 Sep 2021
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
Ji Lin
Chuang Gan
Kuan-Chieh Jackson Wang
Song Han
40
64
0
27 Sep 2021
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
65
632
0
24 Sep 2021
V-SlowFast Network for Efficient Visual Sound Separation
Lingyu Zhu
Esa Rahtu
52
10
0
18 Sep 2021
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
152
362
0
17 Sep 2021
PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
Nupur Thakur
Baoxin Li
AAML
31
2
0
13 Sep 2021
Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type
Sangpil Kim
Jihyun Bae
Hyung-Gun Chi
Sunghee Hong
Byoung Soo Koh
K. Ramani
EgoV
24
0
0
08 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
39
20
0
02 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
15
3
0
30 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition
Ning Wang
Guangming Zhu
Liang Zhang
Peiyi Shen
Hongsheng Li
Cong Hua
33
27
0
19 Aug 2021
Self-Supervised Video Representation Learning with Meta-Contrastive Network
Yuanze Lin
Xun Guo
Yan Lu
SSL
27
41
0
19 Aug 2021
Channel-Temporal Attention for First-Person Video Domain Adaptation
Xianyuan Liu
Shuo Zhou
Tao Lei
Haiping Lu
EgoV
21
0
0
17 Aug 2021
Temporal Action Localization Using Gated Recurrent Units
Hassan Keshvari Khojasteh
Hoda Mohammadzade
H. Behroozi
23
3
0
07 Aug 2021
Feature-Supervised Action Modality Transfer
Fida Mohammad Thoker
Cees G. M. Snoek
26
1
0
06 Aug 2021
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework
Miao Yin
Yang Sui
Siyu Liao
Bo Yuan
28
79
0
26 Jul 2021
Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian
Guo Lu
Xiongkuo Min
Zhaohui Che
Guangtao Zhai
G. Guo
Zhiyong Gao
33
26
0
24 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
37
41
0
22 Jul 2021
From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Mengcheng Lan
Shuliang Ning
Yanran Li
Qian Chen
Xunlai Chen
Xiaoguang Han
Shuguang Cui
21
0
0
21 Jul 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
27
47
0
19 Jul 2021
Agent-Environment Network for Temporal Action Proposal Generation
Viet-Khoa Vo-Ho
Ngan Le
Kashu Yamazaki
Akihiro Sugimoto
Minh-Triet Tran
EgoV
19
9
0
17 Jul 2021
Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field
Stanislav Lukyanenko
Won-Dong Jang
D. Wei
R. Struyven
Yoon Kim
...
Helen Y Yang
Alexander M. Rush
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
18
8
0
13 Jul 2021
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video Recognition Systems
Shangyu Xie
Han Wang
Yu Kong
Yuan Hong
AAML
19
25
0
09 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Xin Liu
Henglin Shi
Haoyu Chen
Zitong Yu
Xiaobai Li
Guoying Zhao
21
80
0
01 Jul 2021
When Video Classification Meets Incremental Classes
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
27
28
0
30 Jun 2021
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization
Anurag Bagchi
Jazib Mahmood
Dolton Fernandes
Ravi Kiran Sarvadevabhatla
32
21
0
27 Jun 2021
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos
V. Vasileiou
N. Kardaris
Petros Maragos
20
2
0
26 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
37
127
0
21 Jun 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
40
11
0
18 Jun 2021
MaCLR: Motion-aware Contrastive Learning of Representations for Videos
Fanyi Xiao
Joseph Tighe
Davide Modolo
SSL
24
13
0
17 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
27
11
0
12 Jun 2021
What Makes Multi-modal Learning Better than Single (Provably)
Yu Huang
Chenzhuang Du
Zihui Xue
Xuanyao Chen
Hang Zhao
Longbo Huang
39
251
0
08 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
29
4
0
02 Jun 2021
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation
Zhe Wang
Hao Chen
Xinyu Li
Chunhui Liu
Yuanjun Xiong
Joseph Tighe
Charless C. Fowlkes
30
20
0
29 May 2021
Detecting Biological Locomotion in Video: A Computational Approach
Soo-Min Kang
Richard P. Wildes
17
0
0
26 May 2021
GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot Action Recognition
Bin Sun
Dehui Kong
Shaofan Wang
Jinghua Li
Baocai Yin
Xiaonan Luo
28
18
0
25 May 2021
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Dipika Singhania
R. Rahaman
Angela Yao
AI4TS
16
55
0
23 May 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
F. Brémond
ViT
43
67
0
17 May 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui
Shaofei Huang
Si Liu
Zihan Ding
Guanbin Li
Wenguan Wang
Jizhong Han
Fei Wang
20
46
0
14 May 2021
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Peng Zhang
Can Li
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Fei Wu
31
57
0
13 May 2021
Action Shuffling for Weakly Supervised Temporal Localization
Xiaoyu Zhang
Haichao Shi
Changsheng Li
Xinchu Shi
WSOL
43
10
0
10 May 2021
Good Practices and A Strong Baseline for Traffic Anomaly Detection
Yuxiang Zhao
Wenhao Wu
Yue He
Yingying Li
Xiao Tan
Shifeng Chen
AI4TS
17
13
0
09 May 2021
Adaptive Focus for Efficient Video Recognition
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
45
98
0
07 May 2021
ConCAD: Contrastive Learning-based Cross Attention for Sleep Apnea Detection
Guanjie Huang
Fenglong Ma
29
10
0
07 May 2021
Previous
1
2
3
...
5
6
7
...
16
17
18
Next