ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06573
  4. Cited By
Convolutional Two-Stream Network Fusion for Video Action Recognition

Convolutional Two-Stream Network Fusion for Video Action Recognition

22 April 2016
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
ArXivPDFHTML

Papers citing "Convolutional Two-Stream Network Fusion for Video Action Recognition"

50 / 853 papers shown
Title
Object-Region Video Transformers
Object-Region Video Transformers
Roei Herzig
Elad Ben-Avraham
K. Mangalam
Amir Bar
Gal Chechik
Anna Rohrbach
Trevor Darrell
Amir Globerson
ViT
30
82
0
13 Oct 2021
Early Melanoma Diagnosis with Sequential Dermoscopic Images
Early Melanoma Diagnosis with Sequential Dermoscopic Images
Zhen Yu
Jennifer Nguyen
Toàn D. Nguyên
J. Kelly
C. Mclean
Paul Bonnington
Lei Zhang
Victoria Mar
Z. Ge
27
41
0
12 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
46
12
0
12 Oct 2021
Joint Learning On The Hierarchy Representation for Fine-Grained Human
  Action Recognition
Joint Learning On The Hierarchy Representation for Fine-Grained Human Action Recognition
M. C. Leong
Hui Li Tan
Haosong Zhang
Liyuan Li
Feng Lin
J. Lim
40
10
0
12 Oct 2021
A Multi-viewpoint Outdoor Dataset for Human Action Recognition
A Multi-viewpoint Outdoor Dataset for Human Action Recognition
Asanka G. Perera
Yee Wei Law
T. Ogunwa
J. Chahl
20
40
0
07 Oct 2021
Spatio-Temporal Video Representation Learning for AI Based Video
  Playback Style Prediction
Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction
Rishubh Parihar
Gaurav Ramola
Ranajit Saha
Raviprasad Kini
Aniket Rege
S. Velusamy
36
1
0
03 Oct 2021
Turning old models fashion again: Recycling classical CNN networks using
  the Lattice Transformation
Turning old models fashion again: Recycling classical CNN networks using the Lattice Transformation
Ana Paula G. S. de Almeida
Flávio de Barros Vidal
11
0
0
28 Sep 2021
TSM: Temporal Shift Module for Efficient and Scalable Video
  Understanding on Edge Device
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device
Ji Lin
Chuang Gan
Kuan-Chieh Jackson Wang
Song Han
40
64
0
27 Sep 2021
CLIPort: What and Where Pathways for Robotic Manipulation
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
65
632
0
24 Sep 2021
V-SlowFast Network for Efficient Visual Sound Separation
V-SlowFast Network for Efficient Visual Sound Separation
Lingyu Zhu
Esa Rahtu
52
10
0
18 Sep 2021
ActionCLIP: A New Paradigm for Video Action Recognition
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
152
362
0
17 Sep 2021
PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
Nupur Thakur
Baoxin Li
AAML
31
2
0
13 Sep 2021
Egocentric View Hand Action Recognition by Leveraging Hand Surface and
  Hand Grasp Type
Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type
Sangpil Kim
Jihyun Bae
Hyung-Gun Chi
Sunghee Hong
Byoung Soo Koh
K. Ramani
EgoV
24
0
0
08 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric
  Videos
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
39
20
0
02 Sep 2021
LIGAR: Lightweight General-purpose Action Recognition
LIGAR: Lightweight General-purpose Action Recognition
Evgeny Izutov
15
3
0
30 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action
  Recognition
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
Spatio-Temporal Interaction Graph Parsing Networks for Human-Object
  Interaction Recognition
Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition
Ning Wang
Guangming Zhu
Liang Zhang
Peiyi Shen
Hongsheng Li
Cong Hua
33
27
0
19 Aug 2021
Self-Supervised Video Representation Learning with Meta-Contrastive
  Network
Self-Supervised Video Representation Learning with Meta-Contrastive Network
Yuanze Lin
Xun Guo
Yan Lu
SSL
27
41
0
19 Aug 2021
Channel-Temporal Attention for First-Person Video Domain Adaptation
Channel-Temporal Attention for First-Person Video Domain Adaptation
Xianyuan Liu
Shuo Zhou
Tao Lei
Haiping Lu
EgoV
21
0
0
17 Aug 2021
Temporal Action Localization Using Gated Recurrent Units
Temporal Action Localization Using Gated Recurrent Units
Hassan Keshvari Khojasteh
Hoda Mohammadzade
H. Behroozi
23
3
0
07 Aug 2021
Feature-Supervised Action Modality Transfer
Feature-Supervised Action Modality Transfer
Fida Mohammad Thoker
Cees G. M. Snoek
26
1
0
06 Aug 2021
Towards Efficient Tensor Decomposition-Based DNN Model Compression with
  Optimization Framework
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework
Miao Yin
Yang Sui
Siyu Liao
Bo Yuan
28
79
0
26 Jul 2021
Self-Conditioned Probabilistic Learning of Video Rescaling
Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian
Guo Lu
Xiongkuo Min
Zhaohui Che
Guangtao Zhai
G. Guo
Zhiyong Gao
33
26
0
24 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
37
41
0
22 Jul 2021
From Single to Multiple: Leveraging Multi-level Prediction Spaces for
  Video Forecasting
From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Mengcheng Lan
Shuliang Ning
Yanran Li
Qian Chen
Xunlai Chen
Xiaoguang Han
Shuguang Cui
21
0
0
21 Jul 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action
  Recognition
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
27
47
0
19 Jul 2021
Agent-Environment Network for Temporal Action Proposal Generation
Agent-Environment Network for Temporal Action Proposal Generation
Viet-Khoa Vo-Ho
Ngan Le
Kashu Yamazaki
Akihiro Sugimoto
Minh-Triet Tran
EgoV
19
9
0
17 Jul 2021
Developmental Stage Classification of Embryos Using Two-Stream Neural
  Network with Linear-Chain Conditional Random Field
Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field
Stanislav Lukyanenko
Won-Dong Jang
D. Wei
R. Struyven
Yoon Kim
...
Helen Y Yang
Alexander M. Rush
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
18
8
0
13 Jul 2021
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video
  Recognition Systems
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video Recognition Systems
Shangyu Xie
Han Wang
Yu Kong
Yuan Hong
AAML
19
25
0
09 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding
  and Emotion Analysis
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Xin Liu
Henglin Shi
Haoyu Chen
Zitong Yu
Xiaobai Li
Guoying Zhao
21
80
0
01 Jul 2021
When Video Classification Meets Incremental Classes
When Video Classification Meets Incremental Classes
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
27
28
0
30 Jun 2021
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action
  Localization
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization
Anurag Bagchi
Jazib Mahmood
Dolton Fernandes
Ravi Kiran Sarvadevabhatla
32
21
0
27 Jun 2021
Exploring Temporal Context and Human Movement Dynamics for Online Action
  Detection in Videos
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos
V. Vasileiou
N. Kardaris
Petros Maragos
20
2
0
26 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
37
127
0
21 Jun 2021
Self-supervised Video Representation Learning with Cross-Stream
  Prototypical Contrasting
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
40
11
0
18 Jun 2021
MaCLR: Motion-aware Contrastive Learning of Representations for Videos
MaCLR: Motion-aware Contrastive Learning of Representations for Videos
Fanyi Xiao
Joseph Tighe
Davide Modolo
SSL
24
13
0
17 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
27
11
0
12 Jun 2021
What Makes Multi-modal Learning Better than Single (Provably)
What Makes Multi-modal Learning Better than Single (Provably)
Yu Huang
Chenzhuang Du
Zihui Xue
Xuanyao Chen
Hang Zhao
Longbo Huang
39
251
0
08 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
29
4
0
02 Jun 2021
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised
  Temporal Action Segmentation
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation
Zhe Wang
Hao Chen
Xinyu Li
Chunhui Liu
Yuanjun Xiong
Joseph Tighe
Charless C. Fowlkes
30
20
0
29 May 2021
Detecting Biological Locomotion in Video: A Computational Approach
Detecting Biological Locomotion in Video: A Computational Approach
Soo-Min Kang
Richard P. Wildes
17
0
0
26 May 2021
GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot
  Action Recognition
GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot Action Recognition
Bin Sun
Dehui Kong
Shaofan Wang
Jinghua Li
Baocai Yin
Xiaonan Luo
28
18
0
25 May 2021
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Coarse to Fine Multi-Resolution Temporal Convolutional Network
Dipika Singhania
R. Rahaman
Angela Yao
AI4TS
16
55
0
23 May 2021
VPN++: Rethinking Video-Pose embeddings for understanding Activities of
  Daily Living
VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living
Srijan Das
Rui Dai
Di Yang
F. Brémond
ViT
43
67
0
17 May 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor
  Segmentation
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui
Shaofei Huang
Si Liu
Zihan Ding
Guanbin Li
Wenguan Wang
Jizhong Han
Fei Wang
20
46
0
14 May 2021
VSR: A Unified Framework for Document Layout Analysis combining Vision,
  Semantics and Relations
VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations
Peng Zhang
Can Li
Liang Qiao
Zhanzhan Cheng
Shiliang Pu
Yi Niu
Fei Wu
31
57
0
13 May 2021
Action Shuffling for Weakly Supervised Temporal Localization
Action Shuffling for Weakly Supervised Temporal Localization
Xiaoyu Zhang
Haichao Shi
Changsheng Li
Xinchu Shi
WSOL
43
10
0
10 May 2021
Good Practices and A Strong Baseline for Traffic Anomaly Detection
Good Practices and A Strong Baseline for Traffic Anomaly Detection
Yuxiang Zhao
Wenhao Wu
Yue He
Yingying Li
Xiao Tan
Shifeng Chen
AI4TS
17
13
0
09 May 2021
Adaptive Focus for Efficient Video Recognition
Adaptive Focus for Efficient Video Recognition
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
45
98
0
07 May 2021
ConCAD: Contrastive Learning-based Cross Attention for Sleep Apnea
  Detection
ConCAD: Contrastive Learning-based Cross Attention for Sleep Apnea Detection
Guanjie Huang
Fenglong Ma
29
10
0
07 May 2021
Previous
123...567...161718
Next