ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1604.06573
  4. Cited By
Convolutional Two-Stream Network Fusion for Video Action Recognition

Convolutional Two-Stream Network Fusion for Video Action Recognition

22 April 2016
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
ArXivPDFHTML

Papers citing "Convolutional Two-Stream Network Fusion for Video Action Recognition"

50 / 854 papers shown
Title
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
31
87
0
10 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
78
1,004
0
09 Apr 2020
Temporal Pyramid Network for Action Recognition
Temporal Pyramid Network for Action Recognition
Ceyuan Yang
Yinghao Xu
Jianping Shi
Bo Dai
Bolei Zhou
20
367
0
07 Apr 2020
Two-Stream AMTnet for Action Detection
Two-Stream AMTnet for Action Detection
Suman Saha
Gurkirt Singh
Fabio Cuzzolin
ViT
17
13
0
03 Apr 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
42
439
0
03 Apr 2020
PaStaNet: Toward Human Activity Knowledge Engine
PaStaNet: Toward Human Activity Knowledge Engine
Yong-Lu Li
Liang Xu
Xinpeng Liu
Xijie Huang
Yue Xu
Shiyi Wang
Haoshu Fang
Ze Ma
Mingyang Chen
Cewu Lu
28
151
0
02 Apr 2020
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Spatio-temporal Tubelet Feature Aggregation and Object Linking in Videos
Daniel Cores
V. Brea
M. Mucientes
ViT
21
1
0
01 Apr 2020
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Spatio-Temporal Graph for Video Captioning with Knowledge Distillation
Boxiao Pan
Haoye Cai
De-An Huang
Kuan-Hui Lee
Adrien Gaidon
Ehsan Adeli
Juan Carlos Niebles
33
235
0
31 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
20
5
0
18 Mar 2020
Predictively Encoded Graph Convolutional Network for Noise-Robust
  Skeleton-based Action Recognition
Predictively Encoded Graph Convolutional Network for Noise-Robust Skeleton-based Action Recognition
Jongmin Yu
Yongsang Yoon
M. Jeon
40
44
0
17 Mar 2020
ActiLabel: A Combinatorial Transfer Learning Framework for Activity
  Recognition
ActiLabel: A Combinatorial Transfer Learning Framework for Activity Recognition
Parastoo Alinia
Seyed Iman Mirzadeh
H. Ghasemzadeh
21
9
0
16 Mar 2020
MoVi: A Large Multipurpose Motion and Video Dataset
MoVi: A Large Multipurpose Motion and Video Dataset
Saeed Ghorbani
Kimia Mahdaviani
A. Thaler
Konrad Paul Kording
D. Cook
Gunnar Blohm
N. Troje
21
72
0
04 Mar 2020
Vision-based Robot Manipulation Learning via Human Demonstrations
Vision-based Robot Manipulation Learning via Human Demonstrations
Zhixin Jia
Mengxiang Lin
Zhixin Chen
Shibo Jian
SSL
14
7
0
01 Mar 2020
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Alban Main De Boissiere
R. Noumeir
35
38
0
28 Feb 2020
Beyond Dropout: Feature Map Distortion to Regularize Deep Neural
  Networks
Beyond Dropout: Feature Map Distortion to Regularize Deep Neural Networks
Yehui Tang
Yunhe Wang
Yixing Xu
Boxin Shi
Chao Xu
Chunjing Xu
Chang Xu
22
38
0
23 Feb 2020
Stroke Constrained Attention Network for Online Handwritten Mathematical
  Expression Recognition
Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition
Jiaming Wang
Jun Du
Jianshu Zhang
27
24
0
20 Feb 2020
Human Action Recognition using Local Two-Stream Convolution Neural
  Network Features and Support Vector Machines
Human Action Recognition using Local Two-Stream Convolution Neural Network Features and Support Vector Machines
David Torpey
Turgay Celik
6
8
0
19 Feb 2020
Three-Stream Fusion Network for First-Person Interaction Recognition
Three-Stream Fusion Network for First-Person Interaction Recognition
Ye-ji Kim
Dong-Gyu Lee
Seong-Whan Lee
22
8
0
19 Feb 2020
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
A Survey on 3D Skeleton-Based Action Recognition Using Learning Method
Bin Ren
Mengyuan Liu
Runwei Ding
Hong Liu
32
121
0
14 Feb 2020
Learning spatio-temporal representations with temporal squeeze pooling
Learning spatio-temporal representations with temporal squeeze pooling
Guoxi Huang
A. Bors
ViT
14
11
0
11 Feb 2020
Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$
  Videos
Weakly-Supervised Multi-Person Action Recognition in 360∘^{\circ}∘ Videos
Junnan Li
Jianquan Liu
Yongkang Wong
Shoji Nishimura
Mohan S. Kankanhalli
31
13
0
09 Feb 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language
  Recognition
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
Hao Zhou
Wen-gang Zhou
Yun Zhou
Houqiang Li
NoLa
32
197
0
08 Feb 2020
Modality Compensation Network: Cross-Modal Adaptation for Action
  Recognition
Modality Compensation Network: Cross-Modal Adaptation for Action Recognition
Sijie Song
Jiaying Liu
Yanghao Li
Zongming Guo
32
43
0
31 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric
  Learning
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam
Richard J. Radke
27
46
0
21 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
39
2
0
21 Jan 2020
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action
  Recogntion
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion
Kaiyu Shan
Yongtao Wang
Zhuoying Wang
Tingting Liang
Zhi Tang
Ying-Cong Chen
Yangyan Li
AI4TS
28
4
0
19 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Tianhao Li
Limin Wang
VGen
18
55
0
16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for
  Better Action Recognition
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition
Li Tao
Xueting Wang
T. Yamasaki
3DPC
22
24
0
16 Jan 2020
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture
  Recognition in Videos
Res3ATN -- Deep 3D Residual Attention Network for Hand Gesture Recognition in Videos
Naina Dhingra
A. Kunz
3DPC
SLR
38
35
0
04 Jan 2020
Temporal-Spatial Neural Filter: Direction Informed End-to-End
  Multi-channel Target Speech Separation
Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation
Rongzhi Gu
Yuexian Zou
36
18
0
02 Jan 2020
Adversarial Cross-Domain Action Recognition with Co-Attention
Adversarial Cross-Domain Action Recognition with Co-Attention
Boxiao Pan
Zhangjie Cao
Ehsan Adeli
Juan Carlos Niebles
ViT
27
103
0
22 Dec 2019
Towards Robust Learning with Different Label Noise Distributions
Towards Robust Learning with Different Label Noise Distributions
Diego Ortego
Eric Arazo
Paul Albert
Noel E. O'Connor
Kevin McGuinness
NoLa
24
24
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Mimetics: Towards Understanding Human Actions Out of Context
Philippe Weinzaepfel
Grégory Rogez
19
71
0
16 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action
  Recognition
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
24
177
0
11 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
Flow-Distilled IP Two-Stream Networks for Compressed Video Action
  Recognition
Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition
Shiyuan Huang
Xudong Lin
Svebor Karaman
Shih-Fu Chang
22
10
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
24
9
0
09 Dec 2019
Synthetic Humans for Action Recognition from Unseen Viewpoints
Synthetic Humans for Action Recognition from Unseen Viewpoints
Gül Varol
Ivan Laptev
Cordelia Schmid
Andrew Zisserman
38
96
0
09 Dec 2019
Automatic Video Object Segmentation via Motion-Appearance-Stream Fusion
  and Instance-aware Segmentation
Automatic Video Object Segmentation via Motion-Appearance-Stream Fusion and Instance-aware Segmentation
Sung-Kwon Choo
Wonkyo Seo
N. Cho
VOS
33
0
0
03 Dec 2019
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
19
155
0
01 Dec 2019
AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly
  Supervised Action Recognition and Localization
AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization
Xiaoyu Zhang
Changsheng Li
Haichao Shi
Xiaobin Zhu
Peng Li
Jing Dong
34
37
0
27 Nov 2019
G-TAD: Sub-Graph Localization for Temporal Action Detection
G-TAD: Sub-Graph Localization for Temporal Action Detection
Mengmeng Xu
Chen Zhao
D. Rojas
Ali K. Thabet
Guohao Li
39
435
0
26 Nov 2019
SRG: Snippet Relatedness-based Temporal Action Proposal Generator
SRG: Snippet Relatedness-based Temporal Action Proposal Generator
Hyunjun Eun
Sumin Lee
Jinyoung Moon
Jongyoul Park
Chanho Jung
Changick Kim
19
24
0
26 Nov 2019
SCR-Graph: Spatial-Causal Relationships based Graph Reasoning Network
  for Human Action Prediction
SCR-Graph: Spatial-Causal Relationships based Graph Reasoning Network for Human Action Prediction
Bo Chen
Decai Li
Yuqing He
C. Hua
GNN
19
4
0
22 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
36
236
0
21 Nov 2019
Cross-Class Relevance Learning for Temporal Concept Localization
Cross-Class Relevance Learning for Temporal Concept Localization
Junwei Ma
S. Gorti
M. Volkovs
I. Stanevich
Guangwei Yu
23
7
0
19 Nov 2019
Action Recognition Using Volumetric Motion Representations
Action Recognition Using Volumetric Motion Representations
Michael Peven
Gregory Hager
A. Reiter
3DPC
26
0
0
19 Nov 2019
Satellite Image Time Series Classification with Pixel-Set Encoders and
  Temporal Self-Attention
Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention
Vivien Sainte Fare Garnot
Loic Landrieu
S. Giordano
N. Chehata
25
148
0
18 Nov 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time
  Spatiotemporal Action Localization
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
28
143
0
15 Nov 2019
Previous
123...91011...161718
Next