Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,289 papers shown
Title
VideoMix: Rethinking Data Augmentation for Video Classification
Sangdoo Yun
Seong Joon Oh
Byeongho Heo
Dongyoon Han
Jinhyung Kim
447
76
0
07 Dec 2020
Skeleton-Based Typing Style Learning For Person Identification
Lior Gelberg
D. Mendlovic
D. Raviv
3DH
39
0
0
06 Dec 2020
Factorizing Perception and Policy for Interactive Instruction Following
Kunal Pratap Singh
Suvaansh Bhambri
Byeonghwi Kim
Roozbeh Mottaghi
Jonghyun Choi
LM&Ro LRM
132
36
0
06 Dec 2020
ParaNet: Deep Regular Representation for 3D Point Clouds
Qijian Zhang
Junhui Hou
Y. Qian
Juyong Zhang
Ying He
3DPC
26
1
0
05 Dec 2020
Spatial-Temporal Alignment Network for Action Recognition and Detection
Junwei Liang
Liangliang Cao
Xuehan Xiong
Ting Yu
Alexander G. Hauptmann
3DPC
70
9
0
04 Dec 2020
Video Anomaly Detection by Estimating Likelihood of Representations
Yuqi Ouyang
Victor Sanchez
62
17
0
02 Dec 2020
Pose-based Sign Language Recognition using GCN and BERT
Anirudh Tunga
Sai Vidyaranya Nuthalapati
J. Wachs
SLR
63
70
0
01 Dec 2020
A New Action Recognition Framework for Video Highlights Summarization in Sporting Events
Cheng Yan
Xin Li
Guoqiang Li
EDL
39
13
0
01 Dec 2020
UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning
Kunming Luo
Chuan Wang
Shuaicheng Liu
Haoqiang Fan
Jue Wang
Jian Sun
MDE
98
75
0
01 Dec 2020
Just One Moment: Structural Vulnerability of Deep Action Recognition against One Frame Attack
Ian Ryu
Jun-Hyuk Kim
Jun-Ho Choi
Jong-Seok Lee
AAML
97
18
0
30 Nov 2020
Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps
Mattia Segu
Federico Pirovano
Gianmario Fumagalli
Amedeo Fabris
59
2
0
26 Nov 2020
Group-Skeleton-Based Human Action Recognition in Complex Events
Tingtian Li
Zixun Sun
Xiao Chen
43
5
0
26 Nov 2020
t-EVA: Time-Efficient t-SNE Video Annotation
Soroosh Poorgholi
O. Kayhan
Jan van Gemert
45
5
0
26 Nov 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
57
0
0
25 Nov 2020
KShapeNet: Riemannian network on Kendall shape space for Skeleton based Action Recognition
Racha Friji
Hassen Drira
F. Chaieb
S. Kurtek
Hamza Kchok
3DPC
31
2
0
24 Nov 2020
Temporal Action Detection with Multi-level Supervision
Baifeng Shi
Qi Dai
Judy Hoffman
Kate Saenko
Trevor Darrell
Huijuan Xu
90
14
0
24 Nov 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
92
126
0
23 Nov 2020
We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos
A. J. Rana
Yogesh S Rawat
ViT
44
11
0
22 Nov 2020
Boundary-sensitive Pre-training for Temporal Localization in Videos
Mengmeng Xu
Juan-Manuel Perez-Rua
Victor Escorcia
Brais Martínez
Xiatian Zhu
Li Zhang
Guohao Li
Tao Xiang
80
61
0
21 Nov 2020
Visual Recognition of Great Ape Behaviours in the Wild
Faizaan Sakib
T. Burghardt
77
24
0
21 Nov 2020
3D attention mechanism for fine-grained classification of table tennis strokes using a Twin Spatio-Temporal Convolutional Neural Networks
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
J. Morlier
3DPC
45
12
0
20 Nov 2020
Consistency-Aware Graph Network for Human Interaction Understanding
Zhenhua Wang
Jiajun Meng
Dongyan Guo
Jianhua Zhang
Javen Qinfeng Shi
Shengyong Chen
GNN
42
3
0
20 Nov 2020
Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos
Reza Ghoddoosian
S. Sayed
V. Athitsos
AI4TS
30
7
0
20 Nov 2020
HMFlow: Hybrid Matching Optical Flow Network for Small and Fast-Moving Objects
Suihanjin Yu
Youming Zhang
Chen Wang
Xiao Bai
Liang Zhang
Edwin R. Hancock
30
3
0
19 Nov 2020
TRAT: Tracking by Attention Using Spatio-Temporal Features
Hasan Saribas
Hakan Çevikalp
Okan Kopuklu
Bedirhan Uzun
64
25
0
18 Nov 2020
Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks
Thomas Teixeira
Eric Granger
Alessandro Lameiras Koerich
CVBM
55
10
0
18 Nov 2020
Double-Prong ConvLSTM for Spatiotemporal Occupancy Prediction in Dynamic Environments
Maneekwan Toyungyernsub
Masha Itkina
Ransalu Senanayake
Mykel J. Kochenderfer
90
22
0
18 Nov 2020
Video Big Data Analytics in the Cloud: A Reference Architecture, Survey, Opportunities, and Open Research Issues
A. Alam
I. Ullah
Young-Koo Lee
70
25
0
16 Nov 2020
JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition
Jinmiao Cai
Nianjuan Jiang
Xiaoguang Han
Kui Jia
Jiangbo Lu
55
85
0
16 Nov 2020
SALAD: Self-Assessment Learning for Action Detection
Guillaume Vaudaux-Ruth
Adrien Chan-Hon-Tong
Catherine Achard
41
8
0
13 Nov 2020
Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs
Sean Segal
Eric Kee
Wenjie Luo
Abbas Sadat
Ersin Yumer
R. Urtasun
42
11
0
12 Nov 2020
Transformers for One-Shot Visual Imitation
Sudeep Dasari
Abhinav Gupta
LM&Ro
94
95
0
11 Nov 2020
Skeleton-based Relational Reasoning for Group Activity Analysis
Mauricio Perez
Jun Liu
Alex C. Kot
82
43
0
11 Nov 2020
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos
Di Yang
Rui Dai
Yaohui Wang
Rupayan Mallick
Luca Minciullo
Gianpiero Francesca
Francois Bremond
81
16
0
10 Nov 2020
Temporal Stochastic Softmax for 3D CNNs: An Application in Facial Expression Recognition
T. Ayral
M. Pedersoli
Simon L Bacon
Eric Granger
CVBM 3DH
53
11
0
10 Nov 2020
Multi-modal Fusion for Single-Stage Continuous Gesture Recognition
Harshala Gammulle
Simon Denman
Sridha Sridharan
Clinton Fookes
SLR
96
30
0
10 Nov 2020
STCNet: Spatio-Temporal Cross Network for Industrial Smoke Detection
Yichao Cao
Qingfei Tang
Xiaobo Lu
Fan Li
Jinde Cao
27
3
0
10 Nov 2020
An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments
Shrishti Saha Shetu
Soumitro Chakrabarty
Emanuel Habets
50
3
0
09 Nov 2020
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition
Vinoj Jayasundara
D. Roy
Basura Fernando
3DPC
85
3
0
08 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
73
1
0
08 Nov 2020
Mutual Modality Learning for Video Action Classification
Stepan Alekseevich Komkov
Maksim Dzabraev
Aleksandr Petiushko
65
9
0
04 Nov 2020
Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
Qi Kuang
Xin Jin
Qinping Zhao
Bin Zhou
59
30
0
04 Nov 2020
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation
Yuan Cheng
Yuchao Yang
Hai-Bao Chen
Ngai Wong
Hao Yu
3DPC
49
3
0
04 Nov 2020
PV-NAS: Practical Neural Architecture Search for Video Recognition
Zihao Wang
Chen Lin
Lu Sheng
Junjie Yan
Jing Shao
ViT
77
7
0
02 Nov 2020
Actor and Action Modular Network for Text-based Video Segmentation
Jianhua Yang
Yan Huang
K. Niu
Linjiang Huang
Zhanyu Ma
Liang Wang
130
10
0
02 Nov 2020
Multimodal and self-supervised representation learning for automatic gesture recognition in surgical robotics
Aniruddha Tamhane
Jinlin Wu
Mathias Unberath
SSL
18
0
0
31 Oct 2020
Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation
Zhengyuan Yang
Amanda Kay
Yuncheng Li
Wendi F. Cross
Jiebo Luo
58
21
0
30 Oct 2020
Exploring Dynamic Context for Multi-path Trajectory Prediction
Hao Cheng
Wentong Liao
Xuejiao Tang
M. Yang
Monika Sester
Bodo Rosenhahn
105
33
0
30 Oct 2020
SAR-NAS: Skeleton-based Action Recognition via Neural Architecture Searching
Haoyuan Zhang
Yonghong Hou
Pichao Wang
Zihui Guo
Wanqing Li
64
15
0
29 Oct 2020
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Hochul Hwang
Cheongjae Jang
Geonwoo Park
Junghyun Cho
Ig-Jae Kim
113
73
0
28 Oct 2020