ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
v1v2 (latest)

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,289 papers shown
Title
Enriching Local and Global Contexts for Temporal Action Localization
Enriching Local and Global Contexts for Temporal Action Localization
Zixin Zhu
Wei Tang
Le Wang
N. Zheng
G. Hua
99
112
0
27 Jul 2021
Real-Time Activity Recognition and Intention Recognition Using a
  Vision-based Embedded System
Real-Time Activity Recognition and Intention Recognition Using a Vision-based Embedded System
Sahar Darafsh
S. S. Ghidary
M. S. Zamani
32
8
0
27 Jul 2021
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time
  Series Forecasting
Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman
Dotan Di Castro
AI4TS
45
1
0
27 Jul 2021
Temporal Alignment Prediction for Few-Shot Video Classification
Temporal Alignment Prediction for Few-Shot Video Classification
Fei Pan
Chunlei Xu
Jie Guo
Yanwen Guo
AI4TS
41
1
0
26 Jul 2021
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Hanxi Lin
Xinxiao Wu
Jiebo Luo
65
2
0
25 Jul 2021
Self-Conditioned Probabilistic Learning of Video Rescaling
Self-Conditioned Probabilistic Learning of Video Rescaling
Yuan Tian
Guo Lu
Xiongkuo Min
Zhaohui Che
Guangtao Zhai
G. Guo
Zhiyong Gao
37
26
0
24 Jul 2021
Detail Preserving Residual Feature Pyramid Modules for Optical Flow
Detail Preserving Residual Feature Pyramid Modules for Optical Flow
Libo Long
J. Lang
61
6
0
23 Jul 2021
EAN: Event Adaptive Network for Enhanced Action Recognition
EAN: Event Adaptive Network for Enhanced Action Recognition
Yuan Tian
Yichao Yan
Guangtao Zhai
G. Guo
Zhiyong Gao
81
42
0
22 Jul 2021
Multi-modal Residual Perceptron Network for Audio-Video Emotion
  Recognition
Multi-modal Residual Perceptron Network for Audio-Video Emotion Recognition
Xin Chang
W. Skarbek
59
20
0
21 Jul 2021
UNIK: A Unified Framework for Real-world Skeleton-based Action
  Recognition
UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition
Di Yang
Yaohui Wang
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
Francois Bremond
83
49
0
19 Jul 2021
Action Forecasting with Feature-wise Self-Attention
Action Forecasting with Feature-wise Self-Attention
Yan Bin Ng
Basura Fernando
EgoV
28
0
0
19 Jul 2021
Agent-Environment Network for Temporal Action Proposal Generation
Agent-Environment Network for Temporal Action Proposal Generation
Viet-Khoa Vo-Ho
Ngan Le
Kashu Yamazaki
Akihiro Sugimoto
Minh-Triet Tran
EgoV
55
10
0
17 Jul 2021
Training for temporal sparsity in deep neural networks, application in
  video processing
Training for temporal sparsity in deep neural networks, application in video processing
Amirreza Yousefzadeh
Manolis Sifalakis
70
3
0
15 Jul 2021
Developmental Stage Classification of Embryos Using Two-Stream Neural
  Network with Linear-Chain Conditional Random Field
Developmental Stage Classification of Embryos Using Two-Stream Neural Network with Linear-Chain Conditional Random Field
Stanislav Lukyanenko
Won-Dong Jang
D. Wei
R. Struyven
Yoon Kim
...
Helen Y Yang
Alexander M. Rush
D. Ben-Yosef
D. Needleman
Hanspeter Pfister
53
9
0
13 Jul 2021
Aligning Correlation Information for Domain Adaptation in Action
  Recognition
Aligning Correlation Information for Domain Adaptation in Action Recognition
Yuecong Xu
Jianfei Yang
Haozhi Cao
K. Mao
Jianxiong Yin
Simon See
87
39
0
11 Jul 2021
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video
  Recognition Systems
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video Recognition Systems
Shangyu Xie
Han Wang
Yu Kong
Yuan Hong
AAML
73
27
0
09 Jul 2021
Video 3D Sampling for Self-supervised Representation Learning
Video 3D Sampling for Self-supervised Representation Learning
Wei Li
Dezhao Luo
Bo Fang
Yu Zhou
Weiping Wang
62
7
0
08 Jul 2021
Productivity, Portability, Performance: Data-Centric Python
Productivity, Portability, Performance: Data-Centric Python
Yiheng Wang
Yao Zhang
Yanzhang Wang
Yan Wan
Jiao Wang
Zhongyuan Wu
Yuhao Yang
Bowen She
169
101
0
01 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding
  and Emotion Analysis
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion Analysis
Xin Liu
Henglin Shi
Haoyu Chen
Zitong Yu
Xiaobai Li
Guoying Zhao
82
83
0
01 Jul 2021
CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation
CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation
Ankit Singh
135
109
0
30 Jun 2021
When Video Classification Meets Incremental Classes
When Video Classification Meets Incremental Classes
Hanbin Zhao
Xin Qin
Shihao Su
Yongjian Fu
Zibo Lin
Xi Li
CLL
73
28
0
30 Jun 2021
Long-Short Temporal Modeling for Efficient Action Recognition
Long-Short Temporal Modeling for Efficient Action Recognition
Liyu Wu
Yuexian Zou
Can Zhang
38
1
0
30 Jun 2021
Interflow: Aggregating Multi-layer Feature Mappings with Attention
  Mechanism
Interflow: Aggregating Multi-layer Feature Mappings with Attention Mechanism
Zhicheng Cai
35
1
0
26 Jun 2021
Exploring Temporal Context and Human Movement Dynamics for Online Action
  Detection in Videos
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos
V. Vasileiou
N. Kardaris
Petros Maragos
51
2
0
26 Jun 2021
Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily
  Long Videos of Seizures
Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily Long Videos of Seizures
Fernando Pérez-García
C. Scott
Rachel Sparks
B. Diehl
Sébastien Ourselin
SLR
53
17
0
22 Jun 2021
Towards Long-Form Video Understanding
Towards Long-Form Video Understanding
Chaoxia Wu
Philipp Krahenbuhl
VLMViT
119
170
0
21 Jun 2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
Michael S. Ryoo
A. Piergiovanni
Anurag Arnab
Mostafa Dehghani
A. Angelova
ViT
153
129
0
21 Jun 2021
Does Optimal Source Task Performance Imply Optimal Pre-training for a
  Target Task?
Does Optimal Source Task Performance Imply Optimal Pre-training for a Target Task?
Steven Gutstein
Brent Lance
Sanjay Shakkottai
29
1
0
21 Jun 2021
OadTR: Online Action Detection with Transformers
OadTR: Online Action Detection with Transformers
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Yuanjie Shao
Zhe Zuo
Changxin Gao
Nong Sang
OffRLViT
103
117
0
21 Jun 2021
Improving Ultrasound Tongue Image Reconstruction from Lip Images Using
  Self-supervised Learning and Attention Mechanism
Improving Ultrasound Tongue Image Reconstruction from Lip Images Using Self-supervised Learning and Attention Mechanism
Haiyang Liu
Jihang Zhang
50
4
0
20 Jun 2021
Self-supervised Video Representation Learning with Cross-Stream
  Prototypical Contrasting
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
108
11
0
18 Jun 2021
Long-Short Temporal Contrastive Learning of Video Transformers
Long-Short Temporal Contrastive Learning of Video Transformers
Jue Wang
Gedas Bertasius
Du Tran
Lorenzo Torresani
VLMViT
153
50
0
17 Jun 2021
mPyPl: Python Monadic Pipeline Library for Complex Functional Data
  Processing
mPyPl: Python Monadic Pipeline Library for Complex Functional Data Processing
Dmitry Soshnikov
Yana Valieva
AI4CE
18
0
0
16 Jun 2021
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group
  and Activity Detection
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection
Mahsa Ehsanpour
F. Saleh
Silvio Savarese
Ian Reid
Hamid Rezatofighi
81
44
0
16 Jun 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling
Mateusz Malinowski
Dimitrios Vytiniotis
G. Swirszcz
Viorica Patraucean
João Carreira
65
8
0
15 Jun 2021
Influential Rank: A New Perspective of Post-training for Robust Model
  against Noisy Labels
Influential Rank: A New Perspective of Post-training for Robust Model against Noisy Labels
Seulki Park
Hwanjun Song
Daeho Um
D. Jo
Sangdoo Yun
J. Choi
NoLa
67
0
0
14 Jun 2021
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Multi-level Attention Fusion Network for Audio-visual Event Recognition
Mathilde Brousmiche
Jean Rouat
Stéphane Dupont
154
11
0
12 Jun 2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding
  Evaluation
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation
Linjie Li
Jie Lei
Zhe Gan
Licheng Yu
Yen-Chun Chen
...
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
Lijuan Wang
Zicheng Liu
VLM
119
103
0
08 Jun 2021
Learning by Distillation: A Self-Supervised Learning Framework for
  Optical Flow Estimation
Learning by Distillation: A Self-Supervised Learning Framework for Optical Flow Estimation
Pengpeng Liu
Michael R. Lyu
Irwin King
Jia Xu
53
8
0
08 Jun 2021
White Paper Assistance: A Step Forward Beyond the Shortcut Learning
White Paper Assistance: A Step Forward Beyond the Shortcut Learning
Xuan Cheng
Tianshu Xie
Xiaomin Wang
Jiali Deng
Minghui Liu
Meilin Liu
AAML
53
0
0
08 Jun 2021
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker
  Detection in the Wild
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild
Okan Kopuklu
Maja Taseska
Gerhard Rigoll
3DV
98
46
0
07 Jun 2021
Video Imprint
Video Imprint
Zhanning Gao
Le Wang
Nebojsa Jojic
Zhenxing Niu
N. Zheng
G. Hua
38
5
0
07 Jun 2021
Anticipative Video Transformer
Anticipative Video Transformer
Rohit Girdhar
Kristen Grauman
ViT
91
212
0
03 Jun 2021
Cross-Domain First Person Audio-Visual Action Recognition through
  Relative Norm Alignment
Cross-Domain First Person Audio-Visual Action Recognition through Relative Norm Alignment
M. Planamente
Chiara Plizzari
Emanuele Alberti
Barbara Caputo
EgoV
127
12
0
03 Jun 2021
CT-Net: Channel Tensorization Network for Video Classification
CT-Net: Channel Tensorization Network for Video Classification
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
72
55
0
03 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
65
4
0
02 Jun 2021
Connecting Language and Vision for Natural Language-Based Vehicle
  Retrieval
Connecting Language and Vision for Natural Language-Based Vehicle Retrieval
Shuai Bai
Zhedong Zheng
Xiaohan Wang
Junyang Lin
Zhu Zhang
Chang Zhou
Yi Yang
Hongxia Yang
103
27
0
31 May 2021
A Study On the Effects of Pre-processing On Spatio-temporal Action
  Recognition Using Spiking Neural Networks Trained with STDP
A Study On the Effects of Pre-processing On Spatio-temporal Action Recognition Using Spiking Neural Networks Trained with STDP
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
23
5
0
31 May 2021
Transferable Sparse Adversarial Attack
Transferable Sparse Adversarial Attack
Ziwen He
Wei Wang
Jing Dong
Tieniu Tan
AAML
72
21
0
31 May 2021
Unsupervised detection of mouse behavioural anomalies using two-stream
  convolutional autoencoders
Unsupervised detection of mouse behavioural anomalies using two-stream convolutional autoencoders
Ezechukwu I. Nwokedi
R. Bains
L. Bidaut
S. Wells
Xujiong Ye
James M. Brown
34
2
0
28 May 2021
Previous
123...141516...444546
Next