ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.2199
  4. Cited By
Two-Stream Convolutional Networks for Action Recognition in Videos
v1v2 (latest)

Two-Stream Convolutional Networks for Action Recognition in Videos

9 June 2014
Karen Simonyan
Andrew Zisserman
ArXiv (abs)PDFHTML

Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"

50 / 2,289 papers shown
Title
Action parsing using context features
Action parsing using context features
N. Mehrseresht
24
0
0
20 May 2022
PillarNet: Real-Time and High-Performance Pillar-based 3D Object
  Detection
PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection
Guang-Hui Shi
Ruifeng Li
Chaoxiang Ma
3DPC
112
145
0
16 May 2022
Deep Decomposition and Bilinear Pooling Network for Blind Night-Time
  Image Quality Evaluation
Deep Decomposition and Bilinear Pooling Network for Blind Night-Time Image Quality Evaluation
Qiuping Jiang
Jiawu Xu
Yudong Mao
Wei Zhou
Xiongkuo Min
Guangtao Zhai
61
4
0
12 May 2022
Past and Future Motion Guided Network for Audio Visual Event
  Localization
Past and Future Motion Guided Network for Audio Visual Event Localization
Ting-Yen Chen
Jianqin Yin
Jin Tang
58
2
0
08 May 2022
Representation Learning for Compressed Video Action Recognition via
  Attentive Cross-modal Interaction with Motion Enhancement
Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement
Bing Li
Jiaxin Chen
Dongming Zhang
Xiuguo Bao
Di Huang
51
15
0
07 May 2022
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
Mingdong Yang
Guo Chen
Yin-Dong Zheng
Tong Lu
Limin Wang
91
48
0
05 May 2022
Deep Neural Network approaches for Analysing Videos of Music
  Performances
Deep Neural Network approaches for Analysing Videos of Music Performances
F. Liwicki
Richa Upadhyay
Prakash Chandra Chhipa
Killian Murphy
F. Visi
S. Östersjö
Marcus Liwicki
52
1
0
05 May 2022
CoCa: Contrastive Captioners are Image-Text Foundation Models
CoCa: Contrastive Captioners are Image-Text Foundation Models
Jiahui Yu
Zirui Wang
Vijay Vasudevan
Legg Yeung
Mojtaba Seyedhosseini
Yonghui Wu
VLMCLIPOffRL
334
1,314
0
04 May 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
In Defense of Image Pre-Training for Spatiotemporal Recognition
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
72
0
0
03 May 2022
Exposing Deepfake Face Forgeries with Guided Residuals
Exposing Deepfake Face Forgeries with Guided Residuals
Zhiqing Guo
Gaobo Yang
Jiyou Chen
Xingming Sun
CVBM
63
28
0
02 May 2022
Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation
  For Action Recognition
Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition
Yang Zhou
Zhanhao He
Ke Lu
Guanhong Wang
Gaoang Wang
CLLSLR
104
2
0
01 May 2022
RADNet: A Deep Neural Network Model for Robust Perception in Moving
  Autonomous Systems
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
B. Mudassar
Sho Ko
Maojingjing Li
Priyabrata Saha
Saibal Mukhopadhyay
111
2
0
30 Apr 2022
Self-supervised Contrastive Learning for Audio-Visual Action Recognition
Self-supervised Contrastive Learning for Audio-Visual Action Recognition
Yang Liu
Y. Tan
Haoyu Lan
SSL
79
7
0
28 Apr 2022
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation
  Learning for Action Recognition Pre-Training
Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training
Guanhong Wang
Ke Lu
Yang Zhou
Zhanhao He
Gaoang Wang
SSL
76
3
0
27 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and
  Applications
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
117
0
25 Apr 2022
Estimation of Reliable Proposal Quality for Temporal Action Detection
Estimation of Reliable Proposal Quality for Temporal Action Detection
Junshan Hu
Chaoxu Guo
Liansheng Zhuang
Biao Wang
T. Ge
Yuning Jiang
Houqiang Li
85
4
0
25 Apr 2022
Trusted Multi-View Classification with Dynamic Evidential Fusion
Trusted Multi-View Classification with Dynamic Evidential Fusion
Zongbo Han
Changqing Zhang
Huazhu Fu
Qiufeng Wang
EDL
93
235
0
25 Apr 2022
Keypoint based Sign Language Translation without Glosses
Keypoint based Sign Language Translation without Glosses
Youngmin Kim
Minji Kwak
Dain Lee
Yeongeun Kim
Hyeongboo Baek
SLR
46
7
0
22 Apr 2022
THORN: Temporal Human-Object Relation Network for Action Recognition
THORN: Temporal Human-Object Relation Network for Action Recognition
Mohammed Guermal
Rui Dai
Francois Bremond
EgoV
66
3
0
20 Apr 2022
A Survey of Video-based Action Quality Assessment
A Survey of Video-based Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Qing Yu
Tao Suo
Zhan Sun
Ka Li
Lihua Zhang
64
19
0
20 Apr 2022
ActAR: Actor-Driven Pose Embeddings for Video Action Recognition
ActAR: Actor-Driven Pose Embeddings for Video Action Recognition
Soufiane Lamghari
Guillaume-Alexandre Bilodeau
Nicolas Saunier
189
4
0
19 Apr 2022
3D Convolutional Networks for Action Recognition: Application to Sport
  Gesture Recognition
3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
A. Zemmari
J. Morlier
65
5
0
13 Apr 2022
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality
  Assessment
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
Jinglin Xu
Yongming Rao
Xumin Yu
Guangyi Chen
Jie Zhou
Jiwen Lu
84
97
0
07 Apr 2022
An Empirical Study of End-to-End Temporal Action Detection
An Empirical Study of End-to-End Temporal Action Detection
Xiaolong Liu
S. Bai
Xiang Bai
94
60
0
06 Apr 2022
DBF: Dynamic Belief Fusion for Combining Multiple Object Detectors
DBF: Dynamic Belief Fusion for Combining Multiple Object Detectors
Hyungtae Lee
H. Kwon
FedML
36
18
0
06 Apr 2022
Detector-Free Weakly Supervised Group Activity Recognition
Detector-Free Weakly Supervised Group Activity Recognition
Dongkeun Kim
Jin S. Lee
Minsu Cho
Suha Kwak
ViT
77
44
0
05 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
138
106
0
04 Apr 2022
TALLFormer: Temporal Action Localization with a Long-memory Transformer
TALLFormer: Temporal Action Localization with a Long-memory Transformer
Feng Cheng
Gedas Bertasius
ViT
120
94
0
04 Apr 2022
A-ACT: Action Anticipation through Cycle Transformations
A-ACT: Action Anticipation through Cycle Transformations
Akash Gupta
Jingen Liu
Liefeng Bo
Amit K. Roy-Chowdhury
Tao Mei
93
6
0
02 Apr 2022
Vision Transformer with Cross-attention by Temporal Shift for Efficient
  Action Recognition
Vision Transformer with Cross-attention by Temporal Shift for Efficient Action Recognition
Ryota Hashiguchi
Toru Tamaki
45
6
0
01 Apr 2022
DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow
DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow
Zihua Zheng
Ni Nie
Zhi Ling
Pengfei Xiong
Jiangyu Liu
Hongya Wang
Jiankun Li
MDE
70
46
0
01 Apr 2022
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for
  Action Recognition
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition
Jun Kimata
Tomoya Nitta
Toru Tamaki
103
10
0
01 Apr 2022
Deformable Video Transformer
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
98
28
0
31 Mar 2022
Stochastic Backpropagation: A Memory Efficient Strategy for Training
  Video Models
Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models
Feng Cheng
Ming Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Li
Wei Xia
63
17
0
31 Mar 2022
StyleFool: Fooling Video Classification Systems via Style Transfer
StyleFool: Fooling Video Classification Systems via Style Transfer
Yu Cao
Xi Xiao
Ruoxi Sun
Derui Wang
Minhui Xue
Sheng Wen
AAML
129
26
0
30 Mar 2022
Alignment-Uniformity aware Representation Learning for Zero-shot Video
  Classification
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification
Shi Pu
Kaili Zhao
Mao Zheng
VLM
63
20
0
29 Mar 2022
End-to-End Compressed Video Representation Learning for Generic Event
  Boundary Detection
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Congcong Li
Xinyao Wang
Longyin Wen
Dexiang Hong
Tiejian Luo
Libo Zhang
75
17
0
29 Mar 2022
Frame-wise Action Representations for Long Videos via Sequence
  Contrastive Learning
Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning
Minghao Chen
Fangyun Wei
Chong Li
Deng Cai
AI4TS
105
35
0
28 Mar 2022
End-to-End Active Speaker Detection
End-to-End Active Speaker Detection
Juan Carlos León Alcázar
M. Cordes
Chen Zhao
Guohao Li
107
28
0
27 Mar 2022
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional
  Videos
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
Muheng Li
Lei Chen
Yueqi Duan
Zhilan Hu
Jianjiang Feng
Jie Zhou
Jiwen Lu
79
76
0
26 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for
  Self-Supervised Video Pre-Training
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
252
1,222
0
23 Mar 2022
Modality Competition: What Makes Joint Training of Multi-modal Network
  Fail in Deep Learning? (Provably)
Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)
Yu Huang
Junyang Lin
Chang Zhou
Hongxia Yang
Longbo Huang
68
97
0
23 Mar 2022
Global Matching with Overlapping Attention for Optical Flow Estimation
Global Matching with Overlapping Attention for Optical Flow Estimation
Shiyu Zhao
Long Zhao
Zhixing Zhang
Enyu Zhou
Dimitris N. Metaxas
3DPC
131
79
0
21 Mar 2022
Generative Adversarial Network for Future Hand Segmentation from
  Egocentric Video
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video
Wenqi Jia
Miao Liu
James M. Rehg
EgoV
79
14
0
21 Mar 2022
No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static
  Models by Fitting Feature-level Space-time Surfaces
No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces
Jianqi Zhong
Kaichen Zhou
Qingyong Hu
Bing Wang
Niki Trigoni
Andrew Markham
3DPC
101
23
0
21 Mar 2022
FAR: Fourier Aerial Video Recognition
FAR: Fourier Aerial Video Recognition
D. Kothandaraman
Tianrui Guan
Xijun Wang
Sean Hu
Ming-Shun Lin
Tianyi Zhou
77
13
0
21 Mar 2022
PressureVision: Estimating Hand Pressure from a Single RGB Image
PressureVision: Estimating Hand Pressure from a Single RGB Image
Patrick Grady
Chengcheng Tang
Samarth Brahmbhatt
Christopher D. Twigg
Chengde Wan
James Hays
Charles C. Kemp
3DH
83
20
0
19 Mar 2022
DirecFormer: A Directed Attention in Transformer Approach to Robust
  Action Recognition
DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition
Thanh-Dat Truong
Quoc-Huy Bui
C. Duong
Han-Seok Seo
Son Lam Phung
Xin Li
Khoa Luu
ViT
132
51
0
19 Mar 2022
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal
  Generation
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation
Khoa T. Vo
Kashu Yamazaki
Sang Truong
M. Tran
Akihiro Sugimoto
Ngan Le
EgoV
71
9
0
16 Mar 2022
Gate-Shift-Fuse for Video Action Recognition
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
85
24
0
16 Mar 2022
Previous
123...101112...444546
Next