ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.11248
  4. Cited By
A Closer Look at Spatiotemporal Convolutions for Action Recognition

A Closer Look at Spatiotemporal Convolutions for Action Recognition

30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
ArXivPDFHTML

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown
Title
Graph-based Spatial-temporal Feature Learning for Neuromorphic Vision
  Sensing
Graph-based Spatial-temporal Feature Learning for Neuromorphic Vision Sensing
Yin Bi
Aaron Chadha
Alhabib Abbas
Eirina Bourtsoulatze
Y. Andreopoulos
25
26
0
08 Oct 2019
CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule
  Routing
CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing
Kevin Duarte
Yogesh S Rawat
M. Shah
VOS
14
68
0
30 Sep 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
20
19
0
30 Sep 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
130
150
0
28 Sep 2019
Scheduled Differentiable Architecture Search for Visual Recognition
Scheduled Differentiable Architecture Search for Visual Recognition
Zhaofan Qiu
Ting Yao
Yiheng Zhang
Yongdong Zhang
Tao Mei
OOD
20
3
0
23 Sep 2019
Target-Specific Action Classification for Automated Assessment of Human
  Motor Behavior from Video
Target-Specific Action Classification for Automated Assessment of Human Motor Behavior from Video
B. Rezaei
Yiorgos Christakis
B. Ho
Kevin C. Thomas
K. Erb
Sarah Ostadabbas
Shyamal Patel
31
11
0
20 Sep 2019
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Retro-Actions: Learning 'Close' by Time-Reversing Ópen' Videos
Will Price
Dima Damen
21
6
0
20 Sep 2019
Class Feature Pyramids for Video Explanation
Class Feature Pyramids for Video Explanation
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Poppe
R. Veltkamp
FAtt
33
18
0
18 Sep 2019
Multitask Learning to Improve Egocentric Action Recognition
Multitask Learning to Improve Egocentric Action Recognition
G. Kapidis
R. Poppe
E. V. Dam
L. Noldus
R. Veltkamp
EgoV
30
36
0
15 Sep 2019
Zero-Shot Action Recognition in Videos: A Survey
Zero-Shot Action Recognition in Videos: A Survey
Valter Estevam
Hélio Pedrini
David Menotti
33
57
0
13 Sep 2019
Exploring Temporal Differences in 3D Convolutional Neural Networks
Exploring Temporal Differences in 3D Convolutional Neural Networks
Gagan Kanojia
Sudhakar Kumawat
Shanmuganathan Raman
3DPC
AI4TS
21
3
0
07 Sep 2019
PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction
PISEP^2: Pseudo Image Sequence Evolution based 3D Pose Prediction
Xiaoli Liu
Jianqin Yin
Huaping Liu
Yilong Yin
3DH
32
7
0
04 Sep 2019
Explainable Video Action Reasoning via Prior Knowledge and State
  Transitions
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
Tao Zhuo
Zhiyong Cheng
Peng Zhang
Yongkang Wong
Mohan Kankanhalli
FAtt
33
60
0
28 Aug 2019
Fingerspelling recognition in the wild with iterative visual attention
Fingerspelling recognition in the wild with iterative visual attention
Bowen Shi
Aurora Martinez Del Rio
J. Keane
D. Brentari
G. Shakhnarovich
Karen Livescu
11
61
0
28 Aug 2019
Cooperative Cross-Stream Network for Discriminative Action
  Representation
Cooperative Cross-Stream Network for Discriminative Action Representation
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
31
5
0
27 Aug 2019
Action recognition with spatial-temporal discriminative filter banks
Action recognition with spatial-temporal discriminative filter banks
Brais Martínez
Davide Modolo
Yuanjun Xiong
Joseph Tighe
23
66
0
20 Aug 2019
Gradient Weighted Superpixels for Interpretability in CNNs
Gradient Weighted Superpixels for Interpretability in CNNs
Thomas Hartley
K. Sidorov
C. Willis
David Marshall
FAtt
7
3
0
16 Aug 2019
Einconv: Exploring Unexplored Tensor Network Decompositions for
  Convolutional Neural Networks
Einconv: Exploring Unexplored Tensor Network Decompositions for Convolutional Neural Networks
K. Hayashi
Taiki Yamaguchi
Yohei Sugawara
S. Maeda
21
52
0
13 Aug 2019
Enhanced 3D convolutional networks for crowd counting
Enhanced 3D convolutional networks for crowd counting
Zhikang Zou
Huiliang Shao
Xiaoye Qu
Wei Wei
Pan Zhou
33
38
0
12 Aug 2019
Is it Raining Outside? Detection of Rainfall using General-Purpose
  Surveillance Cameras
Is it Raining Outside? Detection of Rainfall using General-Purpose Surveillance Cameras
Joakim Bruslund Haurum
C. Bahnsen
T. Moeslund
17
9
0
12 Aug 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
STM: SpatioTemporal and Motion Encoding for Action Recognition
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
31
380
0
07 Aug 2019
Image to Video Domain Adaptation Using Web Supervision
Image to Video Domain Adaptation Using Web Supervision
Andrew Kae
Yale Song
16
5
0
05 Aug 2019
SF-Net: Structured Feature Network for Continuous Sign Language
  Recognition
SF-Net: Structured Feature Network for Continuous Sign Language Recognition
Zhaoyang Yang
Zhenmei Shi
Xiaoyong Shen
Yu-Wing Tai
SLR
27
63
0
04 Aug 2019
Action Recognition in Untrimmed Videos with Composite Self-Attention
  Two-Stream Framework
Action Recognition in Untrimmed Videos with Composite Self-Attention Two-Stream Framework
Dong Cao
Lisha Xu
Haibo Chen
ViT
10
3
0
04 Aug 2019
Two-Stream Video Classification with Cross-Modality Attention
Two-Stream Video Classification with Cross-Modality Attention
Lu Chi
Guiyu Tian
Yadong Mu
Qi Tian
21
22
0
01 Aug 2019
Use What You Have: Video Retrieval Using Representations From
  Collaborative Experts
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
36
387
0
31 Jul 2019
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective
  Untrimmed Video Recognition
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition
Wenhao Wu
Dongliang He
Xiao Tan
Shifeng Chen
Shilei Wen
15
127
0
31 Jul 2019
Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
Temporal Attentive Alignment for Large-Scale Video Domain Adaptation
Min-Hung Chen
Z. Kira
G. Al-Regib
Jaekwon Yoo
Ruxin Chen
Jian Zheng
TTA
AI4TS
21
179
0
30 Jul 2019
Remote Heart Rate Measurement from Highly Compressed Facial Videos: an
  End-to-end Deep Learning Solution with Video Enhancement
Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement
Zitong Yu
Wei Peng
Xiaobai Li
Xiaopeng Hong
Guoying Zhao
32
267
0
27 Jul 2019
Attention Filtering for Multi-person Spatiotemporal Action Detection on
  Deep Two-Stream CNN Architectures
Attention Filtering for Multi-person Spatiotemporal Action Detection on Deep Two-Stream CNN Architectures
João Antunes
Pedro Abreu
Alexandre Bernardino
A. Smailagic
D. Siewiorek
16
1
0
21 Jul 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
An Efficient 3D CNN for Action/Object Segmentation in Video
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
24
27
0
21 Jul 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
50
75
0
19 Jul 2019
Video Action Recognition Via Neural Architecture Searching
Video Action Recognition Via Neural Architecture Searching
Wei Peng
Xiaopeng Hong
Guoying Zhao
41
36
0
10 Jul 2019
Deformable Tube Network for Action Detection in Videos
Deformable Tube Network for Action Detection in Videos
Wei Li
Zehuan Yuan
Dashan Guo
Lei Huang
Xiangzhong Fang
Changhu Wang
ViT
MedIm
33
5
0
03 Jul 2019
Few-Shot Video Classification via Temporal Alignment
Few-Shot Video Classification via Temporal Alignment
Kaidi Cao
Jingwei Ji
Zhangjie Cao
C. Chang
Juan Carlos Niebles
AI4TS
27
235
0
27 Jun 2019
An Action Recognition network for specific target based on rMC and RPN
An Action Recognition network for specific target based on rMC and RPN
Mingjie Li
Youqian Feng
Zhonghai Yin
Cheng Zhou
Fanghao Dong
Yuanze Lin
Yuhao Dong
21
1
0
19 Jun 2019
Factorized Higher-Order CNNs with an Application to Spatio-Temporal
  Emotion Estimation
Factorized Higher-Order CNNs with an Application to Spatio-Temporal Emotion Estimation
Jean Kossaifi
Antoine Toisoul
Adrian Bulat
Yannis Panagakis
Timothy M. Hospedales
M. Pantic
CVBM
15
80
0
14 Jun 2019
Learning Spatio-Temporal Representation with Local and Global Diffusion
Learning Spatio-Temporal Representation with Local and Global Diffusion
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
11
169
0
13 Jun 2019
HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow
  Estimation on Large-scale Point Clouds
HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-scale Point Clouds
Xiuye Gu
Yijie Wang
Chongruo Wu
Yong Jae Lee
Panqu Wang
3DPC
32
206
0
12 Jun 2019
FASTER Recurrent Networks for Efficient Video Classification
FASTER Recurrent Networks for Efficient Video Classification
Linchao Zhu
Laura Sevilla-Lara
Du Tran
Matt Feiszli
Yi Yang
Heng Wang
49
6
0
10 Jun 2019
UniDual: A Unified Model for Image and Video Understanding
UniDual: A Unified Model for Image and Video Understanding
Yufei Wang
Du Tran
Lorenzo Torresani
21
2
0
10 Jun 2019
Video Modeling with Correlation Networks
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
24
127
0
07 Jun 2019
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video
  Architectures
AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures
Michael S. Ryoo
A. Piergiovanni
Mingxing Tan
A. Angelova
24
102
0
30 May 2019
Dynamic Traffic Scene Classification with Space-Time Coherence
Dynamic Traffic Scene Classification with Space-Time Coherence
Athma Narayanan
Isht Dwivedi
Behzad Dariush
11
23
0
29 May 2019
What Makes Training Multi-Modal Classification Networks Hard?
What Makes Training Multi-Modal Classification Networks Hard?
Weiyao Wang
Du Tran
Matt Feiszli
31
442
0
29 May 2019
Hierarchical Feature Aggregation Networks for Video Action Recognition
Hierarchical Feature Aggregation Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
FAtt
16
8
0
29 May 2019
Unsupervised Learning from Video with Deep Neural Embeddings
Unsupervised Learning from Video with Deep Neural Embeddings
Chengxu Zhuang
Tianwei She
A. Andonian
Max Sobol Mark
Daniel L. K. Yamins
SSL
17
56
0
28 May 2019
Lightweight Network Architecture for Real-Time Action Recognition
Lightweight Network Architecture for Real-Time Action Recognition
Alexander Kozlov
Vadim Andronov
Y. Gritsenko
ViT
25
33
0
21 May 2019
STAR: A Concise Deep Learning Framework for Citywide Human Mobility
  Prediction
STAR: A Concise Deep Learning Framework for Citywide Human Mobility Prediction
Hongnian Wang
Han Su
HAI
26
17
0
16 May 2019
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
VideoGraph: Recognizing Minutes-Long Human Activities in Videos
Noureldien Hussein
E. Gavves
A. Smeulders
28
77
0
13 May 2019
Previous
123...23242526
Next