ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.11248
  4. Cited By
A Closer Look at Spatiotemporal Convolutions for Action Recognition

A Closer Look at Spatiotemporal Convolutions for Action Recognition

30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
ArXivPDFHTML

Papers citing "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

50 / 1,270 papers shown
Title
CTM: Collaborative Temporal Modeling for Action Recognition
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
11
0
0
08 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action
  Recognition
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
31
63
0
08 Feb 2020
Learning Class Regularized Features for Action Recognition
Learning Class Regularized Features for Action Recognition
Alexandros Stergiou
R. Poppe
R. Veltkamp
12
3
0
07 Feb 2020
Human Action Performance using Deep Neuro-Fuzzy Recurrent Attention
  Model
Human Action Performance using Deep Neuro-Fuzzy Recurrent Attention Model
Nihar Bendre
Nima Ebadi
John J. Prevost
Paul Rad
HAI
22
24
0
29 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric
  Learning
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam
Richard J. Radke
27
46
0
21 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
34
2
0
21 Jan 2020
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action
  Recogntion
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion
Kaiyu Shan
Yongtao Wang
Zhuoying Wang
Tingting Liang
Zhi Tang
Ying-Cong Chen
Yangyan Li
AI4TS
28
4
0
19 Jan 2020
Temporal Interlacing Network
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
29
92
0
17 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Tianhao Li
Limin Wang
VGen
18
55
0
16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for
  Better Action Recognition
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition
Li Tao
Xueting Wang
T. Yamasaki
3DPC
22
24
0
16 Jan 2020
Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Dezhao Luo
Chang-rui Liu
Yu Zhou
Dongbao Yang
Can Ma
QiXiang Ye
Weiping Wang
SSL
25
160
0
02 Jan 2020
DMCL: Distillation Multiple Choice Learning for Multimodal Action
  Recognition
DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition
Nuno C. Garcia
Sarah Adel Bargal
Vitaly Ablavsky
Pietro Morerio
Vittorio Murino
Stan Sclaroff
27
48
0
23 Dec 2019
Something-Else: Compositional Action Recognition with Spatial-Temporal
  Interaction Networks
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
24
173
0
20 Dec 2019
Lower Dimensional Kernels for Video Discriminators
Lower Dimensional Kernels for Video Discriminators
Emmanuel Kahembwe
S. Ramamoorthy
32
50
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Mimetics: Towards Understanding Human Actions Out of Context
Philippe Weinzaepfel
Grégory Rogez
19
71
0
16 Dec 2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and
  Action Recognition in Ping Pong
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng Xu
Davide D‘Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
14
7
0
13 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action
  Recognition
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
24
177
0
11 Dec 2019
PuckNet: Estimating hockey puck location from broadcast video
PuckNet: Estimating hockey puck location from broadcast video
Kanav Vats
William J. McNally
Chris Dulhanty
Z. Q. Lin
David A Clausi
John S. Zelek
8
7
0
11 Dec 2019
Appending Adversarial Frames for Universal Video Attack
Appending Adversarial Frames for Universal Video Attack
Zhikai Chen
Lingxi Xie
Shanmin Pang
Yong He
Qi Tian
AAML
11
30
0
10 Dec 2019
Context-Dependent Models for Predicting and Characterizing Facial
  Expressiveness
Context-Dependent Models for Predicting and Characterizing Facial Expressiveness
Victoria Lin
J. Girard
Louis-Philippe Morency
11
8
0
10 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
Flow-Distilled IP Two-Stream Networks for Compressed Video Action
  Recognition
Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition
Shiyuan Huang
Xudong Lin
Svebor Karaman
Shih-Fu Chang
22
10
0
10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
Paritosh Parmar
B. Morris
3DPC
18
9
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal
  interactions
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
24
9
0
09 Dec 2019
Temporal Factorization of 3D Convolutional Kernels
Temporal Factorization of 3D Convolutional Kernels
Gabrielle Ras
L. Ambrogioni
Umut Güçlü
Marcel van Gerven
18
1
0
09 Dec 2019
DASZL: Dynamic Action Signatures for Zero-shot Learning
DASZL: Dynamic Action Signatures for Zero-shot Learning
Tae Soo Kim
Jonathan D. Jones
Michael Peven
Zihao Xiao
Jin Bai
Yi Zhang
Weichao Qiu
Alan Yuille
Gregory Hager
18
3
0
08 Dec 2019
Spatio-Temporal Pyramid Graph Convolutions for Human Action Recognition
  and Postural Assessment
Spatio-Temporal Pyramid Graph Convolutions for Human Action Recognition and Postural Assessment
Behnoosh Parsa
Athma Narayanan
Behzad Dariush
3DH
35
20
0
07 Dec 2019
ClusterFit: Improving Generalization of Visual Representations
ClusterFit: Improving Generalization of Visual Representations
Xueting Yan
Ishan Misra
Abhinav Gupta
Deepti Ghadiyaram
D. Mahajan
SSL
VLM
27
132
0
06 Dec 2019
RSA: Randomized Simulation as Augmentation for Robust Human Action
  Recognition
RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition
Yi Zhang
Xinyue Wei
Weichao Qiu
Zihao Xiao
Gregory Hager
Alan Yuille
22
6
0
03 Dec 2019
A Multigrid Method for Efficiently Training Video Models
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
29
94
0
02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little
  Network and Depthwise Temporal Aggregation
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
32
126
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
19
155
0
01 Dec 2019
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for
  Weather Forecasting
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for Weather Forecasting
Rafaela C. Nascimento
Y. M. Souto
Eduardo S. Ogasawara
Fábio Porto
Eduardo Bezerra
AI4TS
17
83
0
30 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
42
428
0
28 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
27
7
0
26 Nov 2019
Dynamical System Inspired Adaptive Time Stepping Controller for Residual
  Network Families
Dynamical System Inspired Adaptive Time Stepping Controller for Residual Network Families
Yibo Yang
Jianlong Wu
Hongyang Li
Xia Li
Tiancheng Shen
Zhouchen Lin
OOD
16
21
0
23 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
36
236
0
21 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion
MMTM: Multimodal Transfer Module for CNN Fusion
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
18
277
0
20 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed
  Domain
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain
Barak Battash
H. Barad
Hanlin Tang
Amit Bleiweiss
14
30
0
19 Nov 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time
  Spatiotemporal Action Localization
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
28
143
0
15 Nov 2019
Accelerating cardiac cine MRI using a deep learning-based ESPIRiT
  reconstruction
Accelerating cardiac cine MRI using a deep learning-based ESPIRiT reconstruction
Christopher M. Sandino
P. Lai
S. Vasanawala
Joseph Y. Cheng
16
3
0
13 Nov 2019
Chirality Nets for Human Pose Regression
Chirality Nets for Human Pose Regression
Raymond A. Yeh
Yuan-Ting Hu
Alex Schwing
3DH
19
48
0
31 Oct 2019
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding
  in Videos
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
Yitian Yuan
Lin Ma
Jingwen Wang
Wei Liu
Wenwu Zhu
30
242
0
31 Oct 2019
Comprehensive Video Understanding: Video summarization with
  content-based video recommender design
Comprehensive Video Understanding: Video summarization with content-based video recommender design
Yudong Jiang
Kaixu Cui
B. Peng
Changliang Xu
BDL
20
28
0
30 Oct 2019
Volterra Neural Networks (VNNs)
Volterra Neural Networks (VNNs)
Siddharth Roheda
Hamid Krim
16
10
0
21 Oct 2019
Vatex Video Captioning Challenge 2020: Multi-View Features and Hybrid
  Reward Strategies for Video Captioning
Vatex Video Captioning Challenge 2020: Multi-View Features and Hybrid Reward Strategies for Video Captioning
Xinxin Zhu
A. Gorban
V. A. Makarov
Shichen Lu
I. Tyukin
Hanqing Lu
13
2
0
17 Oct 2019
Tiny Video Networks
Tiny Video Networks
A. Piergiovanni
A. Angelova
Michael S. Ryoo
28
46
0
15 Oct 2019
TrajectoryNet: a new spatio-temporal feature learning network for human
  motion prediction
TrajectoryNet: a new spatio-temporal feature learning network for human motion prediction
Xiaoli Liu
Jianqin Yin
Jin Liu
Pengxiang Ding
Jun Liu
Huaping Liu
3DH
27
11
0
15 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal
  Reasoning
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Rohit Girdhar
Deva Ramanan
22
176
0
10 Oct 2019
Previous
123...2223242526
Next