Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1711.11248
Cited By
A Closer Look at Spatiotemporal Convolutions for Action Recognition
30 November 2017
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Closer Look at Spatiotemporal Convolutions for Action Recognition"
50 / 1,270 papers shown
Title
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
11
0
0
08 Feb 2020
Symbiotic Attention with Privileged Information for Egocentric Action Recognition
Xiaohan Wang
Yu Wu
Linchao Zhu
Yi Yang
31
63
0
08 Feb 2020
Learning Class Regularized Features for Action Recognition
Alexandros Stergiou
R. Poppe
R. Veltkamp
12
3
0
07 Feb 2020
Human Action Performance using Deep Neuro-Fuzzy Recurrent Attention Model
Nihar Bendre
Nima Ebadi
John J. Prevost
Paul Rad
HAI
22
24
0
29 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam
Richard J. Radke
27
46
0
21 Jan 2020
A Comprehensive Study on Temporal Modeling for Online Action Detection
Wen Wang
Xiaojiang Peng
Yu Qiao
Jian Cheng
34
2
0
21 Jan 2020
MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion
Kaiyu Shan
Yongtao Wang
Zhuoying Wang
Tingting Liang
Zhi Tang
Ying-Cong Chen
Yangyan Li
AI4TS
28
4
0
19 Jan 2020
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
29
92
0
17 Jan 2020
Learning Spatiotemporal Features via Video and Text Pair Discrimination
Tianhao Li
Limin Wang
VGen
18
55
0
16 Jan 2020
Rethinking Motion Representation: Residual Frames with 3D ConvNets for Better Action Recognition
Li Tao
Xueting Wang
T. Yamasaki
3DPC
22
24
0
16 Jan 2020
Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
Dezhao Luo
Chang-rui Liu
Yu Zhou
Dongbao Yang
Can Ma
QiXiang Ye
Weiping Wang
SSL
25
160
0
02 Jan 2020
DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition
Nuno C. Garcia
Sarah Adel Bargal
Vitaly Ablavsky
Pietro Morerio
Vittorio Murino
Stan Sclaroff
27
48
0
23 Dec 2019
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
24
173
0
20 Dec 2019
Lower Dimensional Kernels for Video Discriminators
Emmanuel Kahembwe
S. Ramamoorthy
32
50
0
18 Dec 2019
Mimetics: Towards Understanding Human Actions Out of Context
Philippe Weinzaepfel
Grégory Rogez
19
71
0
16 Dec 2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong
S. Schwarcz
Peng Xu
Davide D‘Ambrosio
Juhana Kangaspunta
A. Angelova
Huong Phan
Navdeep Jaitly
14
7
0
13 Dec 2019
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition
Jinwoo Choi
Chen Gao
Joseph C.E. Messou
Jia-Bin Huang
24
177
0
11 Dec 2019
PuckNet: Estimating hockey puck location from broadcast video
Kanav Vats
William J. McNally
Chris Dulhanty
Z. Q. Lin
David A Clausi
John S. Zelek
8
7
0
11 Dec 2019
Appending Adversarial Frames for Universal Video Attack
Zhikai Chen
Lingxi Xie
Shanmin Pang
Yong He
Qi Tian
AAML
11
30
0
10 Dec 2019
Context-Dependent Models for Predicting and Characterizing Facial Expressiveness
Victoria Lin
J. Girard
Louis-Philippe Morency
11
8
0
10 Dec 2019
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition
Shiyuan Huang
Xudong Lin
Svebor Karaman
Shih-Fu Chang
22
10
0
10 Dec 2019
HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN
Paritosh Parmar
B. Morris
3DPC
18
9
0
10 Dec 2019
Video action detection by learning graph-based spatio-temporal interactions
Matteo Tomei
Lorenzo Baraldi
Simone Calderara
Simone Bronzin
Rita Cucchiara
24
9
0
09 Dec 2019
Temporal Factorization of 3D Convolutional Kernels
Gabrielle Ras
L. Ambrogioni
Umut Güçlü
Marcel van Gerven
18
1
0
09 Dec 2019
DASZL: Dynamic Action Signatures for Zero-shot Learning
Tae Soo Kim
Jonathan D. Jones
Michael Peven
Zihao Xiao
Jin Bai
Yi Zhang
Weichao Qiu
Alan Yuille
Gregory Hager
18
3
0
08 Dec 2019
Spatio-Temporal Pyramid Graph Convolutions for Human Action Recognition and Postural Assessment
Behnoosh Parsa
Athma Narayanan
Behzad Dariush
3DH
35
20
0
07 Dec 2019
ClusterFit: Improving Generalization of Visual Representations
Xueting Yan
Ishan Misra
Abhinav Gupta
Deepti Ghadiyaram
D. Mahajan
SSL
VLM
27
132
0
06 Dec 2019
RSA: Randomized Simulation as Augmentation for Robust Human Action Recognition
Yi Zhang
Xinyue Wei
Weichao Qiu
Zihao Xiao
Gregory Hager
Alan Yuille
22
6
0
03 Dec 2019
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
29
94
0
02 Dec 2019
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
32
126
0
02 Dec 2019
Gate-Shift Networks for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
3DPC
19
155
0
01 Dec 2019
STConvS2S: Spatiotemporal Convolutional Sequence to Sequence Network for Weather Forecasting
Rafaela C. Nascimento
Y. M. Souto
Eduardo S. Ogasawara
Fábio Porto
Eduardo Bezerra
AI4TS
17
83
0
30 Nov 2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Humam Alwassel
D. Mahajan
Bruno Korbar
Lorenzo Torresani
Guohao Li
Du Tran
SSL
42
428
0
28 Nov 2019
Learning Efficient Video Representation with Video Shuffle Networks
Pingchuan Ma
Yao Zhou
Yu Lu
Wayne Zhang
27
7
0
26 Nov 2019
Dynamical System Inspired Adaptive Time Stepping Controller for Residual Network Families
Yibo Yang
Jianlong Wu
Hongyang Li
Xia Li
Tiancheng Shen
Zhouchen Lin
OOD
16
21
0
23 Nov 2019
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
36
236
0
21 Nov 2019
MMTM: Multimodal Transfer Module for CNN Fusion
Hamid Reza Vaezi Joze
Amirreza Shaban
Michael L. Iuzzolino
K. Koishida
18
277
0
20 Nov 2019
Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain
Barak Battash
H. Barad
Hanlin Tang
Amit Bleiweiss
14
30
0
19 Nov 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
28
143
0
15 Nov 2019
Accelerating cardiac cine MRI using a deep learning-based ESPIRiT reconstruction
Christopher M. Sandino
P. Lai
S. Vasanawala
Joseph Y. Cheng
16
3
0
13 Nov 2019
Chirality Nets for Human Pose Regression
Raymond A. Yeh
Yuan-Ting Hu
Alex Schwing
3DH
19
48
0
31 Oct 2019
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
Yitian Yuan
Lin Ma
Jingwen Wang
Wei Liu
Wenwu Zhu
30
242
0
31 Oct 2019
Comprehensive Video Understanding: Video summarization with content-based video recommender design
Yudong Jiang
Kaixu Cui
B. Peng
Changliang Xu
BDL
20
28
0
30 Oct 2019
Volterra Neural Networks (VNNs)
Siddharth Roheda
Hamid Krim
16
10
0
21 Oct 2019
Vatex Video Captioning Challenge 2020: Multi-View Features and Hybrid Reward Strategies for Video Captioning
Xinxin Zhu
A. Gorban
V. A. Makarov
Shichen Lu
I. Tyukin
Hanqing Lu
13
2
0
17 Oct 2019
Tiny Video Networks
A. Piergiovanni
A. Angelova
Michael S. Ryoo
28
46
0
15 Oct 2019
TrajectoryNet: a new spatio-temporal feature learning network for human motion prediction
Xiaoli Liu
Jianqin Yin
Jin Liu
Pengxiang Ding
Jun Liu
Huaping Liu
3DH
27
11
0
15 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Rohit Girdhar
Deva Ramanan
22
176
0
10 Oct 2019
Previous
1
2
3
...
22
23
24
25
26
Next