Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05571
Cited By
Learning Spatio-Temporal Representation with Local and Global Diffusion
13 June 2019
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Spatio-Temporal Representation with Local and Global Diffusion"
33 / 33 papers shown
Title
Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
Xuzheng Yu
Chen Jiang
Wei Zhang
Tian Gan
Linlin Chao
Jianan Zhao
Yuan Cheng
Qingpei Guo
Wei Chu
86
0
0
09 Jan 2024
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
75
65
0
11 Aug 2018
YH Technologies at ActivityNet Challenge 2018
Ting Yao
Xue Li
46
11
0
29 Jun 2018
Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition
Dongliang He
Fu Li
Qijie Zhao
Xiang Long
Yi Fu
Shilei Wen
55
18
0
27 Jun 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
149
1,333
0
13 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
228
3,033
0
30 Nov 2017
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks
Zhaofan Qiu
Ting Yao
Tao Mei
96
1,663
0
28 Nov 2017
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
126
1,936
0
27 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
289
8,916
0
21 Nov 2017
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
427
26,539
0
05 Sep 2017
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification
Yunlong Bian
Chuang Gan
Xiao-Chang Liu
Fu Li
Xiang Long
Yandong Li
Heng Qi
Jie Zhou
Shilei Wen
Yuanqing Lin
55
48
0
12 Aug 2017
Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation
Zhenheng Yang
J. Gao
Ram Nevatia
48
57
0
31 Jul 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
235
8,037
0
22 May 2017
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
114
812
0
08 May 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
66
325
0
04 May 2017
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
Rui Hou
Chong Chen
M. Shah
MedIm
74
334
0
30 Mar 2017
Deep Quantization: Encoding Convolutional Activations with Deep Generative Model
Zhaofan Qiu
Ting Yao
Tao Mei
DRL
MQ
72
60
0
29 Nov 2016
Online Real-time Multiple Spatiotemporal Action Localisation and Prediction
Gurkirt Singh
Suman Saha
Michael Sapienza
Philip Torr
Fabio Cuzzolin
73
288
0
25 Nov 2016
Deep Temporal Linear Encoding Networks
Ali Diba
Vivek Sharma
Luc Van Gool
60
228
0
21 Nov 2016
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos
Suman Saha
Gurkirt Singh
Michael Sapienza
Philip Torr
Fabio Cuzzolin
ViT
80
209
0
04 Aug 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
108
3,838
0
02 Aug 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
166
2,612
0
22 Apr 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,322
0
10 Dec 2015
Compact Bilinear Pooling
Yang Gao
Oscar Beijbom
Ning Zhang
Trevor Darrell
83
791
0
19 Nov 2015
Learning to track for spatio-temporal action localization
Philippe Weinzaepfel
Zaïd Harchaoui
Cordelia Schmid
107
339
0
05 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
525
62,360
0
04 Jun 2015
Bilinear CNNs for Fine-grained Visual Recognition
Tsung-Yu Lin
Aruni RoyChowdhury
Subhransu Maji
133
1,875
0
29 Apr 2015
Beyond Short Snippets: Deep Networks for Video Classification
Joe Yue-Hei Ng
Matthew J. Hausknecht
Sudheendra Vijayanarasimhan
Oriol Vinyals
R. Monga
G. Toderici
145
2,338
0
31 Mar 2015
Unsupervised Learning of Video Representations using LSTMs
Nitish Srivastava
Elman Mansimov
Ruslan Salakhutdinov
SSL
140
2,593
0
16 Feb 2015
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
1.7K
39,590
0
01 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM
BDL
3DV
280
14,713
0
20 Jun 2014
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
250
7,541
0
09 Jun 2014
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
160
6,164
0
03 Dec 2012
1