Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.05571
Cited By
Learning Spatio-Temporal Representation with Local and Global Diffusion
13 June 2019
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Spatio-Temporal Representation with Local and Global Diffusion"
36 / 86 papers shown
Title
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Huayu Chen
Boqing Gong
ViT
339
594
0
22 Apr 2021
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang
Haoqi Fan
Lorenzo Torresani
L. Davis
Heng Wang
VLM
84
21
0
02 Apr 2021
Self-supervised Motion Learning from Static Images
Ziyuan Huang
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Rong Jin
M. Ang
SSL
59
29
0
01 Apr 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
240
2,175
0
29 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
413
2,075
0
09 Feb 2021
Topological Deep Learning
Ephy R. Love
Benjamin Filippenko
Vasileios Maroulas
Gunnar Carlsson
82
11
0
14 Jan 2021
Refining activation downsampling with SoftPool
Alexandros Stergiou
R. Poppe
Grigorios Kalliatakis
87
163
0
02 Jan 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
132
401
0
18 Dec 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
50
0
0
25 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
69
1
0
08 Nov 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
117
31
0
20 Oct 2020
Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Junfu Pu
Wen-gang Zhou
Hezhen Hu
Houqiang Li
99
114
0
11 Oct 2020
PERF-Net: Pose Empowered RGB-Flow Net
Yinxiao Li
Zhichao Lu
Xuehan Xiong
Jonathan Huang
3DH
81
17
0
28 Sep 2020
Learning to Localize Actions from Moments
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
59
17
0
31 Aug 2020
Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Hezhen Hu
Wen-gang Zhou
Junfu Pu
Houqiang Li
SLR
81
54
0
24 Aug 2020
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
Yuxi Li
Weiyao Lin
John See
N. Xu
Shugong Xu
Ke Yan
Cong Yang
358
17
0
19 Aug 2020
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
Ting Yao
Yiheng Zhang
Zhaofan Qiu
Yingwei Pan
Tao Mei
DRL
132
110
0
03 Aug 2020
Hierarchical Contrastive Motion Learning for Video Action Recognition
Xitong Yang
Xiaodong Yang
Sifei Liu
Deqing Sun
L. Davis
Jan Kautz
SSL
99
13
0
20 Jul 2020
Region-based Non-local Operation for Video Classification
Guoxi Huang
A. Bors
75
11
0
17 Jul 2020
Temporal Distinct Representation Learning for Action Recognition
Junwu Weng
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Xudong Jiang
Junsong Yuan
76
26
0
15 Jul 2020
Single Shot Video Object Detector
Jiajun Deng
Yingwei Pan
Ting Yao
Wen-gang Zhou
Houqiang Li
Tao Mei
ObjD
89
41
0
07 Jul 2020
Learn to cycle: Time-consistent feature discovery for action recognition
Alexandros Stergiou
R. Poppe
52
23
0
15 Jun 2020
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Yizhou Zhou
Xiaoyan Sun
Chong Luo
Zhengjun Zha
Wenjun Zeng
3DPC
65
20
0
10 Apr 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
102
450
0
03 Apr 2020
Omni-sourced Webly-supervised Learning for Video Recognition
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
99
88
0
29 Mar 2020
Weakly-Supervised Action Localization by Generative Attention Modeling
Baifeng Shi
Qi Dai
Yadong Mu
Jingdong Wang
WSOL
86
149
0
27 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
34
5
0
18 Mar 2020
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
19
0
0
08 Feb 2020
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
96
243
0
21 Nov 2019
Class Feature Pyramids for Video Explanation
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Poppe
R. Veltkamp
FAtt
54
19
0
18 Sep 2019
vireoJD-MM at Activity Detection in Extended Videos
Fuchen Long
Qi Cai
Zhaofan Qiu
Zhijian Hou
Yingwei Pan
Ting Yao
Chong-Wah Ngo
25
4
0
20 Jun 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Zhaofan Qiu
Dong Li
Yehao Li
Qi Cai
Yingwei Pan
Ting Yao
43
8
0
14 Jun 2019
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
101
129
0
07 Jun 2019
Decoupling Localization and Classification in Single Shot Temporal Action Detection
Yupan Huang
Qi Dai
Yutong Lu
59
46
0
16 Apr 2019
Higher-order Network for Action Recognition
Jie Shao
Xiangyang Xue
34
0
0
19 Nov 2018
Human Action Recognition and Prediction: A Survey
Yu Kong
Y. Fu
95
632
0
28 Jun 2018
Previous
1
2