Learning Spatio-Temporal Representation with Local and Global Diffusion

13 June 2019
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xinmei Tian
Tao Mei
arXiv:1906.05571

Papers citing "Learning Spatio-Temporal Representation with Local and Global Diffusion"

36 / 86 papers shown
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Huayu Chen
Boqing Gong
ViT
339
594
0
22 Apr 2021
Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories
Xitong Yang
Haoqi Fan
Lorenzo Torresani
L. Davis
Heng Wang
VLM
84
21
0
02 Apr 2021
Self-supervised Motion Learning from Static Images
Ziyuan Huang
Shiwei Zhang
Jianwen Jiang
Mingqian Tang
Rong Jin
M. Ang
SSL
59
29
0
01 Apr 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
240
2,175
0
29 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
413
2,075
0
09 Feb 2021
Topological Deep Learning
Ephy R. Love
Benjamin Filippenko
Vasileios Maroulas
Gunnar Carlsson
82
11
0
14 Jan 2021
Refining activation downsampling with SoftPool
Alexandros Stergiou
R. Poppe
Grigorios Kalliatakis
87
163
0
02 Jan 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
132
401
0
18 Dec 2020
Recent Progress in Appearance-based Action Recognition
J. Humphreys
Zhe Chen
Dacheng Tao
50
0
0
25 Nov 2020
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
69
1
0
08 Nov 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
117
31
0
20 Oct 2020
Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Junfu Pu
Wen-gang Zhou
Hezhen Hu
Houqiang Li
99
114
0
11 Oct 2020
PERF-Net: Pose Empowered RGB-Flow Net
Yinxiao Li
Zhichao Lu
Xuehan Xiong
Jonathan Huang
3DH
81
17
0
28 Sep 2020
Learning to Localize Actions from Moments
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
59
17
0
31 Aug 2020
Global-local Enhancement Network for NMFs-aware Sign Language Recognition
Hezhen Hu
Wen-gang Zhou
Junfu Pu
Houqiang Li
SLR
81
54
0
24 Aug 2020
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization
Yuxi Li
Weiyao Lin
John See
N. Xu
Shugong Xu
Ke Yan
Cong Yang
358
17
0
19 Aug 2020
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
Ting Yao
Yiheng Zhang
Zhaofan Qiu
Yingwei Pan
Tao Mei
DRL
132
110
0
03 Aug 2020
Hierarchical Contrastive Motion Learning for Video Action Recognition
Xitong Yang
Xiaodong Yang
Sifei Liu
Deqing Sun
L. Davis
Jan Kautz
SSL
99
13
0
20 Jul 2020
Region-based Non-local Operation for Video Classification
Guoxi Huang
A. Bors
75
11
0
17 Jul 2020
Temporal Distinct Representation Learning for Action Recognition
Junwu Weng
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Xudong Jiang
Junsong Yuan
76
26
0
15 Jul 2020
Single Shot Video Object Detector
Jiajun Deng
Yingwei Pan
Ting Yao
Wen-gang Zhou
Houqiang Li
Tao Mei
ObjD
89
41
0
07 Jul 2020
Learn to cycle: Time-consistent feature discovery for action recognition
Alexandros Stergiou
R. Poppe
52
23
0
15 Jun 2020
Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Yizhou Zhou
Xiaoyan Sun
Chong Luo
Zhengjun Zha
Wenjun Zeng
3DPC
65
20
0
10 Apr 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
102
450
0
03 Apr 2020
Omni-sourced Webly-supervised Learning for Video Recognition
Haodong Duan
Yue Zhao
Yuanjun Xiong
Wentao Liu
Dahua Lin
VLM
99
88
0
29 Mar 2020
Weakly-Supervised Action Localization by Generative Attention Modeling
Baifeng Shi
Qi Dai
Yadong Mu
Jingdong Wang
WSOL
86
149
0
27 Mar 2020
STH: Spatio-Temporal Hybrid Convolution for Efficient Action Recognition
Xu Li
Jingwen Wang
Lin Ma
Kaihao Zhang
Fengzong Lian
Zhanhui Kang
Jinjun Wang
34
5
0
18 Mar 2020
CTM: Collaborative Temporal Modeling for Action Recognition
Li-Yu Daisy Liu
Tao Wang
Jie Liu
Yang Guan
Qi Bu
Longfei Yang
TTA
19
0
0
08 Feb 2020
TEINet: Towards an Efficient Architecture for Video Recognition
Zhaoyang Liu
Donghao Luo
Yabiao Wang
Limin Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Tong Lu
ViT
96
243
0
21 Nov 2019
Class Feature Pyramids for Video Explanation
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Poppe
R. Veltkamp
FAtt
54
19
0
18 Sep 2019
vireoJD-MM at Activity Detection in Extended Videos
Fuchen Long
Qi Cai
Zhaofan Qiu
Zhijian Hou
Yingwei Pan
Ting Yao
Chong-Wah Ngo
25
4
0
20 Jun 2019
Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019
Zhaofan Qiu
Dong Li
Yehao Li
Qi Cai
Yingwei Pan
Ting Yao
43
8
0
14 Jun 2019
Video Modeling with Correlation Networks
Heng Wang
Du Tran
Lorenzo Torresani
Matt Feiszli
101
129
0
07 Jun 2019
Decoupling Localization and Classification in Single Shot Temporal Action Detection
Yupan Huang
Qi Dai
Yutong Lu
59
46
0
16 Apr 2019
Higher-order Network for Action Recognition
Jie Shao
Xiangyang Xue
34
0
0
19 Nov 2018
Human Action Recognition and Prediction: A Survey
Yu Kong
Y. Fu
95
632
0
28 Jun 2018