Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2108.02183
Cited By
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
4 August 2021
Rui Qian
Yuxi Li
Huabin Liu
John See
Shuangrui Ding
Xian Liu
Dian Li
Weiyao Lin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization"
21 / 71 papers shown
Title
Learning deep representations by mutual information estimation and maximization
R. Devon Hjelm
A. Fedorov
Samuel Lavoie-Marchildon
Karan Grewal
Phil Bachman
Adam Trischler
Yoshua Bengio
SSL
DRL
227
2,649
0
20 Aug 2018
Deep Clustering for Unsupervised Learning of Visual Features
Mathilde Caron
Piotr Bojanowski
Armand Joulin
Matthijs Douze
SSL
69
1,878
0
15 Jul 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
210
10,152
0
10 Jul 2018
Tracking Emerges by Colorizing Videos
Carl Vondrick
Abhinav Shrivastava
Alireza Fathi
S. Guadarrama
Kevin Patrick Murphy
69
376
0
25 Jun 2018
Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination
Zhirong Wu
Yuanjun Xiong
Stella X. Yu
Dahua Lin
SSL
136
3,437
0
05 May 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
111
317
0
30 Mar 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
125
1,317
0
13 Dec 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
68
1,035
0
22 Nov 2017
Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
3DPC
56
601
0
25 Aug 2017
Decomposing Motion and Content for Natural Video Sequence Prediction
Ruben Villegas
Jimei Yang
Seunghoon Hong
Xunyu Lin
Honglak Lee
44
595
0
25 Jun 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
80
1,021
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
186
7,961
0
22 May 2017
Unsupervised Learning of Long-Term Motion Dynamics for Videos
Zelun Luo
Boya Peng
De-An Huang
Alexandre Alahi
Li Fei-Fei
SSL
51
193
0
07 Jan 2017
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
42
328
0
23 Nov 2016
YouTube-8M: A Large-Scale Video Classification Benchmark
Sami Abu-El-Haija
Nisarg Kothari
Joonseok Lee
Apostol Natsev
G. Toderici
Balakrishnan Varadarajan
Sudheendra Vijayanarasimhan
VLM
82
1,264
0
27 Sep 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
82
3,814
0
02 Aug 2016
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
123
9,266
0
14 Dec 2015
FitNets: Hints for Thin Deep Nets
Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
C. Gatta
Yoshua Bengio
FedML
214
3,862
0
19 Dec 2014
How transferable are features in deep neural networks?
J. Yosinski
Jeff Clune
Yoshua Bengio
Hod Lipson
OOD
93
8,309
0
06 Nov 2014
Sinkhorn Distances: Lightspeed Computation of Optimal Transportation Distances
Marco Cuturi
OT
107
4,210
0
04 Jun 2013
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
84
6,100
0
03 Dec 2012
Previous
1
2