Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2201.04023
Cited By
Boosting Video Representation Learning with Multi-Faceted Integration
11 January 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Xiaoping Zhang
Dong Wu
Tao Mei
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Boosting Video Representation Learning with Multi-Faceted Integration"
13 / 13 papers shown
Title
Self-supervised Temporal Discriminative Learning for Video Representation Learning
Jinpeng Wang
Yiqi Lin
A. J. Ma
Pong C. Yuen
TTA
34
11
0
05 Aug 2020
SeCo: Exploring Sequence Supervision for Unsupervised Representation Learning
Ting Yao
Yiheng Zhang
Zhaofan Qiu
Yingwei Pan
Tao Mei
DRL
80
110
0
03 Aug 2020
Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Ziqi Zhang
Yaya Shi
Chunfen Yuan
Bing Li
Peijin Wang
Weiming Hu
Zhengjun Zha
VLM
67
271
0
26 Feb 2020
More Is Less: Learning Efficient Video Representations by Big-Little Network and Depthwise Temporal Aggregation
Quanfu Fan
Chun-Fu Chen
Hilde Kuehne
Marco Pistoia
David D. Cox
50
126
0
02 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
113
12,007
0
13 Nov 2019
Gaussian Temporal Awareness Networks for Action Localization
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
179
320
0
09 Sep 2019
STM: SpatioTemporal and Motion Encoding for Action Recognition
Boyuan Jiang
Mengmeng Wang
Weihao Gan
Wei Wu
Junjie Yan
64
381
0
07 Aug 2019
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
133
1,317
0
13 Dec 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
202
7,961
0
22 May 2017
Temporal Segment Networks for Action Recognition in Videos
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
83
807
0
08 May 2017
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
127
2,606
0
22 Apr 2016
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM
BDL
3DV
198
14,703
0
20 Jun 2014
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
225
7,518
0
09 Jun 2014
1