Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.09709
Cited By
Self-supervised Co-training for Video Representation Learning
19 October 2020
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Self-supervised Co-training for Video Representation Learning"
50 / 63 papers shown
Title
Pixel Motion as Universal Representation for Robot Control
Kanchana Ranasinghe
Xiang Li
Cristina Mata
J. Park
Michael S. Ryoo
VGen
29
0
0
12 May 2025
Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos
Jiaheng Zhou
Yanfeng Zhou
Wei Fang
Yuxing Tang
Le Lu
Ge Yang
Mamba
199
0
0
26 Mar 2025
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective
Zeen Song
Jingyao Wang
Jianqi Zhang
Changwen Zheng
Wenwen Qiang
SSL
56
0
0
19 Jul 2024
GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension
Jiafeng Liang
Shixin Jiang
Zekun Wang
Haojie Pan
Zerui Chen
Zheng Chu
Ming Liu
Ruiji Fu
Zhongyuan Wang
Bing Qin
29
2
0
26 Jun 2024
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering
Charig Yang
Weidi Xie
Andrew Zisserman
34
1
0
25 Apr 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu
Jaehong Yoon
Mohit Bansal
77
4
0
08 Feb 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie M. Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
VicTR: Video-conditioned Text Representations for Activity Recognition
Kumara Kahatapitiya
Anurag Arnab
Arsha Nagrani
Michael S. Ryoo
29
19
0
05 Apr 2023
Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition
Qianhui Men
Edmond S. L. Ho
Hubert P. H. Shum
Howard Leung
SSL
18
19
0
03 Apr 2023
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
H. Ryu
Arda Senocak
In So Kweon
Joon Son Chung
VLM
19
8
0
30 Mar 2023
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences
Christopher Lang
Alexander Braun
Lars Schillingmann
Karsten Haug
Abhinav Valada
SSL
17
10
0
17 Feb 2023
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
Anshul B. Shah
Benjamin Lundell
H. Sawhney
Ramalingam Chellappa
SSL
16
8
0
02 Jan 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
14
6
0
21 Dec 2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang
Deng Huang
Bin Wen
Jiannan Wu
H. Yao
Yi-Xin Jiang
Xiatian Zhu
Zehuan Yuan
24
19
0
09 Oct 2022
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization
Zdravko Marinov
Alina Roitberg
David Schneider
Rainer Stiefelhagen
22
4
0
19 Aug 2022
HyperNet: Self-Supervised Hyperspectral Spatial-Spectral Feature Understanding Network for Hyperspectral Change Detection
Meiqi Hu
Chen Wu
L. Zhang
SSL
26
57
0
20 Jul 2022
Balanced Contrastive Learning for Long-Tailed Visual Recognition
Jianggang Zhu
Z. Wang
Jingjing Chen
Yi-Ping Phoebe Chen
Yueping Jiang
24
167
0
19 Jul 2022
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
Sumanth Gurram
An Fang
David M. Chan
John F. Canny
VLM
AI4TS
28
1
0
16 Jul 2022
Federated Self-supervised Learning for Video Understanding
Yasar Abbas Ur Rehman
Yan Gao
Jiajun Shen
Pedro Porto Buarque de Gusmão
Nicholas D. Lane
FedML
25
15
0
05 Jul 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Sanat Ramesh
V. Srivastav
Deepak Alapatt
Tong Yu
Aditya Murali
...
Saurav Sharma
A. Fleurentin
Georgios Exarchakis
Alexandros Karargyris
N. Padoy
18
42
0
01 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
23
137
0
15 Jun 2022
Hyperspherical Consistency Regularization
Cheng Tan
Zhangyang Gao
Lirong Wu
Siyuan Li
Stan Z. Li
28
24
0
02 Jun 2022
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition
Haodong Duan
Nanxuan Zhao
Kai-xiang Chen
Dahua Lin
ViT
AI4TS
31
19
0
04 May 2022
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval
Yuying Ge
Yixiao Ge
Xihui Liu
Alex Jinpeng Wang
Jianping Wu
Ying Shan
Xiaohu Qie
Ping Luo
VLM
13
43
0
26 Apr 2022
Probabilistic Representations for Video Contrastive Learning
Jungin Park
Jiyoung Lee
Ig-Jae Kim
K. Sohn
SSL
23
43
0
08 Apr 2022
Frequency Selective Augmentation for Video Representation Learning
Jinhyung Kim
Taeoh Kim
Minho Shim
Dongyoon Han
Dongyoon Wee
Junmo Kim
AI4TS
41
3
0
08 Apr 2022
CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow
Xiuchao Sui
Shaohua Li
Xue Geng
Yan Wu
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
Hongyuan Zhu
ViT
26
95
0
31 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Zhan Tong
Yibing Song
Jue Wang
Limin Wang
ViT
137
1,122
0
23 Mar 2022
Object discovery and representation networks
Olivier J. Hénaff
Skanda Koppula
Evan Shelhamer
Daniel Zoran
Andrew Jaegle
Andrew Zisserman
João Carreira
Relja Arandjelović
44
87
0
16 Mar 2022
Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives
David T. Hoffmann
Nadine Behrmann
Juergen Gall
Thomas Brox
M. Noroozi
27
43
0
27 Jan 2022
Self-supervised Video Representation Learning with Cascade Positive Retrieval
Cheng-En Wu
Farley Lai
Yujie Hu
Asim Kadav
SSL
AI4TS
22
3
0
20 Jan 2022
Bridging Video-text Retrieval with Multiple Choice Questions
Yuying Ge
Yixiao Ge
Xihui Liu
Dian Li
Ying Shan
Xiaohu Qie
Ping Luo
BDL
18
108
0
13 Jan 2022
Motion-Focused Contrastive Learning of Video Representations
Rui Li
Yiheng Zhang
Zhaofan Qiu
Ting Yao
Dong Liu
Tao Mei
SSL
19
34
0
11 Jan 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks
A. Vasudevan
Dengxin Dai
Luc Van Gool
SSL
31
6
0
04 Jan 2022
Max-Margin Contrastive Learning
Anshul B. Shah
S. Sra
Ramalingam Chellappa
A. Cherian
SSL
21
44
0
21 Dec 2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Rui Qian
Yeqing Li
Liangzhe Yuan
Boqing Gong
Ting Liu
Matthew A. Brown
Serge J. Belongie
Ming-Hsuan Yang
Hartwig Adam
Yin Cui
AI4TS
43
6
0
08 Dec 2021
Time-Equivariant Contrastive Video Representation Learning
Simon Jenni
Hailin Jin
SSL
AI4TS
137
60
0
07 Dec 2021
Self-supervised Video Transformer
Kanchana Ranasinghe
Muzammal Naseer
Salman Khan
F. Khan
Michael S. Ryoo
ViT
30
84
0
02 Dec 2021
Towards Tokenized Human Dynamics Representation
Kenneth Li
Xiao Sun
Zhirong Wu
Fangyun Wei
Stephen Lin
11
2
0
22 Nov 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
G. Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
201
102
0
21 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and Challenges
Linus Ericsson
H. Gouk
Chen Change Loy
Timothy M. Hospedales
SSL
OOD
AI4TS
27
270
0
18 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Chongjian Ge
Youwei Liang
Yibing Song
Jianbo Jiao
Jue Wang
Ping Luo
ViT
16
36
0
11 Oct 2021
Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
Shuangrui Ding
Maomao Li
Tianyu Yang
Rui Qian
Haohang Xu
Qingyi Chen
Jue Wang
Hongkai Xiong
SSL
21
49
0
30 Sep 2021
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Mohammadreza Zolfaghari
Yi Zhu
Peter V. Gehler
Thomas Brox
132
127
0
30 Sep 2021
Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene
Huy Q. Vo
Tuong Khanh Long Do
Vinson Pham
Duy V.M. Nguyen
An T. Duong
Quang-Dieu Tran
17
3
0
07 Sep 2021
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
Rui Qian
Yuxi Li
Huabin Liu
John See
Shuangrui Ding
Xian Liu
Dian Li
Weiyao Lin
30
42
0
04 Aug 2021
Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification
Kecheng Zheng
Cuiling Lan
Wenjun Zeng
Jiawei Liu
Zhizheng Zhang
Zhengjun Zha
CVBM
31
63
0
31 Jul 2021
Self-supervised Representation Learning Framework for Remote Physiological Measurement Using Spatiotemporal Augmentation Loss
Hao Wang
E. Ahn
Jinman Kim
24
46
0
16 Jul 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
Martine Toering
Ioannis Gatopoulos
M. Stol
Vincent Tao Hu
SSL
27
11
0
18 Jun 2021
Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition
Ziyuan Huang
Zhiwu Qing
Xiang Wang
Yutong Feng
Shiwei Zhang
Jianwen Jiang
Zhurong Xia
Mingqian Tang
Nong Sang
M. Ang
ViT
19
11
0
09 Jun 2021
1
2
Next