Self-supervised Co-training for Video Representation Learning

19 October 2020

Papers citing "Self-supervised Co-training for Video Representation Learning"

Pixel Motion as Universal Representation for Robot Control Kanchana Ranasinghe Xiang Li Cristina Mata J. Park Michael S. Ryoo VGen 29 0 0 12 May 2025
Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos Jiaheng Zhou Yanfeng Zhou Wei Fang Yuxing Tang Le Lu Ge Yang Mamba 196 0 0 26 Mar 2025
Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective Zeen Song Jingyao Wang Jianqi Zhang Changwen Zheng Wenwen Qiang SSL 56 0 0 19 Jul 2024
GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension Jiafeng Liang Shixin Jiang Zekun Wang Haojie Pan Zerui Chen Zheng Chu Ming Liu Ruiji Fu Zhongyuan Wang Bing Qin 29 2 0 26 Jun 2024
Made to Order: Discovering monotonic temporal changes via self-supervised video ordering Charig Yang Weidi Xie Andrew Zisserman 34 1 0 25 Apr 2024
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion Shoubin Yu Jaehong Yoon Mohit Bansal 77 4 0 08 Feb 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition Jie M. Zhang Zhifan Wan Lanqing Hu Stephen Lin Shuzhe Wu Shiguang Shan TTA 64 1 0 15 Jan 2024
VicTR: Video-conditioned Text Representations for Activity Recognition Kumara Kahatapitiya Anurag Arnab Arsha Nagrani Michael S. Ryoo 29 19 0 05 Apr 2023
Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition Qianhui Men Edmond S. L. Ho Hubert P. H. Shum Howard Leung SSL 18 19 0 03 Apr 2023
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples H. Ryu Arda Senocak In So Kweon Joon Son Chung VLM 19 8 0 30 Mar 2023
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences Christopher Lang Alexander Braun Lars Schillingmann Karsten Haug Abhinav Valada SSL 17 10 0 17 Feb 2023
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos Anshul B. Shah Benjamin Lundell H. Sawhney Ramalingam Chellappa SSL 16 8 0 02 Jan 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning J. Denize Jaonary Rabarisoa Astrid Orcesi Romain Hérault SSL 14 6 0 21 Dec 2022
Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders Haosen Yang Deng Huang Bin Wen Jiannan Wu H. Yao Yi-Xin Jiang Xiatian Zhu Zehuan Yuan 24 19 0 09 Oct 2022
ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization Zdravko Marinov Alina Roitberg David Schneider Rainer Stiefelhagen 22 4 0 19 Aug 2022
HyperNet: Self-Supervised Hyperspectral Spatial-Spectral Feature Understanding Network for Hyperspectral Change Detection Meiqi Hu Chen Wu L. Zhang SSL 26 57 0 20 Jul 2022
Balanced Contrastive Learning for Long-Tailed Visual Recognition Jianggang Zhu Z. Wang Jingjing Chen Yi-Ping Phoebe Chen Yueping Jiang 24 167 0 19 Jul 2022
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training Sumanth Gurram An Fang David M. Chan John F. Canny VLM AI4TS 28 1 0 16 Jul 2022
Federated Self-supervised Learning for Video Understanding Yasar Abbas Ur Rehman Yan Gao Jiajun Shen Pedro Porto Buarque de Gusmão Nicholas D. Lane FedML 22 15 0 05 Jul 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision Sanat Ramesh V. Srivastav Deepak Alapatt Tong Yu Aditya Murali ... Saurav Sharma A. Fleurentin Georgios Exarchakis Alexandros Karargyris N. Padoy 18 42 0 01 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning Benjamin Eysenbach Tianjun Zhang Ruslan Salakhutdinov Sergey Levine SSL OffRL 23 137 0 15 Jun 2022
Hyperspherical Consistency Regularization Cheng Tan Zhangyang Gao Lirong Wu Siyuan Li Stan Z. Li 28 24 0 02 Jun 2022
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition Haodong Duan Nanxuan Zhao Kai-xiang Chen Dahua Lin ViT AI4TS 31 19 0 04 May 2022
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval Yuying Ge Yixiao Ge Xihui Liu Alex Jinpeng Wang Jianping Wu Ying Shan Xiaohu Qie Ping Luo VLM 13 43 0 26 Apr 2022
Probabilistic Representations for Video Contrastive Learning Jungin Park Jiyoung Lee Ig-Jae Kim K. Sohn SSL 21 43 0 08 Apr 2022
Frequency Selective Augmentation for Video Representation Learning Jinhyung Kim Taeoh Kim Minho Shim Dongyoon Han Dongyoon Wee Junmo Kim AI4TS 41 3 0 08 Apr 2022
CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow Xiuchao Sui Shaohua Li Xue Geng Yan Wu Xinxing Xu Yong Liu Rick Siow Mong Goh Hongyuan Zhu ViT 26 95 0 31 Mar 2022
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training Zhan Tong Yibing Song Jue Wang Limin Wang ViT 125 1,122 0 23 Mar 2022
Object discovery and representation networks Olivier J. Hénaff Skanda Koppula Evan Shelhamer Daniel Zoran Andrew Jaegle Andrew Zisserman João Carreira Relja Arandjelović 38 87 0 16 Mar 2022
Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives David T. Hoffmann Nadine Behrmann Juergen Gall Thomas Brox M. Noroozi 25 43 0 27 Jan 2022
Self-supervised Video Representation Learning with Cascade Positive Retrieval Cheng-En Wu Farley Lai Yujie Hu Asim Kadav SSL AI4TS 20 3 0 20 Jan 2022
Bridging Video-text Retrieval with Multiple Choice Questions Yuying Ge Yixiao Ge Xihui Liu Dian Li Ying Shan Xiaohu Qie Ping Luo BDL 16 108 0 13 Jan 2022
Motion-Focused Contrastive Learning of Video Representations Rui Li Yiheng Zhang Zhaofan Qiu Ting Yao Dong Liu Tao Mei SSL 19 34 0 11 Jan 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks A. Vasudevan Dengxin Dai Luc Van Gool SSL 31 6 0 04 Jan 2022
Max-Margin Contrastive Learning Anshul B. Shah S. Sra Ramalingam Chellappa A. Cherian SSL 18 44 0 21 Dec 2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning Rui Qian Yeqing Li Liangzhe Yuan Boqing Gong Ting Liu Matthew A. Brown Serge J. Belongie Ming-Hsuan Yang Hartwig Adam Yin Cui AI4TS 41 6 0 08 Dec 2021
Time-Equivariant Contrastive Video Representation Learning Simon Jenni Hailin Jin SSL AI4TS 135 60 0 07 Dec 2021
Self-supervised Video Transformer Kanchana Ranasinghe Muzammal Naseer Salman Khan F. Khan Michael S. Ryoo ViT 26 84 0 02 Dec 2021
Towards Tokenized Human Dynamics Representation Kenneth Li Xiao Sun Zhirong Wu Fangyun Wei Stephen Lin 11 2 0 22 Nov 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP Andreas Fürst Elisabeth Rumetshofer Johannes Lehner Viet-Hung Tran Fei Tang ... David P. Kreil Michael K Kopp G. Klambauer Angela Bitto-Nemling Sepp Hochreiter VLM CLIP 199 102 0 21 Oct 2021
Self-Supervised Representation Learning: Introduction, Advances and Challenges Linus Ericsson H. Gouk Chen Change Loy Timothy M. Hospedales SSL OOD AI4TS 27 270 0 18 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning Chongjian Ge Youwei Liang Yibing Song Jianbo Jiao Jue Wang Ping Luo ViT 16 36 0 11 Oct 2021
Motion-aware Contrastive Video Representation Learning via Foreground-background Merging Shuangrui Ding Maomao Li Tianyu Yang Rui Qian Haohang Xu Qingyi Chen Jue Wang Hongkai Xiong SSL 18 49 0 30 Sep 2021
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Mohammadreza Zolfaghari Yi Zhu Peter V. Gehler Thomas Brox 132 127 0 30 Sep 2021
Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene Huy Q. Vo Tuong Khanh Long Do Vinson Pham Duy V.M. Nguyen An T. Duong Quang-Dieu Tran 17 3 0 07 Sep 2021
Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization Rui Qian Yuxi Li Huabin Liu John See Shuangrui Ding Xian Liu Dian Li Weiyao Lin 30 42 0 04 Aug 2021
Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification Kecheng Zheng Cuiling Lan Wenjun Zeng Jiawei Liu Zhizheng Zhang Zhengjun Zha CVBM 31 63 0 31 Jul 2021
Self-supervised Representation Learning Framework for Remote Physiological Measurement Using Spatiotemporal Augmentation Loss Hao Wang E. Ahn Jinman Kim 24 46 0 16 Jul 2021
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting Martine Toering Ioannis Gatopoulos M. Stol Vincent Tao Hu SSL 25 11 0 18 Jun 2021
Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition Ziyuan Huang Zhiwu Qing Xiang Wang Yutong Feng Shiwei Zhang Jianwen Jiang Zhurong Xia Mingqian Tang Nong Sang M. Ang ViT 19 11 0 09 Jun 2021