ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.09795
  4. Cited By
Self-Supervised Video Representation Learning with Space-Time Cubic
  Puzzles

Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles

24 November 2018
Dahun Kim
Donghyeon Cho
In So Kweon
    SSL
ArXivPDFHTML

Papers citing "Self-Supervised Video Representation Learning with Space-Time Cubic Puzzles"

50 / 197 papers shown
Title
Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction
Enhancing Self-Supervised Fine-Grained Video Object Tracking with Dynamic Memory Prediction
Zihan Zhou
Changrui Dai
Aibo Song
Xiaolin Fang
VOS
54
0
0
30 Apr 2025
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
262
0
0
08 Apr 2025
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
SEVERE++: Evaluating Benchmark Sensitivity in Generalization of Video Representation Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Piyush Bagad
Hazel Doughty
Bernard Ghanem
Cees G. M. Snoek
ViT
SSL
54
0
0
08 Apr 2025
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker
Letian Jiang
Chen Zhao
Bernard Ghanem
62
0
0
01 Apr 2025
Joint Self-Supervised Video Alignment and Action Segmentation
Joint Self-Supervised Video Alignment and Action Segmentation
Ali Shah Ali
Syed Ahmed Mahmood
Mubin Saeed
Andrey Konin
M. Zia
Quoc-Huy Tran
OT
75
0
0
21 Mar 2025
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning
Zihan Zhoua
Changrui Daia
Aibo Songa
Xiaolin Fang
VOS
69
0
0
15 Mar 2025
Data Collection-free Masked Video Modeling
Data Collection-free Masked Video Modeling
Yuchi Ishikawa
Masayoshi Kondo
Yoshimitsu Aoki
ViT
19
1
0
10 Sep 2024
SIGMA:Sinkhorn-Guided Masked Video Modeling
SIGMA:Sinkhorn-Guided Masked Video Modeling
Mohammadreza Salehi
Michael Dorkenwald
Fida Mohammad Thoker
E. Gavves
Cees G. M. Snoek
Yuki M. Asano
55
3
0
22 Jul 2024
Strategies for Pretraining Neural Operators
Strategies for Pretraining Neural Operators
Anthony Zhou
Cooper Lorsung
AmirPouya Hemmasian
Amir Barati Farimani
AI4CE
43
6
0
12 Jun 2024
From CNNs to Transformers in Multimodal Human Action Recognition: A
  Survey
From CNNs to Transformers in Multimodal Human Action Recognition: A Survey
Muhammad Bilal Shaikh
Syed Mohammed Shamsul Islam
Douglas Chai
Naveed Akhtar
35
9
0
22 May 2024
JOSENet: A Joint Stream Embedding Network for Violence Detection in
  Surveillance Videos
JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos
Pietro Nardelli
Danilo Comminiello
32
0
0
05 May 2024
Solving Masked Jigsaw Puzzles with Diffusion Vision Transformers
Solving Masked Jigsaw Puzzles with Diffusion Vision Transformers
Jinyang Liu
Wondmgezahu Teshome
S. Ghimire
Octavia Camps
Mario Sznaier
DiffM
31
1
0
10 Apr 2024
Patch Spatio-Temporal Relation Prediction for Video Anomaly Detection
Patch Spatio-Temporal Relation Prediction for Video Anomaly Detection
Hao Shen
Lu Shi
Wanru Xu
Yigang Cen
Linna Zhang
Gaoyun An
ViT
33
0
0
28 Mar 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
Bootstrap Masked Visual Modeling via Hard Patches Mining
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
47
5
0
21 Dec 2023
Unearthing Common Inconsistency for Generalisable Deepfake Detection
Unearthing Common Inconsistency for Generalisable Deepfake Detection
Beilin Chu
Xuan Xu
Weike You
Linna Zhou
32
0
0
20 Nov 2023
United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure
  Learning from Videos
United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos
Siddhant Bansal
Chetan Arora
C. V. Jawahar
68
6
0
06 Nov 2023
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video
  Representation Learning
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning
Minghao Zhu
Xiao Lin
Ronghao Dang
Chengju Liu
Qi Chen
VGen
35
8
0
01 Sep 2023
Towards Real-World Visual Tracking with Temporal Contexts
Towards Real-World Visual Tracking with Temporal Contexts
Ziang Cao
Ziyuan Huang
Liang Pan
Shiwei Zhang
Ziwei Liu
Changhong Fu
42
42
0
20 Aug 2023
Temporal DINO: A Self-supervised Video Strategy to Enhance Action
  Prediction
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction
Izzeddin Teeti
Rongali Sai Bhargav
Vivek Singh
Andrew Bradley
Biplab Banerjee
Fabio Cuzzolin
19
1
0
08 Aug 2023
A survey on deep learning in medical image registration: new
  technologies, uncertainty, evaluation metrics, and beyond
A survey on deep learning in medical image registration: new technologies, uncertainty, evaluation metrics, and beyond
Junyu Chen
Yihao Liu
Shuwen Wei
Zhangxing Bian
Shalini Subramanian
A. Carass
Jerry L. Prince
Yong Du
OOD
45
36
0
28 Jul 2023
Query-based Video Summarization with Pseudo Label Supervision
Query-based Video Summarization with Pseudo Label Supervision
Jia-Hong Huang
L. Murn
M. Mrak
M. Worring
36
7
0
04 Jul 2023
A Large-Scale Analysis on Self-Supervised Video Representation Learning
A Large-Scale Analysis on Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Yogesh S Rawat
SSL
28
3
0
09 Jun 2023
HomE: Homography-Equivariant Video Representation Learning
HomE: Homography-Equivariant Video Representation Learning
Anirudh Sriram
Adrien Gaidon
Jiajun Wu
Juan Carlos Niebles
L. Fei-Fei
Ehsan Adeli
SSL
AI4TS
33
2
0
02 Jun 2023
Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion
Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion
Quoc-Huy Tran
Muhammad Ahmed
Murad Popattia
M. Hassan
Ahmed Andrey
Konin M. Zeeshan
AI4TS
34
3
0
31 May 2023
Action Sensitivity Learning for Temporal Action Localization
Action Sensitivity Learning for Temporal Action Localization
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
33
22
0
25 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health
  Management: A Survey and Roadmaps
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
29
46
0
10 May 2023
Self-Supervised Video Representation Learning via Latent Time Navigation
Self-Supervised Video Representation Learning via Latent Time Navigation
Di Yang
Yaohui Wang
Quan Kong
A. Dantcheva
Lorenzo Garattoni
Gianpiero Francesca
F. Brémond
SSL
AI4TS
49
10
0
10 May 2023
Verbs in Action: Improving verb understanding in video-language models
Verbs in Action: Improving verb understanding in video-language models
Liliane Momeni
Mathilde Caron
Arsha Nagrani
Andrew Zisserman
Cordelia Schmid
37
70
0
13 Apr 2023
Self-Supervised Video Similarity Learning
Self-Supervised Video Similarity Learning
Giorgos Kordopatis-Zilos
Giorgos Tolias
Christos Tzelepis
I. Kompatsiaris
Ioannis Patras
Symeon Papadopoulos
SSL
37
8
0
06 Apr 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
36
39
0
31 Mar 2023
Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos
Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos
D. Fan
De-Yun Yang
Xinyu Li
Vimal Bhat
M. Rohith
SSL
27
1
0
13 Mar 2023
TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and
  Clustering
TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering
Wei Lin
Anna Kukleva
Horst Possegger
Hilde Kuehne
Horst Bischof
48
2
0
09 Mar 2023
Video Action Recognition Collaborative Learning with Dynamics via
  PSO-ConvNet Transformer
Video Action Recognition Collaborative Learning with Dynamics via PSO-ConvNet Transformer
N. H. Phong
B. Ribeiro
29
15
0
17 Feb 2023
Audio-Visual Contrastive Learning with Temporal Self-Supervision
Audio-Visual Contrastive Learning with Temporal Self-Supervision
Simon Jenni
Alexander Black
John Collomosse
SSL
31
15
0
15 Feb 2023
Weakly-supervised Representation Learning for Video Alignment and
  Analysis
Weakly-supervised Representation Learning for Video Alignment and Analysis
Guy Bar-Shalom
G. Leifman
Michael Elad
Ehud Rivlin
21
2
0
08 Feb 2023
Test of Time: Instilling Video-Language Models with a Sense of Time
Test of Time: Instilling Video-Language Models with a Sense of Time
Piyush Bagad
Makarand Tapaswi
Cees G. M. Snoek
86
36
0
05 Jan 2023
STEPs: Self-Supervised Key Step Extraction and Localization from
  Unlabeled Procedural Videos
STEPs: Self-Supervised Key Step Extraction and Localization from Unlabeled Procedural Videos
Anshul B. Shah
Benjamin Lundell
H. Sawhney
Ramalingam Chellappa
SSL
21
8
0
02 Jan 2023
Similarity Contrastive Estimation for Image and Video Soft Contrastive
  Self-Supervised Learning
Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning
J. Denize
Jaonary Rabarisoa
Astrid Orcesi
Romain Hérault
SSL
19
6
0
21 Dec 2022
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video
  Representation Learning
XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning
Pritam Sarkar
Ali Etemad
19
20
0
25 Nov 2022
Learning State-Aware Visual Representations from Audible Interactions
Learning State-Aware Visual Representations from Audible Interactions
Himangi Mittal
Pedro Morgado
Unnat Jain
Abhinav Gupta
78
23
0
27 Sep 2022
Self-supervised Learning for Unintentional Action Prediction
Self-supervised Learning for Unintentional Action Prediction
Olga Zatsarynna
Yazan Abu Farha
Juergen Gall
SSL
44
8
0
24 Sep 2022
Leveraging Self-Supervised Training for Unintentional Action Recognition
Leveraging Self-Supervised Training for Unintentional Action Recognition
Enea Duka
Anna Kukleva
Bernt Schiele
38
1
0
23 Sep 2022
Temporal Contrastive Learning with Curriculum
Temporal Contrastive Learning with Curriculum
Shuvendu Roy
Ali Etemad
43
3
0
02 Sep 2022
Learning Primitive-aware Discriminative Representations for Few-shot
  Learning
Learning Primitive-aware Discriminative Representations for Few-shot Learning
Jianpeng Yang
Yuhang Niu
Xuemei Xie
G. Shi
24
1
0
20 Aug 2022
Static and Dynamic Concepts for Self-supervised Video Representation
  Learning
Static and Dynamic Concepts for Self-supervised Video Representation Learning
Rui Qian
Shuangrui Ding
Xian Liu
Dahua Lin
SSL
36
22
0
26 Jul 2022
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw
  Puzzles
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles
Guodong Wang
Yunhong Wang
Jie Qin
Dongming Zhang
Xiuguo Bao
Di Huang
24
78
0
20 Jul 2022
The Anatomy of Video Editing: A Dataset and Benchmark Suite for
  AI-Assisted Video Editing
The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing
Dawit Mureja Argaw
Fabian Caba Heilbron
Joon-Young Lee
Markus Woodson
In So Kweon
VGen
52
22
0
20 Jul 2022
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based
  Action Recognition
Learning from Temporal Spatial Cubism for Cross-Dataset Skeleton-based Action Recognition
Yansong Tang
Xingyu Liu
Xumin Yu
Danyang Zhang
Jiwen Lu
Jie Zhou
27
20
0
17 Jul 2022
Dual Contrastive Learning for Spatio-temporal Representation
Dual Contrastive Learning for Spatio-temporal Representation
Shuangrui Ding
Rui Qian
H. Xiong
AI4TS
SSL
41
21
0
12 Jul 2022
1234
Next