Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.10716
Cited By
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion
19 October 2022
Philippe Weinzaepfel
Vincent Leroy
Thomas Lucas
Romain Brégier
Yohann Cabon
Vaibhav Arora
L. Antsfeld
Boris Chidlovskii
G. Csurka
Jérôme Revaud
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion"
50 / 55 papers shown
Title
When Dance Video Archives Challenge Computer Vision
P. Colantoni
Rafique Ahmed
Prashant Ghimire
Damien Muselet
A. Trémeau
3DH
28
0
0
12 May 2025
RayZer: A Self-supervised Large View Synthesis Model
Hanwen Jiang
Hao Tan
Peng Wang
Haian Jin
Yue Zhao
...
Kai Zhang
Fujun Luan
Kalyan Sunkavalli
Qixing Huang
Georgios Pavlakos
62
0
0
01 May 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&Ro
LRM
31
0
0
22 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
45
0
0
10 Apr 2025
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon
Heeseong Shin
Eunbeen Hong
Hyunwook Choi
Hansang Cho
Daun Jeong
Seungryong Kim
26
0
0
07 Apr 2025
Dexterous Manipulation through Imitation Learning: A Survey
Shan An
Ziyu Meng
Chao Tang
Y. Zhou
Tengyu Liu
...
Yao Mu
Ran Song
Wei Zhang
Zeng-Guang Hou
H. Zhang
51
0
0
04 Apr 2025
Speedy MASt3R
Jingxing Li
Yongjae Lee
Abhay Kumar Yadav
Cheng-Fang Peng
Rama Chellappa
Deliang Fan
3DGS
61
0
0
13 Mar 2025
Alligat0R: Pre-Training Through Co-Visibility Segmentation for Relative Camera Pose Regression
Thibaut Loiseau
Guillaume Bourmaud
Vincent Lepetit
64
0
0
10 Mar 2025
MUSt3R: Multi-view Network for Stereo 3D Reconstruction
Yohann Cabon
Lucas Stoffl
L. Antsfeld
G. Csurka
Boris Chidlovskii
Jérôme Revaud
Vincent Leroy
3DGS
3DV
53
2
0
03 Mar 2025
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
114
4
0
11 Feb 2025
MATCHA:Towards Matching Anything
Fei Xue
Sven Elflein
Laura Leal-Taixe
Qunjie Zhou
49
0
0
28 Jan 2025
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Jianing Yang
Alexander Sax
Kevin J Liang
Mikael Henaff
Hao Tang
Ang Cao
J. Chai
Franziska Meier
Matt Feiszli
3DGS
73
16
0
23 Jan 2025
MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
Hanwen Jiang
Zexiang Xu
Desai Xie
Z. Chen
Haian Jin
...
Xin Sun
Jiuxiang Gu
Qixing Huang
Georgios Pavlakos
Hao Tan
151
1
0
18 Dec 2024
Efficient Object-centric Representation Learning with Pre-trained Geometric Prior
Phúc H. Lê Khắc
Graham Healy
A. Smeaton
OCL
79
0
0
16 Dec 2024
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
Yue Chen
Xingyu Chen
Anpei Chen
Gerard Pons-Moll
Yuliang Xiu
3DGS
86
3
0
12 Dec 2024
Cross-View Completion Models are Zero-shot Correspondence Estimators
Honggyu An
J. Kim
Seonghoon Park
Jaewoo Jung
Jisang Han
Sunghwan Hong
Seungryong Kim
3DV
80
3
0
12 Dec 2024
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
Gyeongjin Kang
Jisang Yoo
Jihyeon Park
Seungtae Nam
Hyeonsoo Im
Sangheon Shin
Sangpil Kim
Eunbyung Park
3DGS
153
3
0
26 Nov 2024
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
Xuweiyi Chen
Markus Marks
Zezhou Cheng
81
0
0
25 Nov 2024
Generating 3D-Consistent Videos from Unposed Internet Photos
Gene Chou
Kai Zhang
Sai Bi
Hao Tan
Zexiang Xu
Fujun Luan
Bharath Hariharan
Noah Snavely
3DGS
VGen
81
3
0
20 Nov 2024
Extreme Rotation Estimation in the Wild
Hana Bezalel
Dotan Ankri
Ruojin Cai
Hadar Averbuch-Elor
36
2
0
11 Nov 2024
3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction
Jongmin Lee
Minsu Cho
44
1
0
01 Nov 2024
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
Zhiyuan Min
Yawei Luo
Jianwen Sun
Yi Yang
3DGS
41
0
0
30 Oct 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
Zhiwen Fan
Jian Zhang
Wenyan Cong
Peihao Wang
Renjie Li
...
Z. Wang
Danfei Xu
B. Ivanovic
Marco Pavone
Yue Wang
3DV
41
11
0
24 Oct 2024
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
Stephen Hausler
Peyman Moghadam
SSL
ViT
29
2
0
09 Oct 2024
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion
Junyi Zhang
Charles Herrmann
Junhwa Hur
Varun Jampani
Trevor Darrell
Forrester Cole
Deqing Sun
Ming Yang
VGen
83
70
0
04 Oct 2024
Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks
Sierra Bonilla
Chiara Di Vece
Rema Daher
Xinwei Ju
Danail Stoyanov
Francisco Vasconcelos
Sophia Bano
3DV
34
1
0
29 Aug 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
Alex N. Wang
Christopher Hoang
Yuwen Xiong
Yann LeCun
Mengye Ren
73
0
0
20 Aug 2024
Improving Neural Surface Reconstruction with Feature Priors from Multi-View Image
Xinlin Ren
Chenjie Cao
Yanwei Fu
Xiangyang Xue
33
2
0
04 Aug 2024
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue
Anurag Das
Francis Engelmann
Siyu Tang
J. E. Lenssen
46
24
0
29 Jul 2024
Self-supervised Pretraining and Finetuning for Monocular Depth and Visual Odometry
Boris Chidlovskii
L. Antsfeld
MDE
ViT
29
1
0
16 Jun 2024
Neural Isometries: Taming Transformations for Equivariant ML
Thomas W. Mitchel
Michael Taylor
Vincent Sitzmann
28
0
0
29 May 2024
SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow
Yihan Wang
Lahav Lipson
Jia Deng
34
37
0
23 May 2024
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
Jian Liu
Wei Sun
Hui Yang
Zhiwen Zeng
Chongpei Liu
Jin Zheng
Xingyu Liu
Hossein Rahmani
N. Sebe
Ajmal Saeed Mian
41
15
0
13 May 2024
Playing to Vision Foundation Model's Strengths in Stereo Matching
Chuangwei Liu
Qijun Chen
Rui Fan
35
12
0
09 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakahrov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
42
12
0
01 Apr 2024
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Jiasen Lu
Christopher Clark
Sangho Lee
Zichen Zhang
Savya Khosla
Ryan Marten
Derek Hoiem
Aniruddha Kembhavi
VLM
MLLM
34
144
0
28 Dec 2023
DUSt3R: Geometric 3D Vision Made Easy
Shuzhe Wang
Vincent Leroy
Yohann Cabon
Boris Chidlovskii
Jérôme Revaud
3DGS
31
321
0
21 Dec 2023
Low-shot Object Learning with Mutual Exclusivity Bias
Anh Thai
Ahmad Humayun
Stefan Stojanov
Zixuan Huang
Bikram Boote
James M. Rehg
32
2
0
06 Dec 2023
Learning from One Continuous Video Stream
João Carreira
Michael King
Viorica Patraucean
Dilara Gokay
Catalin Ionescu
...
Joseph Heyward
Carl Doersch
Y. Aytar
Dima Damen
Andrew Zisserman
CLL
21
4
0
01 Dec 2023
MFOS: Model-Free & One-Shot Object Pose Estimation
Jongmin Lee
Yohann Cabon
Romain Brégier
Sungjoo Yoo
Jérôme Revaud
ViT
26
6
0
03 Oct 2023
Win-Win: Training High-Resolution Vision Transformers from Two Windows
Vincent Leroy
Jérôme Revaud
Thomas Lucas
Philippe Weinzaepfel
ViT
34
2
0
01 Oct 2023
End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon
G. Bono
L. Antsfeld
Boris Chidlovskii
Zhi Zheng
Christian Wolf
3DV
26
9
0
28 Sep 2023
M
3
^{3}
3
3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding
Muhammad Abdullah Jamal
Omid Mohareri
3DPC
16
1
0
26 Sep 2023
SACReg: Scene-Agnostic Coordinate Regression for Visual Localization
Jérôme Revaud
Yohann Cabon
Romain Brégier
Jongmin Lee
Philippe Weinzaepfel
24
10
0
21 Jul 2023
MIMIC: Masked Image Modeling with Image Correspondences
Kalyani Marathe
Mahtab Bigverdi
Nishat Khan
Tuhin Kundu
Patrick Howe
Sharan Ranjit S
Anand Bhattad
Aniruddha Kembhavi
Linda G. Shapiro
Ranjay Krishna
27
0
0
27 Jun 2023
Audiovisual Masked Autoencoders
Mariana-Iuliana Georgescu
Eduardo Fonseca
Radu Tudor Ionescu
Mario Lucic
Cordelia Schmid
Anurag Arnab
SSL
32
43
0
09 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
16
10
0
05 Dec 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
20
80
0
18 Nov 2022
Weak Augmentation Guided Relational Self-Supervised Learning
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Changshui Zhang
Xiaogang Wang
Chang Xu
32
4
0
16 Mar 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
1
2
Next