ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.16772
  4. Cited By
XVO: Generalized Visual Odometry via Cross-Modal Self-Training
v1v2v3 (latest)

XVO: Generalized Visual Odometry via Cross-Modal Self-Training

28 September 2023
Tohida Rehman
Ronit Mandal
Jimuyang Zhang
Debarshi Kumar Sanyal
    SSL
ArXiv (abs)PDFHTML

Papers citing "XVO: Generalized Visual Odometry via Cross-Modal Self-Training"

50 / 54 papers shown
Title
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Flavio Schneider
Ojasv Kamal
Zhijing Jin
Bernhard Schölkopf
MGen
105
84
0
27 Jan 2023
SVFormer: Semi-supervised Video Transformer for Action Recognition
SVFormer: Semi-supervised Video Transformer for Action Recognition
Zhen Xing
Qi Dai
Hang-Rui Hu
Jingjing Chen
Zuxuan Wu
Yu-Gang Jiang
ViT
83
72
0
23 Nov 2022
Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Zhen Xing
Hengduo Li
Zuxuan Wu
Yu-Gang Jiang
3DV
52
18
0
30 Sep 2022
The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction
  by ViTs
The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs
C. Rockwell
Justin Johnson
David Fouhey
ViT
88
43
0
18 Aug 2022
Deep Patch Visual Odometry
Deep Patch Visual Odometry
Zachary Teed
Lahav Lipson
Jia Deng
MDE
89
122
0
08 Aug 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
134
278
0
04 Apr 2022
FisherMatch: Semi-Supervised Rotation Regression via Entropy-based
  Filtering
FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering
Yingda Yin
Yingcheng Cai
He Wang
Baoquan Chen
112
16
0
29 Mar 2022
Video Background Music Generation with Controllable Music Transformer
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di
Jiang
Sihan Liu
Zhaokai Wang
Leyan Zhu
Zexin He
Hongming Liu
Shuicheng Yan
86
93
0
16 Nov 2021
Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised
  Semantic Segmentation
Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation
Lian Xu
Wanli Ouyang
Bennamoun
F. Boussaïd
Ferdous Sohel
Dan Xu
72
130
0
25 Jul 2021
Generalizing to the Open World: Deep Visual Odometry with Online
  Adaptation
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
Shunkai Li
Xin Wu
Yingdian Cao
H. Zha
67
43
0
29 Mar 2021
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object
  Detection
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection
Jihan Yang
Shaoshuai Shi
Zhe Wang
Hongsheng Li
Xiaojuan Qi
3DPC
73
189
0
09 Mar 2021
A Survey on Deep Semi-supervised Learning
A Survey on Deep Semi-supervised Learning
Xiangli Yang
Zixing Song
Irwin King
Zenglin Xu
108
589
0
28 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image
  Segmentation
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViTMedIm
100
3,506
0
08 Feb 2021
In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label
  Selection Framework for Semi-Supervised Learning
In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning
Mamshad Nayeem Rizve
Kevin Duarte
Yogesh S Rawat
M. Shah
320
521
0
15 Jan 2021
Semantic Audio-Visual Navigation
Semantic Audio-Visual Navigation
Changan Chen
Ziad Al-Halah
Kristen Grauman
96
106
0
21 Dec 2020
TartanVO: A Generalizable Learning-based VO
TartanVO: A Generalizable Learning-based VO
Wenshan Wang
Yaoyu Hu
Sebastian Scherer
59
158
0
31 Oct 2020
Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
Lirui Wang
Yu Xiang
Wei Yang
Arsalan Mousavian
Dieter Fox
3DPC
71
47
0
02 Oct 2020
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial
  and Multi-Map SLAM
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
C. Campos
Richard Elvira
J. Rodríguez
José M.M. Montiel
Juan D. Tardós
103
2,893
0
23 Jul 2020
Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary
  Supervised Deep Adversarial Learning
Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary Supervised Deep Adversarial Learning
Mobarakol Islam
Daniel Anojan Atputharuban
Ravikiran Ramesh
Hongliang Ren
MedIm
148
95
0
22 Jul 2020
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
Zhun Deng
Linjun Zhang
Amirata Ghorbani
James Zou
78
32
0
15 Jun 2020
Robust Learning Through Cross-Task Consistency
Robust Learning Through Cross-Task Consistency
Amir Zamir
Alexander Sax
Teresa Yeo
Oğuzhan Fatih Kar
Nikhil Cheerla
Rohan Suri
Zhangjie Cao
Jitendra Malik
Leonidas Guibas
OOD
60
158
0
07 Jun 2020
Jukebox: A Generative Model for Music
Jukebox: A Generative Model for Music
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
VLM
133
756
0
30 Apr 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
148
1,947
0
13 Apr 2020
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Wang Zhao
Shaohui Liu
Yezhi Shu
Yong Liu
MDE
88
156
0
03 Apr 2020
MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask
MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask
Shengyu Zhao
Yilun Sheng
Yue Dong
E. Chang
Yan Xu
3DPC
76
213
0
24 Mar 2020
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual
  Odometry
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
Nan Yang
Lukas von Stumberg
Rui Wang
Daniel Cremers
MDE
104
380
0
02 Mar 2020
FixMatch: Simplifying Semi-Supervised Learning with Consistency and
  Confidence
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Kihyuk Sohn
David Berthelot
Chun-Liang Li
Zizhao Zhang
Nicholas Carlini
E. D. Cubuk
Alexey Kurakin
Han Zhang
Colin Raffel
AAML
163
3,578
0
21 Jan 2020
Visual Odometry Revisited: What Should Be Learnt?
Visual Odometry Revisited: What Should Be Learnt?
Huangying Zhan
C. Weerasekera
Jiawang Bian
Ian Reid
MDE
53
159
0
21 Sep 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
Pengxiang Yan
Guanbin Li
Yuan Xie
Zhen Li
Chuan Wang
Tianshui Chen
Liang Lin
59
108
0
12 Aug 2019
Unlabeled Data Improves Adversarial Robustness
Unlabeled Data Improves Adversarial Robustness
Y. Carmon
Aditi Raghunathan
Ludwig Schmidt
Percy Liang
John C. Duchi
130
754
0
31 May 2019
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning
  from Unknown Cameras
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
A. Gordon
Hanhan Li
Rico Jonschkowski
A. Angelova
MDE
75
366
0
10 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
301
5,790
0
26 Mar 2019
Mid-Level Visual Representations Improve Generalization and Sample
  Efficiency for Learning Visuomotor Policies
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies
Alexander Sax
Bradley Emi
Amir Zamir
Leonidas Guibas
Silvio Savarese
Jitendra Malik
SSL
77
16
0
31 Dec 2018
Guided Feature Selection for Deep Visual Odometry
Guided Feature Selection for Deep Visual Odometry
Fei Xue
Qiuyuan Wang
Xin Wang
W. Dong
Junqiu Wang
H. Zha
MDE
63
51
0
25 Nov 2018
CNN-SVO: Improving the Mapping in Semi-Direct Visual Odometry Using
  Single-Image Depth Prediction
CNN-SVO: Improving the Mapping in Semi-Direct Visual Odometry Using Single-Image Depth Prediction
S. Loo
A. Amiri
S. Mashohor
S. Tang
Hong Zhang
58
89
0
01 Oct 2018
Deep Audio-Visual Speech Recognition
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
98
710
0
06 Sep 2018
DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task
  Consistency
DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency
Yuliang Zou
Zelun Luo
Jia-Bin Huang
MDE
72
477
0
05 Sep 2018
Taskonomy: Disentangling Task Transfer Learning
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
126
1,222
0
23 Apr 2018
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry
  with Deep Feature Reconstruction
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction
Huangying Zhan
Ravi Garg
C. Weerasekera
Kejie Li
Harsh Agarwal
Ian Reid
MDE
53
633
0
11 Mar 2018
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera
  Pose
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
Zhichao Yin
Jianping Shi
MDE
69
1,146
0
06 Mar 2018
DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent
  Convolutional Neural Networks
DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks
Sen Wang
R. Clark
Hongkai Wen
A. Trigoni
73
785
0
25 Sep 2017
UnDeepVO: Monocular Visual Odometry through Unsupervised Deep Learning
UnDeepVO: Monocular Visual Odometry through Unsupervised Deep Learning
Ruihao Li
Sen Wang
Zhiqiang Long
Dongbing Gu
MDE
87
513
0
20 Sep 2017
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
Deqing Sun
Xiaodong Yang
Ming-Yuan Liu
Jan Kautz
3DPC
274
2,450
0
07 Sep 2017
A Survey on Multi-Task Learning
A Survey on Multi-Task Learning
Yu Zhang
Qiang Yang
AIMat
607
2,247
0
25 Jul 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
119
2,945
0
26 May 2017
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry
  and Semantics
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall
Y. Gal
R. Cipolla
3DH
272
3,136
0
19 May 2017
Mask R-CNN
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
381
27,275
0
20 Mar 2017
ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D
  Cameras
ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras
Raul Mur-Artal
Juan D. Tardós
349
5,456
0
20 Oct 2016
Direct Sparse Odometry
Direct Sparse Odometry
Jakob Engel
V. Koltun
Daniel Cremers
99
2,531
0
09 Jul 2016
Fully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOSSSeg
755
37,925
0
20 May 2016
12
Next