2309.16772
v3 (latest)
XVO: Generalized Visual Odometry via Cross-Modal Self-Training
28 September 2023
Tohida Rehman
Ronit Mandal
Jimuyang Zhang
Debarshi Kumar Sanyal
SSL
Papers citing "XVO: Generalized Visual Odometry via Cross-Modal Self-Training"
50 / 54 papers shown
Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion
Flavio Schneider
Ojasv Kamal
Zhijing Jin
Bernhard Schölkopf
MGen
105
84
0
27 Jan 2023
SVFormer: Semi-supervised Video Transformer for Action Recognition
Zhen Xing
Qi Dai
Hang-Rui Hu
Jingjing Chen
Zuxuan Wu
Yu-Gang Jiang
ViT
83
72
0
23 Nov 2022
Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors
Zhen Xing
Hengduo Li
Zuxuan Wu
Yu-Gang Jiang
3DV
52
18
0
30 Sep 2022
The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs
C. Rockwell
Justin Johnson
David Fouhey
ViT
88
43
0
18 Aug 2022
Deep Patch Visual Odometry
Zachary Teed
Lahav Lipson
Jia Deng
MDE
89
122
0
08 Aug 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
134
278
0
04 Apr 2022
FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering
Yingda Yin
Yingcheng Cai
He Wang
Baoquan Chen
112
16
0
29 Mar 2022
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di
Jiang
Sihan Liu
Zhaokai Wang
Leyan Zhu
Zexin He
Hongming Liu
Shuicheng Yan
86
93
0
16 Nov 2021
Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation
Lian Xu
Wanli Ouyang
Bennamoun
F. Boussaïd
Ferdous Sohel
Dan Xu
72
130
0
25 Jul 2021
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
Shunkai Li
Xin Wu
Yingdian Cao
H. Zha
67
43
0
29 Mar 2021
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection
Jihan Yang
Shaoshuai Shi
Zhe Wang
Hongsheng Li
Xiaojuan Qi
3DPC
73
189
0
09 Mar 2021
A Survey on Deep Semi-supervised Learning
Xiangli Yang
Zixing Song
Irwin King
Zenglin Xu
108
589
0
28 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViT
MedIm
100
3,506
0
08 Feb 2021
In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning
Mamshad Nayeem Rizve
Kevin Duarte
Yogesh S Rawat
M. Shah
320
521
0
15 Jan 2021
Semantic Audio-Visual Navigation
Changan Chen
Ziad Al-Halah
Kristen Grauman
96
106
0
21 Dec 2020
TartanVO: A Generalizable Learning-based VO
Wenshan Wang
Yaoyu Hu
Sebastian Scherer
59
158
0
31 Oct 2020
Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
Lirui Wang
Yu Xiang
Wei Yang
Arsalan Mousavian
Dieter Fox
3DPC
71
47
0
02 Oct 2020
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
C. Campos
Richard Elvira
J. Rodríguez
José M.M. Montiel
Juan D. Tardós
103
2,893
0
23 Jul 2020
Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary Supervised Deep Adversarial Learning
Mobarakol Islam
Daniel Anojan Atputharuban
Ravikiran Ramesh
Hongliang Ren
MedIm
148
95
0
22 Jul 2020
Improving Adversarial Robustness via Unlabeled Out-of-Domain Data
Zhun Deng
Linjun Zhang
Amirata Ghorbani
James Zou
78
32
0
15 Jun 2020
Robust Learning Through Cross-Task Consistency
Amir Zamir
Alexander Sax
Teresa Yeo
Oğuzhan Fatih Kar
Nikhil Cheerla
Rohan Suri
Zhangjie Cao
Jitendra Malik
Leonidas Guibas
OOD
60
158
0
07 Jun 2020
Jukebox: A Generative Model for Music
Prafulla Dhariwal
Heewoo Jun
Christine Payne
Jong Wook Kim
Alec Radford
Ilya Sutskever
VLM
133
756
0
30 Apr 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
148
1,947
0
13 Apr 2020
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Wang Zhao
Shaohui Liu
Yezhi Shu
Yong Liu
MDE
88
156
0
03 Apr 2020
MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask
Shengyu Zhao
Yilun Sheng
Yue Dong
E. Chang
Yan Xu
3DPC
76
213
0
24 Mar 2020
D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry
Nan Yang
Lukas von Stumberg
Rui Wang
Daniel Cremers
MDE
104
380
0
02 Mar 2020
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Kihyuk Sohn
David Berthelot
Chun-Liang Li
Zizhao Zhang
Nicholas Carlini
E. D. Cubuk
Alexey Kurakin
Han Zhang
Colin Raffel
AAML
163
3,578
0
21 Jan 2020
Visual Odometry Revisited: What Should Be Learnt?
Huangying Zhan
C. Weerasekera
Jiawang Bian
Ian Reid
MDE
53
159
0
21 Sep 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
Pengxiang Yan
Guanbin Li
Yuan Xie
Zhen Li
Chuan Wang
Tianshui Chen
Liang Lin
59
108
0
12 Aug 2019
Unlabeled Data Improves Adversarial Robustness
Y. Carmon
Aditi Raghunathan
Ludwig Schmidt
Percy Liang
John C. Duchi
130
754
0
31 May 2019
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
A. Gordon
Hanhan Li
Rico Jonschkowski
A. Angelova
MDE
75
366
0
10 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
301
5,790
0
26 Mar 2019
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies
Alexander Sax
Bradley Emi
Amir Zamir
Leonidas Guibas
Silvio Savarese
Jitendra Malik
SSL
77
16
0
31 Dec 2018
Guided Feature Selection for Deep Visual Odometry
Fei Xue
Qiuyuan Wang
Xin Wang
W. Dong
Junqiu Wang
H. Zha
MDE
63
51
0
25 Nov 2018
CNN-SVO: Improving the Mapping in Semi-Direct Visual Odometry Using Single-Image Depth Prediction
S. Loo
A. Amiri
S. Mashohor
S. Tang
Hong Zhang
58
89
0
01 Oct 2018
Deep Audio-Visual Speech Recognition
Triantafyllos Afouras
Joon Son Chung
A. Senior
Oriol Vinyals
Andrew Zisserman
98
710
0
06 Sep 2018
DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency
Yuliang Zou
Zelun Luo
Jia-Bin Huang
MDE
72
477
0
05 Sep 2018
Taskonomy: Disentangling Task Transfer Learning
Amir Zamir
Alexander Sax
Bokui (William) Shen
Leonidas Guibas
Jitendra Malik
Silvio Savarese
126
1,222
0
23 Apr 2018
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction
Huangying Zhan
Ravi Garg
C. Weerasekera
Kejie Li
Harsh Agarwal
Ian Reid
MDE
53
633
0
11 Mar 2018
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
Zhichao Yin
Jianping Shi
MDE
69
1,146
0
06 Mar 2018
DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks
Sen Wang
R. Clark
Hongkai Wen
A. Trigoni
73
785
0
25 Sep 2017
UnDeepVO: Monocular Visual Odometry through Unsupervised Deep Learning
Ruihao Li
Sen Wang
Zhiqiang Long
Dongbing Gu
MDE
87
513
0
20 Sep 2017
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
Deqing Sun
Xiaodong Yang
Ming-Yuan Liu
Jan Kautz
3DPC
274
2,450
0
07 Sep 2017
A Survey on Multi-Task Learning
Yu Zhang
Qiang Yang
AIMat
607
2,247
0
25 Jul 2017
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
119
2,945
0
26 May 2017
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
Alex Kendall
Y. Gal
R. Cipolla
3DH
272
3,136
0
19 May 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
381
27,275
0
20 Mar 2017
ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras
Raul Mur-Artal
Juan D. Tardós
349
5,456
0
20 Oct 2016
Direct Sparse Odometry
Jakob Engel
V. Koltun
Daniel Cremers
99
2,531
0
09 Jul 2016
Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer
Jonathan Long
Trevor Darrell
VOS
SSeg
755
37,925
0
20 May 2016