ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2206.07435
  4. Cited By
Forecasting of depth and ego-motion with transformers and
  self-supervision

Forecasting of depth and ego-motion with transformers and self-supervision

15 June 2022
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
    ViTMDE
ArXiv (abs)PDFHTML

Papers citing "Forecasting of depth and ego-motion with transformers and self-supervision"

50 / 53 papers shown
Title
Grounded Language-Image Pre-training
Grounded Language-Image Pre-training
Liunian Harold Li
Pengchuan Zhang
Haotian Zhang
Jianwei Yang
Chunyuan Li
...
Lu Yuan
Lei Zhang
Lei Li
Kai-Wei Chang
Jianfeng Gao
ObjDVLM
126
1,062
0
07 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation
Masked-attention Mask Transformer for Universal Image Segmentation
Bowen Cheng
Ishan Misra
Alex Schwing
Alexander Kirillov
Rohit Girdhar
ISeg
253
2,374
0
02 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and
  Detection
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection
Yanghao Li
Chaoxia Wu
Haoqi Fan
K. Mangalam
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
148
690
0
02 Dec 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP
Ruiyang Liu
Hai-Tao Zheng
Li Tao
Dun Liang
Haitao Zheng
155
99
0
07 Nov 2021
Are conditional GANs explicitly conditional?
Are conditional GANs explicitly conditional?
Houssem-eddine Boulahbal
A. Voicila
Andrew I. Comport
GAN
61
1
0
28 Jun 2021
Scaling Vision Transformers
Scaling Vision Transformers
Xiaohua Zhai
Alexander Kolesnikov
N. Houlsby
Lucas Beyer
ViT
136
1,087
0
08 Jun 2021
The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth
The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth
Jamie Watson
Oisin Mac Aodha
V. Prisacariu
Gabriel J. Brostow
Michael Firman
MDE
74
270
0
29 Apr 2021
Panoptic Segmentation Forecasting
Panoptic Segmentation Forecasting
Colin Graber
Grace Tsai
Michael Firman
Gabriel J. Brostow
Alex Schwing
56
12
0
08 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
453
21,439
0
25 Mar 2021
Predicting Video with VQVAE
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
98
68
0
02 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
955
29,436
0
26 Feb 2021
The Hardware Lottery
The Hardware Lottery
Sara Hooker
75
212
0
14 Sep 2020
Feature-metric Loss for Self-supervised Learning of Depth and Egomotion
Feature-metric Loss for Self-supervised Learning of Depth and Egomotion
Chang Shu
Kun Yu
Zhixiang Duan
Kuiyuan Yang
SSLMDE
77
234
0
21 Jul 2020
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object
  Problem by Semantic Guidance
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance
Marvin Klingner
Jan-Aike Termöhlen
Jonas Mikolajczyk
Tim Fingscheidt
MDE
119
321
0
14 Jul 2020
Latent Video Transformer
Latent Video Transformer
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
98
120
0
18 Jun 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
814
42,055
0
28 May 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT3DVPINN
421
13,048
0
26 May 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
520
42,449
0
03 Dec 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
445
20,181
0
23 Oct 2019
Unsupervised Scale-consistent Depth and Ego-motion Learning from
  Monocular Video
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
Jiawang Bian
Zhichao Li
Naiyan Wang
Huangying Zhan
Chunhua Shen
Ming-Ming Cheng
Ian Reid
MDE
75
511
0
28 Aug 2019
Self-supervised Learning with Geometric Constraints in Monocular Video:
  Connecting Flow, Depth, and Camera
Self-supervised Learning with Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera
Yuhua Chen
Cordelia Schmid
C. Sminchisescu
SSLMDE
67
246
0
12 Jul 2019
3D Packing for Self-Supervised Monocular Depth Estimation
3D Packing for Self-Supervised Monocular Depth Estimation
Vitor Campagnolo Guizilini
Rares Andrei Ambrus
Sudeep Pillai
Allan Raventos
Adrien Gaidon
SSL3DPCMDE
79
648
0
06 May 2019
Segmenting the Future
Segmenting the Future
Hsu-kuang Chiu
Ehsan Adeli
Juan Carlos Niebles
64
45
0
24 Apr 2019
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning
  from Unknown Cameras
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras
A. Gordon
Hanhan Li
Rico Jonschkowski
A. Angelova
MDE
70
365
0
10 Apr 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video
  Generation
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
84
132
0
04 Mar 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
94,891
0
11 Oct 2018
Bayesian Prediction of Future Street Scenes using Synthetic Likelihoods
Bayesian Prediction of Future Street Scenes using Synthetic Likelihoods
Apratim Bhattacharyya
Mario Fritz
Bernt Schiele
UQCV
72
46
0
01 Oct 2018
Recurrent Flow-Guided Semantic Forecasting
Recurrent Flow-Guided Semantic Forecasting
Adam M. Terwilliger
Garrick Brazil
Xiaoming Liu
51
46
0
21 Sep 2018
GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation
  with Generative Adversarial Networks
GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks
Yasin Almalioglu
Muhamad Risqi U. Saputra
Pedro Porto Buarque de Gusmão
Andrew Markham
A. Trigoni
GANMDE
74
146
0
16 Sep 2018
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera
  Motion, Optical Flow and Motion Segmentation
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation
Anurag Ranjan
Varun Jampani
Lukas Balles
Kihwan Kim
Deqing Sun
Jonas Wulff
Michael J. Black
SSL
58
591
0
24 May 2018
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry
  with Deep Feature Reconstruction
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction
Huangying Zhan
Ravi Garg
C. Weerasekera
Kejie Li
Harsh Agarwal
Ian Reid
MDE
53
633
0
11 Mar 2018
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera
  Pose
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
Zhichao Yin
Jianping Shi
MDE
54
1,143
0
06 Mar 2018
Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using
  3D Geometric Constraints
Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints
R. Mahjourian
Martin Wicke
A. Angelova
MDE
99
729
0
15 Feb 2018
Unsupervised Learning of Geometry with Edge-aware Depth-Normal
  Consistency
Unsupervised Learning of Geometry with Edge-aware Depth-Normal Consistency
Zhenheng Yang
Peng Wang
Wenyuan Xu
Liang Zhao
Ram Nevatia
3DVMDE
64
155
0
10 Nov 2017
Stochastic Variational Video Prediction
Stochastic Variational Video Prediction
Mohammad Babaeizadeh
Chelsea Finn
D. Erhan
R. Campbell
Sergey Levine
DRLVGen
75
542
0
30 Oct 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
713
131,652
0
12 Jun 2017
Unsupervised Learning of Depth and Ego-Motion from Video
Unsupervised Learning of Depth and Ego-Motion from Video
Tinghui Zhou
Matthew A. Brown
Noah Snavely
D. Lowe
MDE
132
2,575
0
25 Apr 2017
Predicting Deeper into the Future of Semantic Segmentation
Predicting Deeper into the Future of Semantic Segmentation
Pauline Luc
Natalia Neverova
Camille Couprie
Jakob Verbeek
Yann LeCun
68
242
0
22 Mar 2017
Geometry-Based Next Frame Prediction from Monocular Video
Geometry-Based Next Frame Prediction from Monocular Video
R. Mahjourian
Martin Wicke
A. Angelova
MDE
57
41
0
20 Sep 2016
Unsupervised Monocular Depth Estimation with Left-Right Consistency
Unsupervised Monocular Depth Estimation with Left-Right Consistency
Clément Godard
Oisin Mac Aodha
Gabriel J. Brostow
MDE
145
2,885
0
13 Sep 2016
Deep Predictive Coding Networks for Video Prediction and Unsupervised
  Learning
Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning
William Lotter
Gabriel Kreiman
David D. Cox
SSL
94
935
0
25 May 2016
Unsupervised Learning for Physical Interaction through Video Prediction
Unsupervised Learning for Physical Interaction through Video Prediction
Chelsea Finn
Ian Goodfellow
Sergey Levine
76
1,044
0
23 May 2016
Single-Image Depth Perception in the Wild
Single-Image Depth Perception in the Wild
Weifeng Chen
Z. Fu
Dawei Yang
Jia Deng
MDE
103
520
0
13 Apr 2016
Unsupervised CNN for Single View Depth Estimation: Geometry to the
  Rescue
Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue
Ravi Garg
B. V. Kumar
G. Carneiro
Ian Reid
3DVMDE
121
1,530
0
16 Mar 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
Deep multi-scale video prediction beyond mean square error
Deep multi-scale video prediction beyond mean square error
Michaël Mathieu
Camille Couprie
Yann LeCun
GAN
124
1,882
0
17 Nov 2015
Spatial Transformer Networks
Spatial Transformer Networks
Max Jaderberg
Karen Simonyan
Andrew Zisserman
Koray Kavukcuoglu
304
7,387
0
05 Jun 2015
Learning Depth from Single Monocular Images Using Deep Convolutional
  Neural Fields
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields
Fayao Liu
Chunhua Shen
Guosheng Lin
Ian Reid
MDE
166
1,197
0
26 Feb 2015
ORB-SLAM: a Versatile and Accurate Monocular SLAM System
ORB-SLAM: a Versatile and Accurate Monocular SLAM System
Raul Mur-Artal
José M.M. Montiel
Juan D. Tardós
122
6,399
0
03 Feb 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.9K
150,115
0
22 Dec 2014
12
Next