Forecasting of depth and ego-motion with transformers and self-supervision

15 June 2022

Houssem-eddine Boulahbal

Papers cited by "Forecasting of depth and ego-motion with transformers and self-supervision"

50 / 53 papers shown

Title
Grounded Language-Image Pre-training Liunian Harold Li Pengchuan Zhang Haotian Zhang Jianwei Yang Chunyuan Li ... Lu Yuan Lei Zhang Lei Li Kai-Wei Chang Jianfeng Gao ObjD VLM 126 1,062 0 07 Dec 2021
Masked-attention Mask Transformer for Universal Image Segmentation Bowen Cheng Ishan Misra Alex Schwing Alexander Kirillov Rohit Girdhar ISeg 253 2,374 0 02 Dec 2021
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection Yanghao Li Chao-Yuan Wu Haoqi Fan K. Mangalam Bo Xiong Jitendra Malik Christoph Feichtenhofer ViT 148 690 0 02 Dec 2021
Are we ready for a new paradigm shift? A Survey on Visual Deep MLP Ruiyang Liu Yinghui Li Linmi Tao Dun Liang Hai-Tao Zheng 155 99 0 07 Nov 2021
Are conditional GANs explicitly conditional? Houssem-eddine Boulahbal A. Voicila Andrew I. Comport GAN 61 1 0 28 Jun 2021
Scaling Vision Transformers Xiaohua Zhai Alexander Kolesnikov N. Houlsby Lucas Beyer ViT 136 1,087 0 08 Jun 2021
The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth Jamie Watson Oisin Mac Aodha V. Prisacariu Gabriel J. Brostow Michael Firman MDE 74 270 0 29 Apr 2021
Panoptic Segmentation Forecasting Colin Graber Grace Tsai Michael Firman Gabriel J. Brostow Alex Schwing 56 12 0 08 Apr 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows Ze Liu Yutong Lin Yue Cao Han Hu Yixuan Wei Zheng Zhang Stephen Lin B. Guo ViT 453 21,439 0 25 Mar 2021
Predicting Video with VQVAE Jacob Walker Ali Razavi Aaron van den Oord DRL 98 68 0 02 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 955 29,436 0 26 Feb 2021
The Hardware Lottery Sara Hooker 75 212 0 14 Sep 2020
Feature-metric Loss for Self-supervised Learning of Depth and Egomotion Chang Shu Kun Yu Zhixiang Duan Kuiyuan Yang SSL MDE 77 234 0 21 Jul 2020
Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance Marvin Klingner Jan-Aike Termöhlen Jonas Mikolajczyk Tim Fingscheidt MDE 119 321 0 14 Jul 2020
Latent Video Transformer Ruslan Rakhimov Denis Volkhonskiy Alexey Artemov Denis Zorin Evgeny Burnaev VGen 98 120 0 18 Jun 2020
Language Models are Few-Shot Learners Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 814 42,055 0 28 May 2020
End-to-End Object Detection with Transformers Nicolas Carion Francisco Massa Gabriel Synnaeve Nicolas Usunier Alexander Kirillov Sergey Zagoruyko ViT 3DV PINN 421 13,048 0 26 May 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury ... Sasank Chilamkurthy Benoit Steiner Lu Fang Junjie Bai Soumith Chintala ODL 520 42,449 0 03 Dec 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Colin Raffel Noam M. Shazeer Adam Roberts Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu AIMat 445 20,181 0 23 Oct 2019
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video Jiawang Bian Zhichao Li Naiyan Wang Huangying Zhan Chunhua Shen Ming-Ming Cheng Ian Reid MDE 75 511 0 28 Aug 2019
Self-supervised Learning with Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera Yuhua Chen Cordelia Schmid C. Sminchisescu SSL MDE 67 246 0 12 Jul 2019
3D Packing for Self-Supervised Monocular Depth Estimation Vitor Campagnolo Guizilini Rares Andrei Ambrus Sudeep Pillai Allan Raventos Adrien Gaidon SSL 3DPC MDE 79 648 0 06 May 2019
Segmenting the Future Hsu-kuang Chiu Ehsan Adeli Juan Carlos Niebles 64 45 0 24 Apr 2019
Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras A. Gordon Hanhan Li Rico Jonschkowski A. Angelova MDE 70 365 0 10 Apr 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation Manoj Kumar Mohammad Babaeizadeh D. Erhan Chelsea Finn Sergey Levine Laurent Dinh Durk Kingma VGen 84 132 0 04 Mar 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.8K 94,891 0 11 Oct 2018
Bayesian Prediction of Future Street Scenes using Synthetic Likelihoods Apratim Bhattacharyya Mario Fritz Bernt Schiele UQCV 72 46 0 01 Oct 2018
Recurrent Flow-Guided Semantic Forecasting Adam M. Terwilliger Garrick Brazil Xiaoming Liu 51 46 0 21 Sep 2018
GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks Yasin Almalioglu Muhamad Risqi U. Saputra Pedro Porto Buarque de Gusmão Andrew Markham A. Trigoni GAN MDE 74 146 0 16 Sep 2018
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation Anurag Ranjan Varun Jampani Lukas Balles Kihwan Kim Deqing Sun Jonas Wulff Michael J. Black SSL 58 591 0 24 May 2018
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction Huangying Zhan Ravi Garg C. Weerasekera Kejie Li Harsh Agarwal Ian Reid MDE 53 633 0 11 Mar 2018
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose Zhichao Yin Jianping Shi MDE 54 1,143 0 06 Mar 2018
Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints R. Mahjourian Martin Wicke A. Angelova MDE 99 729 0 15 Feb 2018
Unsupervised Learning of Geometry with Edge-aware Depth-Normal Consistency Zhenheng Yang Peng Wang Wei Xu Liang Zhao Ram Nevatia 3DV MDE 64 155 0 10 Nov 2017
Stochastic Variational Video Prediction Mohammad Babaeizadeh Chelsea Finn D. Erhan R. Campbell Sergey Levine DRL VGen 75 542 0 30 Oct 2017
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 713 131,652 0 12 Jun 2017
Unsupervised Learning of Depth and Ego-Motion from Video Tinghui Zhou Matthew A. Brown Noah Snavely D. Lowe MDE 132 2,575 0 25 Apr 2017
Predicting Deeper into the Future of Semantic Segmentation Pauline Luc Natalia Neverova Camille Couprie Jakob Verbeek Yann LeCun 68 242 0 22 Mar 2017
Geometry-Based Next Frame Prediction from Monocular Video R. Mahjourian Martin Wicke A. Angelova MDE 57 41 0 20 Sep 2016
Unsupervised Monocular Depth Estimation with Left-Right Consistency Clément Godard Oisin Mac Aodha Gabriel J. Brostow MDE 145 2,885 0 13 Sep 2016
Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning William Lotter Gabriel Kreiman David D. Cox SSL 94 935 0 25 May 2016
Unsupervised Learning for Physical Interaction through Video Prediction Chelsea Finn Ian Goodfellow Sergey Levine 76 1,044 0 23 May 2016
Single-Image Depth Perception in the Wild Weifeng Chen Z. Fu Dawei Yang Jia Deng MDE 103 520 0 13 Apr 2016
Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue Ravi Garg B. V. Kumar G. Carneiro Ian Reid 3DV MDE 121 1,530 0 16 Mar 2016
Deep Residual Learning for Image Recognition Kaiming He Xiangyu Zhang Shaoqing Ren Jian Sun MedIm 2.2K 194,020 0 10 Dec 2015
Deep multi-scale video prediction beyond mean square error Michaël Mathieu Camille Couprie Yann LeCun GAN 124 1,882 0 17 Nov 2015
Spatial Transformer Networks Max Jaderberg Karen Simonyan Andrew Zisserman Koray Kavukcuoglu 304 7,387 0 05 Jun 2015
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields Fayao Liu Chunhua Shen Guosheng Lin Ian Reid MDE 166 1,197 0 26 Feb 2015
ORB-SLAM: a Versatile and Accurate Monocular SLAM System Raul Mur-Artal José M.M. Montiel Juan D. Tardós 122 6,399 0 03 Feb 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.9K 150,115 0 22 Dec 2014