ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.08045
  4. Cited By
Forging Vision Foundation Models for Autonomous Driving: Challenges,
  Methodologies, and Opportunities

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

16 January 2024
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
Bin-Bin Gao
Kaiqiang Zhou
Yue Zhao
Huan Jin
Jiantao Gao
Zhen Li
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
ArXivPDFHTML

Papers citing "Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities"

29 / 79 papers shown
Title
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding
  in 2D and 3D
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D
Yiyi Liao
Jun Xie
Andreas Geiger
3DV
3DPC
64
570
0
28 Sep 2021
Voxel Transformer for 3D Object Detection
Voxel Transformer for 3D Object Detection
Jiageng Mao
Yujing Xue
Minzhe Niu
Haoyue Bai
Jiashi Feng
Xiaodan Liang
Hang Xu
Chunjing Xu
3DPC
ViT
56
409
0
06 Sep 2021
Learning to Prompt for Vision-Language Models
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
440
2,340
0
02 Sep 2021
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP
Han Fang
Pengfei Xiong
Luhui Xu
Yu Chen
CLIP
VLM
73
294
0
21 Jun 2021
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance
  Fields
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields
Jonathan T. Barron
B. Mildenhall
Matthew Tancik
Peter Hedman
Ricardo Martín Brualla
Pratul P. Srinivasan
75
1,945
0
24 Mar 2021
RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR
  Point Cloud Segmentation
RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation
Jianyun Xu
Ruixiang Zhang
Jian Dou
Yushi Zhu
Jie Sun
Shiliang Pu
3DPC
51
262
0
24 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank
  Infilling
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDL
AI4CE
91
1,520
0
18 Mar 2021
Sparsely ensembled convolutional neural network classifiers via
  reinforcement learning
Sparsely ensembled convolutional neural network classifiers via reinforcement learning
R. Malashin
19
3
0
07 Feb 2021
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Self-Supervised Pretraining of 3D Features on any Point-Cloud
Zaiwei Zhang
Rohit Girdhar
Armand Joulin
Ishan Misra
3DPC
147
269
0
07 Jan 2021
AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic
  Segmentation
AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic Segmentation
Venice Erin Liong
Thi Ngoc Tho Nguyen
S. Widjaja
Dhananjai Sharma
Z. J. Chong
3DPC
160
111
0
09 Dec 2020
NeRF++: Analyzing and Improving Neural Radiance Fields
NeRF++: Analyzing and Improving Neural Radiance Fields
Kai Zhang
Gernot Riegler
Noah Snavely
V. Koltun
66
1,035
0
15 Oct 2020
SurfelGAN: Synthesizing Realistic Sensor Data for Autonomous Driving
SurfelGAN: Synthesizing Realistic Sensor Data for Autonomous Driving
Zhenpei Yang
Yuning Chai
Dragomir Anguelov
Yin Zhou
Pei Sun
D. Erhan
Sean M. Rafferty
Henrik Kretzschmar
49
101
0
08 May 2020
Improved Baselines with Momentum Contrastive Learning
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
423
3,397
0
09 Mar 2020
SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic
  Instances
SemanticPOSS: A Point Cloud Dataset with Large Quantity of Dynamic Instances
Yancheng Pan
Biao Gao
Jilin Mei
Sibo Geng
Chengkun Li
Huijing Zhao
3DV
3DPC
48
171
0
21 Feb 2020
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
Shaoshuai Shi
Chaoxu Guo
Li Jiang
Zhe Wang
Jianping Shi
Xiaogang Wang
Hongsheng Li
3DPC
83
1,767
0
31 Dec 2019
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
...
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
79
2,851
0
10 Dec 2019
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Qingyong Hu
Bo Yang
Linhai Xie
Stefano Rosa
Yulan Guo
Zhihua Wang
A. Trigoni
Andrew Markham
3DPC
88
1,480
0
25 Nov 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
113
12,007
0
13 Nov 2019
Argoverse: 3D Tracking and Forecasting with Rich Maps
Argoverse: 3D Tracking and Forecasting with Rich Maps
Ming-Fang Chang
John Lambert
Patsorn Sangkloy
Jagjeet Singh
Sławomir Bąk
...
De Wang
Peter Carr
Simon Lucey
Deva Ramanan
James Hays
3DPC
114
1,289
0
06 Nov 2019
Object-Centric Stereo Matching for 3D Object Detection
Object-Centric Stereo Matching for 3D Object Detection
Alex D. Pon
Jason Ku
Chengyao Li
Steven L. Waslander
3DPC
49
85
0
17 Sep 2019
WoodScape: A multi-task, multi-camera fisheye dataset for autonomous
  driving
WoodScape: A multi-task, multi-camera fisheye dataset for autonomous driving
S. Yogamani
Ciarán Hughes
Jonathan Horgan
Ganesh Sistu
P. Varley
...
Sumanth Chennupati
Sanjaya Nayak
Saquib Mansoor
Xavier Perroton
P. Pérez
HAI
46
263
0
04 May 2019
FCOS: Fully Convolutional One-Stage Object Detection
FCOS: Fully Convolutional One-Stage Object Detection
Zhi Tian
Chunhua Shen
Hao Chen
Tong He
ObjD
100
4,969
0
02 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
251
5,653
0
26 Mar 2019
PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
3DPC
147
2,392
0
11 Dec 2018
Deep Generative Modeling of LiDAR Data
Deep Generative Modeling of LiDAR Data
Lucas Caccia
H. V. Hoof
Aaron Courville
Joelle Pineau
3DPC
165
76
0
04 Dec 2018
IDD: A Dataset for Exploring Problems of Autonomous Navigation in
  Unconstrained Environments
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments
G. Varma
A. Subramanian
A. Namboodiri
Manmohan Chandraker
C. V. Jawahar
59
320
0
26 Nov 2018
Toward Driving Scene Understanding: A Dataset for Learning Driver
  Behavior and Causal Reasoning
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning
Vasili Ramanishka
Yi-Ting Chen
Teruhisa Misu
Kate Saenko
64
280
0
06 Nov 2018
The Cityscapes Dataset for Semantic Urban Scene Understanding
The Cityscapes Dataset for Semantic Urban Scene Understanding
Marius Cordts
Mohamed Omran
Sebastian Ramos
Timo Rehfeld
Markus Enzweiler
Rodrigo Benenson
Uwe Franke
Stefan Roth
Bernt Schiele
691
11,540
0
06 Apr 2016
You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon
S. Divvala
Ross B. Girshick
Ali Farhadi
ObjD
568
36,643
0
08 Jun 2015
Previous
12