ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.14819
  4. Cited By
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point
  Modeling

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

29 November 2021
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
    3DPC
ArXivPDFHTML

Papers citing "Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling"

50 / 398 papers shown
Title
SUGAR: Pre-training 3D Visual Representations for Robotics
SUGAR: Pre-training 3D Visual Representations for Robotics
Shizhe Chen
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
44
14
0
01 Apr 2024
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation
  Learning for Neural Radiance Fields
NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Muhammad Zubair Irshad
Sergey Zakahrov
Vitor Campagnolo Guizilini
Adrien Gaidon
Z. Kira
Rares Ambrus
ViT
42
12
0
01 Apr 2024
A Unified Framework for Human-centric Point Cloud Video Understanding
A Unified Framework for Human-centric Point Cloud Video Understanding
Yiteng Xu
Kecheng Ye
Xiao Han
Yiming Ren
Xinge Zhu
Yuexin Ma
31
2
0
29 Mar 2024
To Supervise or Not to Supervise: Understanding and Addressing the Key
  Challenges of 3D Transfer Learning
To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of 3D Transfer Learning
Souhail Hadgi
Lei Li
M. Ovsjanikov
24
0
0
26 Mar 2024
Training point-based deep learning networks for forest segmentation with
  synthetic data
Training point-based deep learning networks for forest segmentation with synthetic data
Francisco Raverta Capua
Juan Schandin
Pablo De Cristóforis
3DPC
36
3
0
21 Mar 2024
CORN: Contact-based Object Representation for Nonprehensile Manipulation
  of General Unseen Objects
CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects
Yoonyoung Cho
Junhyek Han
Yoontae Cho
Beomjoon Kim
34
8
0
16 Mar 2024
SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal
  Conditioned Diffusion Policy
SculptDiff: Learning Robotic Clay Sculpting from Humans with Goal Conditioned Diffusion Policy
Alison Bartsch
Arvind Car
Charlotte Avra
A. Farimani
38
5
0
15 Mar 2024
Fast and Simple Explainability for Point Cloud Networks
Fast and Simple Explainability for Point Cloud Networks
Meir Yossef Levi
Guy Gilboa
3DPC
26
3
0
12 Mar 2024
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer
  Learning for Point Cloud Analysis
Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis
Xin Zhou
Dingkang Liang
Wei Xu
Xingkui Zhu
Yihan Xu
Zhikang Zou
Xiang Bai
24
26
0
03 Mar 2024
Dynamic 3D Point Cloud Sequences as 2D Videos
Dynamic 3D Point Cloud Sequences as 2D Videos
Yiming Zeng
Junhui Hou
Qijian Zhang
Siyu Ren
Wenping Wang
3DPC
41
1
0
02 Mar 2024
Point Cloud Mamba: Point Cloud Learning via State Space Model
Point Cloud Mamba: Point Cloud Learning via State Space Model
Tao Zhang
Xiangtai Li
Haobo Yuan
Shunping Ji
Shuicheng Yan
37
19
0
01 Mar 2024
MaskLRF: Self-supervised Pretraining via Masked Autoencoding of Local
  Reference Frames for Rotation-invariant 3D Point Set Analysis
MaskLRF: Self-supervised Pretraining via Masked Autoencoding of Local Reference Frames for Rotation-invariant 3D Point Set Analysis
Takahiko Furuya
3DPC
43
2
0
01 Mar 2024
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang
Shengcao Cao
Yu-Xiong Wang
33
16
0
28 Feb 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
41
51
0
27 Feb 2024
CAPT: Category-level Articulation Estimation from a Single Point Cloud
  Using Transformer
CAPT: Category-level Articulation Estimation from a Single Point Cloud Using Transformer
Lian Fu
Ryoichi Ishikawa
Yoshihiro Sato
Takeshi Oishi
3DPC
ViT
27
1
0
27 Feb 2024
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Parameter-efficient Prompt Learning for 3D Point Cloud Understanding
Hongyu Sun
Yongcai Wang
Wang Chen
Haoran Deng
Deying Li
VPVLM
49
5
0
24 Feb 2024
CLIPose: Category-Level Object Pose Estimation with Pre-trained
  Vision-Language Knowledge
CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge
Xiao Lin
Minghao Zhu
Ronghao Dang
Guangliang Zhou
Shaolong Shu
Feng Lin
Chengju Liu
Qi Chen
CLIP
41
8
0
24 Feb 2024
Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised
  Learning
Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning
Johnathan Xie
Yoonho Lee
Annie S. Chen
Chelsea Finn
25
3
0
22 Feb 2024
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only
  Moving Object Segmentation and Ego-Velocity Estimation
RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation
Changsong Pang
Xieyuanli Chen
Yimin Liu
Huimin Lu
Yuwei Cheng
32
2
0
22 Feb 2024
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene
  Understanding
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding
Yu-Qi Yang
Yufeng Guo
Yang Liu
3DPC
38
2
0
22 Feb 2024
Advancements in Point Cloud-Based 3D Defect Detection and Classification
  for Industrial Systems: A Comprehensive Survey
Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey
Anju Rani
D. O. Arroyo
Petar Durdevic
3DPC
19
5
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
56
41
0
19 Feb 2024
DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT
  Based Diffusion Model
DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model
Yu Feng
Xing Shi
Mengli Cheng
Yun Xiong
19
0
0
17 Feb 2024
PointMamba: A Simple State Space Model for Point Cloud Analysis
PointMamba: A Simple State Space Model for Point Cloud Analysis
Dingkang Liang
Xin Zhou
Wei Xu
Xingkui Zhu
Zhikang Zou
Xiaoqing Ye
Xinyu Wang
Xiang Bai
89
90
0
16 Feb 2024
MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D
  Point Cloud Understanding
MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding
Hai-Tao Yu
Mofei Song
3DPC
14
7
0
15 Feb 2024
GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data
Haoyuan Li
Yanpeng Zhou
Yihan Zeng
Hang Xu
Xiaodan Liang
3DGS
CLIP
27
0
0
09 Feb 2024
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos
Zhicheng Zheng
Xin Yan
Zhenfang Chen
Jingzhou Wang
Qin Zhi Eddie Lim
Joshua B. Tenenbaum
Chuang Gan
LRM
34
6
0
09 Feb 2024
A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer
A Graph is Worth KKK Words: Euclideanizing Graph using Pure Transformer
Zhangyang Gao
Daize Dong
Cheng Tan
Jun-Xiong Xia
Bozhen Hu
Stan Z. Li
46
6
0
04 Feb 2024
Transolver: A Fast Transformer Solver for PDEs on General Geometries
Transolver: A Fast Transformer Solver for PDEs on General Geometries
Haixu Wu
Huakun Luo
Haowen Wang
Jianmin Wang
Mingsheng Long
AI4CE
43
40
0
04 Feb 2024
ALERT-Transformer: Bridging Asynchronous and Synchronous Machine
  Learning for Real-Time Event-based Spatio-Temporal Data
ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data
Carmen Martin-Turrero
Maxence Bouvier
Manuel Breitenstein
Pietro Zanuttigh
Vincent Parret
26
4
0
02 Feb 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other
  Modalities
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Yiyuan Zhang
Xiaohan Ding
Kaixiong Gong
Yixiao Ge
Ying Shan
Xiangyu Yue
ViT
19
7
0
25 Jan 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
50
178
0
24 Jan 2024
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration
Yifan Zhang
Siyu Ren
Junhui Hou
Jinjian Wu
Guangming Shi
Guangming Shi
SSL
3DPC
84
3
0
23 Jan 2024
PointGL: A Simple Global-Local Framework for Efficient Point Cloud
  Analysis
PointGL: A Simple Global-Local Framework for Efficient Point Cloud Analysis
Jianan Li
Jie Wang
Tingfa Xu
3DPC
16
6
0
22 Jan 2024
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with
  Fine-Grained Feature Representation
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
Qingdong He
Jinlong Peng
Zhengkai Jiang
Kai Wu
Xiaozhong Ji
Jiangning Zhang
Yabiao Wang
Chengjie Wang
Mingang Chen
Yunsheng Wu
3DPC
28
7
0
21 Jan 2024
CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point
  Cloud Video Understanding
CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding
Yunze Liu
Changxi Chen
Zifan Wang
Li Yi
3DPC
27
3
0
17 Jan 2024
Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding
Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding
Qi Sun
Xiao Cui
Wen-gang Zhou
Houqiang Li
3DPC
22
1
0
15 Jan 2024
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation
Hyunjin Kim
Minhyuk Sung
51
8
0
11 Jan 2024
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with
  Large Language Models
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu
Xiaoshui Huang
Yuenan Hou
Zhihui Wang
Zhen-fei Yin
Yongshun Gong
Peng Gao
Wanli Ouyang
19
8
0
09 Jan 2024
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes
  Interactively
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
Haobo Yuan
Xiangtai Li
Chong Zhou
Yining Li
Kai Chen
Chen Change Loy
VLM
29
51
0
05 Jan 2024
DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point
  Cloud Learning
DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point Cloud Learning
Jincen Jiang
Lizhi Zhao
Xuequan Lu
Wei Hu
Imran Razzak
Meili Wang
SSL
3DPC
20
8
0
05 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision
  and Beyond
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun-Xiong Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
33
14
0
31 Dec 2023
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via
  Expressive Masked Audio Gesture Modeling
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling
Haiyang Liu
Zihao Zhu
Giorgio Becherini
Yichen Peng
Mingyang Su
You Zhou
Xuefei Zhe
Naoya Iwamoto
Bo Zheng
Michael J. Black
SLR
34
29
0
31 Dec 2023
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models
Wan Xu
Tianyu Huang
Tianyu Qu
Guanglei Yang
Yiwen Guo
Wangmeng Zuo
21
0
0
28 Dec 2023
Visual Instruction Tuning towards General-Purpose Multimodal Model: A
  Survey
Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey
Jiaxing Huang
Jingyi Zhang
Kai Jiang
Han Qiu
Shijian Lu
38
22
0
27 Dec 2023
Towards Compact 3D Representations via Point Feature Enhancement Masked
  Autoencoders
Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders
Yaohua Zha
Huizhen Ji
Jinmin Li
Rongsheng Li
Tao Dai
Bin Chen
Zhi Wang
Shu-Tao Xia
3DPC
27
23
0
17 Dec 2023
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation
  Learning
T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning
Weijie Wei
F. Karimi Nejadasl
Theo Gevers
Martin R. Oswald
3DPC
25
3
0
15 Dec 2023
Random resistive memory-based deep extreme point learning machine for
  unified visual processing
Random resistive memory-based deep extreme point learning machine for unified visual processing
Shaocong Wang
Yizhao Gao
Yi Li
Woyu Zhang
Yifei Yu
...
Zhongrui Wang
Dashan Shang
Qi Liu
Kwang-Ting Cheng
Ming-Yu Liu
23
0
0
14 Dec 2023
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object
  Identifiers
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Haifeng Huang
Zehan Wang
Rongjie Huang
Luping Liu
Xize Cheng
Yang Zhao
Tao Jin
Zhou Zhao
59
42
0
13 Dec 2023
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs
  for Embodied AI
Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI
Kai Huang
Boyuan Yang
Wei Gao
29
1
0
13 Dec 2023
Previous
12345678
Next