ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.08830
  4. Cited By
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

18 December 2019
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
    3DPC
ArXivPDFHTML

Papers citing "ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language"

38 / 238 papers shown
Title
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual
  Grounding
EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
Yanmin Wu
Xinhua Cheng
Renrui Zhang
Zesen Cheng
Jian Zhang
56
63
0
29 Sep 2022
Towards Explainable 3D Grounded Visual Question Answering: A New
  Benchmark and Strong Baseline
Towards Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline
Lichen Zhao
Daigang Cai
Jing Zhang
Lu Sheng
Dong Xu
Ruizhi Zheng
Yinjie Zhao
Lipeng Wang
Xibo Fan
6
23
0
24 Sep 2022
Federated Learning via Decentralized Dataset Distillation in
  Resource-Constrained Edge Environments
Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments
Rui Song
Dai Liu
Da Chen
Andreas Festag
Carsten Trinitis
Martin Schulz
Alois C. Knoll
DD
FedML
28
62
0
24 Aug 2022
DoRO: Disambiguation of referred object for embodied agents
DoRO: Disambiguation of referred object for embodied agents
Pradip Pramanick
Chayan Sarkar
S. Paul
R. Roychoudhury
Brojeshwar Bhowmick
LM&Ro
12
14
0
28 Jul 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D
  Vision-Language Models
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
43
101
0
23 Jul 2022
Toward Explainable and Fine-Grained 3D Grounding through Referring
  Textual Phrases
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Zhihao Yuan
Xu Yan
Zhuo Li
Xuhao Li
Yao Guo
Shuguang Cui
Zhen Li
28
17
0
05 Jul 2022
Decomposing NeRF for Editing via Feature Field Distillation
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
184
328
0
31 May 2022
Voxel-informed Language Grounding
Voxel-informed Language Grounding
Rodolfo Corona
Shizhan Zhu
Dan Klein
Trevor Darrell
138
11
0
19 May 2022
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language
  Instructions
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions
Chi-Hou Cheang
Haitao Lin
Yanwei Fu
Xiangyang Xue
19
21
0
09 May 2022
Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds
Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds
Heng Wang
Chaoyi Zhang
Jianhui Yu
Weidong (Tom) Cai
3DPC
22
38
0
22 Apr 2022
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive
  Selection
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
Jun-Bin Luo
Jiahui Fu
Xianghao Kong
Chen Gao
Haibing Ren
Hao Shen
Huaxia Xia
Si Liu
29
89
0
13 Apr 2022
Multi-View Transformer for 3D Visual Grounding
Multi-View Transformer for 3D Visual Grounding
Shijia Huang
Yilun Chen
Jiaya Jia
Liwei Wang
31
112
0
05 Apr 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Manuel Kolmet
Qunjie Zhou
Aljosa Osep
Laura Leal-Taixe
24
22
0
28 Mar 2022
Towards Implicit Text-Guided 3D Shape Generation
Towards Implicit Text-Guided 3D Shape Generation
Zhengzhe Liu
Yi Wang
Xiaojuan Qi
Chi-Wing Fu
27
90
0
28 Mar 2022
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes
Yang Jiao
Shaoxiang Chen
Zequn Jie
Wenke Huang
Lin Ma
Yu-Gang Jiang
3DPC
21
46
0
10 Mar 2022
Unsupervised Point Cloud Representation Learning with Deep Neural
  Networks: A Survey
Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey
Aoran Xiao
Jiaxing Huang
Dayan Guan
Xiaoqin Zhang
Shijian Lu
Ling Shao
3DPC
21
71
0
28 Feb 2022
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
Yue Ruan
Han-Hung Lee
Yiming Zhang
Ke Zhang
Angel X. Chang
32
22
0
19 Jan 2022
Comprehensive Visual Question Answering on Point Clouds through
  Compositional Scene Manipulation
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
Xu Yan
Zhihao Yuan
Yuhao Du
Yinghong Liao
Yao Guo
Zhen Li
Shuguang Cui
3DPC
CoGe
23
14
0
22 Dec 2021
ScanQA: 3D Question Answering for Spatial Scene Understanding
ScanQA: 3D Question Answering for Spatial Scene Understanding
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
M. Kawanabe
32
176
0
20 Dec 2021
Bottom Up Top Down Detection Transformers for Language Grounding in
  Images and Point Clouds
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
Ayush Jain
N. Gkanatsios
Ishita Mediratta
Katerina Fragkiadaki
ObjD
28
99
0
16 Dec 2021
3D Question Answering
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
26
46
0
15 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning
  and Visual Grounding
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
21
29
0
02 Dec 2021
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
CLIP
42
289
0
06 Oct 2021
Uncovering Main Causalities for Long-tailed Information Extraction
Uncovering Main Causalities for Long-tailed Information Extraction
Guoshun Nan
Jiaqi Zeng
Rui Qiao
Zhijiang Guo
Wei Lu
CML
54
46
0
11 Sep 2021
Speaker-Oriented Latent Structures for Dialogue-Based Relation Extraction
Guoshun Nan
Guoqing Luo
Sicong Leng
Yao Xiao
Wei Lu
27
8
0
11 Sep 2021
Towers of Babel: Combining Images, Language, and 3D Geometry for
  Learning Multimodal Vision
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
Xiaoshi Wu
Hadar Averbuch-Elor
J. Sun
Noah Snavely
23
19
0
12 Aug 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D
  Visual Grounding
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He
Yusheng Zhao
Junyu Luo
Tianrui Hui
Shaofei Huang
Aixi Zhang
Si Liu
ViT
16
94
0
05 Aug 2021
Using Depth for Improving Referring Expression Comprehension in
  Real-World Environments
Using Depth for Improving Referring Expression Comprehension in Real-World Environments
Fethiye Irmak Dogan
Iolanda Leite
11
5
0
09 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
22
95
0
07 Jul 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
34
119
0
24 May 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD
  Images
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
27
40
0
14 Mar 2021
OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene
  Grounding
OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene Grounding
Ke-Jyun Wang
Yun-Hsuan Liu
Hung-Ting Su
Jen-Wei Wang
Yu-Siang Wang
Winston H. Hsu
Wen-Chin Chen
45
19
0
13 Mar 2021
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph
  Analysis
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis
Chaoyi Zhang
Jianhui Yu
Yang Song
Weidong (Tom) Cai
3DPC
35
49
0
09 Mar 2021
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding
  on Point Clouds through Instance Multi-level Contextual Referring
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring
Zhihao Yuan
Xu Yan
Yinghong Liao
Ruimao Zhang
Sheng Wang
Zhen Li
Shuguang Cui
71
129
0
01 Mar 2021
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Dave Zhenyu Chen
A. Gholami
Matthias Nießner
Angel X. Chang
3DPC
23
159
0
03 Dec 2020
3D Guided Weakly Supervised Semantic Segmentation
3D Guided Weakly Supervised Semantic Segmentation
Weixuan Sun
Jing Zhang
Nick Barnes
24
13
0
01 Dec 2020
PIE-NET: Parametric Inference of Point Cloud Edges
PIE-NET: Parametric Inference of Point Cloud Edges
Xiaogang Wang
Yuelang Xu
Kai Xu
Andrea Tagliasacchi
Bin Zhou
Ali Mahdavi-Amiri
Hao Zhang
3DPC
20
95
0
09 Jul 2020
ENet: A Deep Neural Network Architecture for Real-Time Semantic
  Segmentation
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,056
0
07 Jun 2016
Previous
12345