Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.12513
Cited By
Learning Point-Language Hierarchical Alignment for 3D Visual Grounding
22 October 2022
Jiaming Chen
Weihua Luo
Ran Song
Xiaolin K. Wei
Lin Ma
Wei Emma Zhang
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Point-Language Hierarchical Alignment for 3D Visual Grounding"
38 / 38 papers shown
Title
Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning
Dongze Lian
Daquan Zhou
Jiashi Feng
Xinchao Wang
79
258
0
17 Oct 2022
Surface Representation for Point Clouds
Haoxi Ran
Jun Liu
Chengjie Wang
3DPC
94
157
0
11 May 2022
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
Jun-Bin Luo
Jiahui Fu
Xianghao Kong
Chen Gao
Haibing Ren
Hao Shen
Huaxia Xia
Si Liu
70
91
0
13 Apr 2022
Multi-View Transformer for 3D Visual Grounding
Shijia Huang
Yilun Chen
Jiaya Jia
Liwei Wang
70
124
0
05 Apr 2022
Vision-Language Pre-Training with Triple Contrastive Learning
Jinyu Yang
Jiali Duan
Son N. Tran
Yi Xu
Sampath Chanda
Liqun Chen
Belinda Zeng
Trishul Chilimbi
Junzhou Huang
VLM
103
295
0
21 Feb 2022
FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection
D. Rukhovich
Anna Vorontsova
Anton Konushin
3DPC
106
117
0
01 Dec 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He
Yusheng Zhao
Junyu Luo
Tianrui Hui
Shaofei Huang
Aixi Zhang
Si Liu
ViT
45
95
0
05 Aug 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
56
95
0
07 Jul 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
79
124
0
24 May 2021
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjD
VLM
170
883
0
26 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
72
342
0
17 Apr 2021
Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding
Binbin Huang
Dongze Lian
Weixin Luo
Shenghua Gao
ObjD
70
94
0
09 Apr 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
55
40
0
14 Mar 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
395
4,941
0
24 Feb 2021
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Wonjae Kim
Bokyung Son
Ildoo Kim
VLM
CLIP
119
1,745
0
05 Feb 2021
Deep Continuous Fusion for Multi-Sensor 3D Object Detection
Ming Liang
Binh Yang
Shenlong Wang
R. Urtasun
3DPC
261
845
0
20 Dec 2020
Object-and-Action Aware Model for Visual Language Navigation
Yuankai Qi
Zizheng Pan
Shengping Zhang
Anton Van Den Hengel
Qi Wu
LM&Ro
48
112
0
29 Jul 2020
3DSSD: Point-based 3D Single Stage Object Detector
Zetong Yang
Yanan Sun
Shu Liu
Jiaya Jia
3DPC
127
941
0
24 Feb 2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
232
7,504
0
02 Oct 2019
A Fast and Accurate One-Stage Approach to Visual Grounding
Zhengyuan Yang
Boqing Gong
Liwei Wang
Wenbing Huang
Dong Yu
Jiebo Luo
ObjD
48
362
0
18 Aug 2019
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning
Wei Zhang
Bairui Wang
Lin Ma
Wei Liu
84
67
0
03 Jun 2019
Deep Hough Voting for 3D Object Detection in Point Clouds
C. Qi
Or Litany
Kaiming He
Leonidas Guibas
3DPC
105
1,287
0
21 Apr 2019
PIXOR: Real-time 3D Object Detection from Point Clouds
Binh Yang
Wenjie Luo
R. Urtasun
3DPC
66
1,093
0
17 Feb 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks
Peng Wang
Qi Wu
Jiewei Cao
Chunhua Shen
Lianli Gao
Anton Van Den Hengel
ObjD
80
255
0
12 Dec 2018
PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
Shaoshuai Shi
Xiaogang Wang
Hongsheng Li
3DPC
180
2,409
0
11 Dec 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
97
828
0
24 Jan 2018
Joint 3D Proposal Generation and Object Detection from View Aggregation
Jason Ku
Melissa Mozifian
Jungwook Lee
Ali Harakeh
Steven Waslander
3DPC
84
1,396
0
06 Dec 2017
Frustum PointNets for 3D Object Detection from RGB-D Data
C. Qi
Wen Liu
Chenxia Wu
Hao Su
Leonidas Guibas
3DPC
147
2,264
0
22 Nov 2017
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou
Oncel Tuzel
3DPC
107
3,722
0
17 Nov 2017
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
469
4,058
0
14 Feb 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
338
3,238
0
02 Dec 2016
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
142
997
0
26 Nov 2016
Multi-View 3D Object Detection Network for Autonomous Driving
Xiaozhi Chen
Huimin Ma
Ji Wan
Bo Li
Tian Xia
3DPC
176
2,772
0
23 Nov 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
315
2,080
0
07 Jun 2016
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images
Shuran Song
Jianxiong Xiao
3DPC
91
680
0
07 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
118
1,345
0
07 Nov 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
196
2,056
0
19 May 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
586
12,704
0
11 Dec 2014
1