2308.00640
Cited By
VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes
1 August 2023
Yuhao Lu
Yixuan Fan
Beixing Deng
Fan Liu
Yali Li
Shengjin Wang
Github (42★)
Papers citing
"VL-Grasp: a 6-Dof Interactive Grasp Policy for Language-Oriented Objects in Cluttered Indoor Scenes"
29 / 29 papers shown
Title
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
111
0
0
13 May 2025
GAT-Grasp: Gesture-Driven Affordance Transfer for Task-Aware Robotic Grasping
Ruixiang Wang
Huayi Zhou
Xinyue Yao
Guiliang Liu
Kui Jia
102
0
0
08 Mar 2025
A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
Houjian Yu
Mingen Li
Alireza Rezazadeh
Yang Yang
Changhyun Choi
88
2
0
28 Sep 2024
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
V. Bhat
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
104
5
0
16 Sep 2024
Graspness Discovery in Clutters for Fast and Accurate Grasp Detection
Chenxi Wang
Hao-Shu Fang
Minghao Gou
Hongjie Fang
Jin Gao
Cewu Lu
111
115
0
17 Jun 2024
AnyGrasp: Robust and Efficient Grasp Perception in Spatial and Temporal Domains
Haoshu Fang
Chenxi Wang
Hongjie Fang
Minghao Gou
Jirong Liu
Hengxu Yan
Wenhai Liu
Yichen Xie
Cewu Lu
115
206
0
16 Dec 2022
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning
Li Yang
Yan Xu
Chunfen Yuan
Wei Liu
Bing Li
Weiming Hu
ObjD
68
117
0
30 Apr 2022
Dex-NeRF: Using a Neural Radiance Field to Grasp Transparent Objects
Jeffrey Ichnowski
Yahav Avigal
Justin Kerr
Ken Goldberg
98
171
0
27 Oct 2021
Referring Transformer: A One-step Approach to Multi-task Visual Grounding
Muchen Li
Leonid Sigal
ObjD
85
192
0
06 Jun 2021
TransVG: End-to-End Visual Grounding with Transformers
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
74
342
0
17 Apr 2021
DexYCB: A Benchmark for Capturing Hand Grasping of Objects
Yu-Wei Chao
Wei Yang
Yu Xiang
Pavlo Molchanov
Ankur Handa
...
Karl Van Wyk
Umar Iqbal
Stan Birchfield
Jan Kautz
Dieter Fox
82
264
0
09 Apr 2021
A Joint Network for Grasp Detection Conditioned on Natural Language Commands
Yiye Chen
Ruinian Xu
Yunzhi Lin
Patricio A. Vela
85
46
0
01 Apr 2021
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
Haolin Liu
Anran Lin
Xiaoguang Han
Lei Yang
Yizhou Yu
Shuguang Cui
61
40
0
14 Mar 2021
Referring Expression Comprehension: A Survey of Methods and Datasets
Yanyuan Qiao
Chaorui Deng
Qi Wu
ObjD
86
97
0
19 Jul 2020
Deep Learning for Image and Point Cloud Fusion in Autonomous Driving: A Review
Yaodong Cui
Ren‐Hao Chen
Wenbo Chu
Long Chen
Daxin Tian
Ying Li
Dongpu Cao
3DPC
64
396
0
10 Apr 2020
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
B. Mildenhall
Pratul P. Srinivasan
Matthew Tancik
Jonathan T. Barron
R. Ramamoorthi
Ren Ng
129
2,594
0
19 Mar 2020
Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Liujuan Cao
Chenglin Wu
Cheng Deng
Rongrong Ji
ObjD
243
291
0
19 Mar 2020
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension
Zhenfang Chen
Peng Wang
Lin Ma
Kwan-Yee K. Wong
Qi Wu
ObjD
92
68
0
01 Mar 2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
89
370
0
18 Dec 2019
The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation
Christopher Xie
Yu Xiang
Arsalan Mousavian
Dieter Fox
123
92
0
30 Jul 2019
PointNetGPD: Detecting Grasp Configurations from Point Sets
Hongzhuo Liang
Xiaojian Ma
Shuang Li
Michael Görner
Song Tang
Bin Fang
F. Sun
Jianwei Zhang
3DPC
65
334
0
17 Sep 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
97
828
0
24 Jan 2018
Grounding Referring Expressions in Images by Variational Context
Hanwang Zhang
Yulei Niu
Shih-Fu Chang
BDL
ObjD
56
220
0
05 Dec 2017
Grasp Pose Detection in Point Clouds
A. T. Pas
Marcus Gualtieri
Kate Saenko
Robert Platt
3DPC
119
561
0
29 Jun 2017
GuessWhat?! Visual object discovery through multi-modal dialogue
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
108
428
0
23 Nov 2016
Natural Language Object Retrieval
Ronghang Hu
Huazhe Xu
Marcus Rohrbach
Jiashi Feng
Kate Saenko
Trevor Darrell
ObjD
94
553
0
13 Nov 2015
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
126
1,345
0
07 Nov 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
199
2,060
0
19 May 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
127
5,585
0
07 Dec 2014