ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.09489
  4. Cited By
Language-driven Grasp Detection

Language-driven Grasp Detection

13 June 2024
An Dinh Vuong
Minh Nhat Vu
Baoru Huang
Nghia Nguyen
Hieu Le
T. Vo
Anh Nguyen
    VLM
ArXivPDFHTML

Papers citing "Language-driven Grasp Detection"

47 / 47 papers shown
Title
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
141
1
0
05 Mar 2025
Online Trajectory Replanner for Dynamically Grasping Irregular Objects
Online Trajectory Replanner for Dynamically Grasping Irregular Objects
M. Vu
Florian Grander
Anh Nguyen
60
0
0
29 Jan 2025
A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
Houjian Yu
Mingen Li
Alireza Rezazadeh
Yang Yang
Changhyun Choi
80
2
0
28 Sep 2024
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models
V. Bhat
Prashanth Krishnamurthy
Ramesh Karri
Farshad Khorrami
94
5
0
16 Sep 2024
Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine
  PET Reconstruction
Contrastive Diffusion Model with Auxiliary Guidance for Coarse-to-Fine PET Reconstruction
Zeyu Han
Yuhan Wang
Luping Zhou
Peng Wang
Binyu Yan
Jiliu Zhou
Yan Wang
Dinggang Shen
DiffM
49
24
0
20 Aug 2023
Language-Guided Diffusion Model for Visual Grounding
Language-Guided Diffusion Model for Visual Grounding
Sijia Chen
Baochun Li
78
5
0
18 Aug 2023
Motion Planning Diffusion: Learning and Planning of Robot Motions with
  Diffusion Models
Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models
João Carvalho
An T. Le
Mark Baierl
Dorothea Koert
Jan Peters
DiffM
56
114
0
03 Aug 2023
Going Denser with Open-Vocabulary Part Segmentation
Going Denser with Open-Vocabulary Part Segmentation
Pei Sun
Shoufa Chen
Chenchen Zhu
Fanyi Xiao
Ping Luo
Saining Xie
Zhicheng Yan
ObjD
VLM
43
48
0
18 May 2023
Object-centric Inference for Language Conditioned Placement: A
  Foundation Model based Approach
Object-centric Inference for Language Conditioned Placement: A Foundation Model based Approach
Zhi-Wei Xu
Kechun Xu
Yue Wang
R. Xiong
OCL
30
4
0
06 Apr 2023
Text-Conditioned Sampling Framework for Text-to-Image Generation with
  Masked Generative Models
Text-Conditioned Sampling Framework for Text-to-Image Generation with Masked Generative Models
Jaewoong Lee
Sang-Sub Jang
Jaehyeong Jo
Jaehong Yoon
Yunji Kim
Jin-Hwa Kim
Jung-Woo Ha
Sung Ju Hwang
DiffM
56
4
0
04 Apr 2023
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping
  in Clutter
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter
Kechun Xu
Shuqing Zhao
Zhongxiang Zhou
Zizhang Li
Huaijin Pi
Yifeng Zhu
Yue Wang
R. Xiong
43
48
0
24 Feb 2023
ConceptFusion: Open-set Multimodal 3D Mapping
ConceptFusion: Open-set Multimodal 3D Mapping
Krishna Murthy Jatavallabhula
Ali Kuwajerwala
Qiao Gu
Mohd. Omama
Tao Chen
...
Celso Miguel de Melo
Madhava Krishna
Liam Paull
Florian Shkurti
Antonio Torralba
64
239
0
14 Feb 2023
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large
  Language Models
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
Chan Hee Song
Jiaman Wu
Clay Washington
Brian M Sadler
Wei-Lun Chao
Yu-Chuan Su
LLMAG
LM&Ro
119
412
0
08 Dec 2022
EDGE: Editable Dance Generation From Music
EDGE: Editable Dance Generation From Music
Jo-Han Tseng
Rodrigo Castellon
Chenxi Liu
78
236
0
19 Nov 2022
Grasp Learning: Models, Methods, and Performance
Grasp Learning: Models, Methods, and Performance
Robert Platt
63
24
0
09 Nov 2022
Machine Learning-based Framework for Optimally Solving the Analytical
  Inverse Kinematics for Redundant Manipulators
Machine Learning-based Framework for Optimally Solving the Analytical Inverse Kinematics for Redundant Manipulators
Minh Nhat Vu
F. Beck
Michael Schwegel
C. Hartl-Nesic
Anhtuan Nguyen
Andreas Kugi
35
23
0
08 Nov 2022
Singularity Avoidance with Application to Online Trajectory Optimization
  for Serial Manipulators
Singularity Avoidance with Application to Online Trajectory Optimization for Serial Manipulators
F. Beck
Minh Nhat Vu
C. Hartl-Nesic
Andreas Kugi
38
13
0
04 Nov 2022
LAION-5B: An open large-scale dataset for training next generation
  image-text models
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
170
3,444
0
16 Oct 2022
Human Motion Diffusion Model
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
263
758
0
29 Sep 2022
MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware
  Ambidextrous Bin Picking via Physics-based Metaverse Synthesis
MetaGraspNet: A Large-Scale Benchmark Dataset for Scene-Aware Ambidextrous Bin Picking via Physics-based Metaverse Synthesis
Maximilian Gilles
Yuhao Chen
Tim Robin Winter
E. Z. Zeng
Alexander Wong
56
25
0
08 Aug 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
377
6,859
0
13 Apr 2022
Video Diffusion Models
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
187
1,610
0
07 Apr 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
179
1,951
0
04 Apr 2022
Interactive Robotic Grasping with Attribute-Guided Disambiguation
Interactive Robotic Grasping with Attribute-Guided Disambiguation
Yang Yang
Xibai Lou
Changhyun Choi
51
30
0
15 Mar 2022
Conditional Prompt Learning for Vision-Language Models
Conditional Prompt Learning for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VLM
CLIP
VPVLM
122
1,348
0
10 Mar 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
  Sequence-to-Sequence Learning Framework
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM
ObjD
144
873
0
07 Feb 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
408
15,486
0
20 Dec 2021
RegionCLIP: Region-based Language-Image Pretraining
RegionCLIP: Region-based Language-Image Pretraining
Yiwu Zhong
Jianwei Yang
Pengchuan Zhang
Chunyuan Li
Noel Codella
...
Luowei Zhou
Xiyang Dai
Lu Yuan
Yin Li
Jianfeng Gao
VLM
CLIP
130
575
0
16 Dec 2021
Diffusion Models for Implicit Image Segmentation Ensembles
Diffusion Models for Implicit Image Segmentation Ensembles
J. Wolleb
Robin Sandkühler
Florentin Bieder
Philippe Valmaggia
Philippe C. Cattin
DiffM
MedIm
VLM
181
270
0
06 Dec 2021
CLIPort: What and Where Pathways for Robotic Manipulation
CLIPort: What and Where Pathways for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
106
650
0
24 Sep 2021
CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from
  Simulation
CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation
Bowen Wen
Wenzhao Lian
Kostas Bekris
S. Schaal
51
78
0
19 Sep 2021
Align before Fuse: Vision and Language Representation Learning with
  Momentum Distillation
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Junnan Li
Ramprasaath R. Selvaraju
Akhilesh Deepak Gotmare
Shafiq Joty
Caiming Xiong
Guosheng Lin
FaML
186
1,953
0
16 Jul 2021
End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and
  Semantic Segmentation from RGB
End-to-end Trainable Deep Neural Network for Robotic Grasp Detection and Semantic Segmentation from RGB
Stefan Ainetter
Friedrich Fraundorfer
43
123
0
12 Jul 2021
Diffusion Models Beat GANs on Image Synthesis
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
213
7,831
0
11 May 2021
Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via
  Implicit Representations
Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations
Zhenyu Jiang
Yifeng Zhu
Maxwell Svetlik
Kuan Fang
Yuke Zhu
68
139
0
04 Apr 2021
Vision Transformers for Dense Prediction
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
130
1,732
0
24 Mar 2021
Fine-grained Angular Contrastive Learning with Coarse Labels
Fine-grained Angular Contrastive Learning with Coarse Labels
Guy Bukchin
Eli Schwartz
Kate Saenko
Ori Shahar
Rogerio Feris
Raja Giryes
Leonid Karlinsky
72
54
0
07 Dec 2020
ACRONYM: A Large-Scale Grasp Dataset Based on Simulation
ACRONYM: A Large-Scale Grasp Dataset Based on Simulation
Clemens Eppner
Arsalan Mousavian
Dieter Fox
83
210
0
18 Nov 2020
A Billion Ways to Grasp: An Evaluation of Grasp Sampling Schemes on a
  Dense, Physics-based Grasp Data Set
A Billion Ways to Grasp: An Evaluation of Grasp Sampling Schemes on a Dense, Physics-based Grasp Data Set
Clemens Eppner
Arsalan Mousavian
Dieter Fox
65
73
0
11 Dec 2019
Antipodal Robotic Grasping using Generative Residual Convolutional
  Neural Network
Antipodal Robotic Grasping using Generative Residual Convolutional Neural Network
Sulabh Kumra
Shirin Joshi
F. Sahin
61
292
0
11 Sep 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
100
1,369
0
08 Aug 2019
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
Arsalan Mousavian
Clemens Eppner
Dieter Fox
3DPC
77
565
0
25 May 2019
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
321
5,801
0
21 Apr 2019
PointNetGPD: Detecting Grasp Configurations from Point Sets
PointNetGPD: Detecting Grasp Configurations from Point Sets
Hongzhuo Liang
Xiaojian Ma
Shuang Li
Michael Görner
Song Tang
Bin Fang
F. Sun
Jianwei Zhang
3DPC
57
334
0
17 Sep 2018
Closing the Loop for Robotic Grasping: A Real-time, Generative Grasp
  Synthesis Approach
Closing the Loop for Robotic Grasping: A Real-time, Generative Grasp Synthesis Approach
D. Morrison
Peter Corke
Jurgen Leitner
3DV
88
555
0
14 Apr 2018
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
290
6,931
0
12 Mar 2015
Deep Learning for Detecting Robotic Grasps
Deep Learning for Detecting Robotic Grasps
Ian Lenz
Honglak Lee
Ashutosh Saxena
107
1,646
0
16 Jan 2013
1