Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.17659
Cited By
DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning
25 June 2024
Xiaohan Zhang
Zainab Altaweel
Yohei Hayamizu
Yan Ding
S. Amiri
Hao Yang
Andy Kaminski
Chad Esselink
Shiqi Zhang
VLM
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning"
15 / 15 papers shown
Title
AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots
Zhaxizhuoma
Pengan Chen
Ziniu Wu
Jiawei Sun
Dong Wang
Peng Zhou
Nieqing Cao
Yan Ding
Bin Zhao
Xuelong Li
78
5
0
18 Sep 2024
Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V
Peiyuan Zhi
Zhiyuan Zhang
Muzhi Han
Zeyu Zhang
Zhitian Li
Ziyuan Jiao
Ziyuan Jiao
Siyuan Huang
Siyuan Huang
LRM
LM&Ro
73
31
0
16 Apr 2024
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
P. Sermanet
Tianli Ding
Jeffrey Zhao
Fei Xia
Debidatta Dwibedi
...
Pannag R Sanketi
Karol Hausman
Izhak Shafran
Brian Ichter
Yuan Cao
LM&Ro
66
52
0
01 Nov 2023
Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds
Yan Ding
Xiaohan Zhang
S. Amiri
Nieqing Cao
Hao Yang
Andy Kaminski
Chad Esselink
Shiqi Zhang
LM&Ro
52
49
0
27 May 2023
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning
L. Guan
Karthik Valmeekam
S. Sreedharan
Subbarao Kambhampati
LLMAG
40
171
0
24 May 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
916
13,788
0
15 Mar 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models
Austin Stone
Ted Xiao
Yao Lu
K. Gopalakrishnan
Kuang-Huei Lee
...
Sean Kirmani
Brianna Zitkovich
F. Xia
Chelsea Finn
Karol Hausman
LM&Ro
204
148
0
02 Mar 2023
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
74
103
0
23 Jul 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
147
1,922
0
04 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
619
9,009
0
28 Jan 2022
Grounding Predicates through Actions
Toki Migimatsu
Jeannette Bohg
171
35
0
29 Sep 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
480
3,952
0
18 Apr 2021
Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image
Danny Driess
Jung-Su Ha
Marc Toussaint
LRM
38
100
0
09 Jun 2020
Closing the Loop for Robotic Grasping: A Real-time, Generative Grasp Synthesis Approach
D. Morrison
Peter Corke
Jurgen Leitner
3DV
81
554
0
14 Apr 2018
PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains
M. Fox
D. Long
82
2,168
0
22 Jun 2011
1