ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.04533
  4. Cited By
Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language
  Models

Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models

7 December 2023
Ivan Kapelyukh
Yifei Ren
Ignacio Alzugaray
Edward Johns
    VLM
    LM&Ro
ArXivPDFHTML

Papers citing "Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models"

27 / 27 papers shown
Title
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
51
1
0
11 Apr 2025
World Knowledge from AI Image Generation for Robot Control
World Knowledge from AI Image Generation for Robot Control
Jonas Krumme
C. Zetzsche
LM&Ro
55
0
0
20 Mar 2025
ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints
ShapeShift: Towards Text-to-Shape Arrangement Synthesis with Content-Aware Geometric Constraints
Vihaan Misra
Peter Schaldenbrand
Jean Oh
DiffM
59
1
0
18 Mar 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
Kaixuan Jiang
Yong-Jin Liu
Weixing Chen
Jingzhou Luo
Ziliang Chen
Ling Pan
G. Li
Liang Lin
57
2
0
14 Mar 2025
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Zekun Qi
Wenyao Zhang
Yufei Ding
Runpei Dong
Xinqiang Yu
...
Xin Jin
Kaisheng Ma
Zhizheng Zhang
He Wang
Li Yi
LM&Ro
131
4
0
18 Feb 2025
Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models
Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models
Guanqun Cao
Ryan Mckenna
Erich Graf
John Oyekan
LM&Ro
127
0
0
30 Jan 2025
Enhancing Visual Reasoning with Autonomous Imagination in Multimodal
  Large Language Models
Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models
Jiaheng Liu
Yumeng Li
Boyuan Xiao
Yichang Jian
Ziang Qin
Tianjia Shao
Yao-Xiang Ding
Kun Zhou
MLLM
LRM
100
3
0
27 Nov 2024
Learning Few-Shot Object Placement with Intra-Category Transfer
Learning Few-Shot Object Placement with Intra-Category Transfer
Adrian Rofer
Russell Buchanan
Max Argus
S. Vijayakumar
Abhinav Valada
43
0
0
05 Nov 2024
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
Eric Zhu
Mara Levy
M. Gwilliam
Abhinav Shrivastava
45
0
0
04 Nov 2024
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot
  Scene Rearrangement
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
Shutong Jin
Ruiyu Wang
Kuangyi Chen
Florian T. Pokorny
29
0
0
29 Oct 2024
Learning Spatial Bimanual Action Models Based on Affordance Regions and
  Human Demonstrations
Learning Spatial Bimanual Action Models Based on Affordance Regions and Human Demonstrations
Björn S. Plonka
Christian R. G. Dreher
Andre Meixner
Rainer Kartmann
Tamim Asfour
30
2
0
11 Oct 2024
Stimulating Imagination: Towards General-purpose Object Rearrangement
Stimulating Imagination: Towards General-purpose Object Rearrangement
Jianyang Wu
Jie Gu
Xiaokang Ma
Chu Tang
Jingmin Chen
DiffM
LM&Ro
OCL
37
0
0
03 Aug 2024
Visual Preference Inference: An Image Sequence-Based Preference
  Reasoning in Tabletop Object Manipulation
Visual Preference Inference: An Image Sequence-Based Preference Reasoning in Tabletop Object Manipulation
Joonhyung Lee
Sangbeom Park
Yongin Kwon
Jemin Lee
Minwook Ahn
Sungjoon Choi
24
0
0
18 Mar 2024
SceneScore: Learning a Cost Function for Object Arrangement
SceneScore: Learning a Cost Function for Object Arrangement
Ivan Kapelyukh
Edward Johns
OffRL
DiffM
OCL
24
4
0
14 Nov 2023
Language Models as Zero-Shot Trajectory Generators
Language Models as Zero-Shot Trajectory Generators
Teyun Kwon
Norman Di Palo
Edward Johns
LM&Ro
25
45
0
17 Oct 2023
ConSOR: A Context-Aware Semantic Object Rearrangement Framework for
  Partially Arranged Scenes
ConSOR: A Context-Aware Semantic Object Rearrangement Framework for Partially Arranged Scenes
Kartik Ramachandruni
Max Zuo
Sonia Chernova
31
6
0
30 Sep 2023
SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on
  Scene Graphs
SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs
Guangyao Zhai
Xiaoni Cai
Dianye Huang
Yan Di
Fabian Manhardt
Federico Tombari
Nassir Navab
Benjamin Busam
LM&Ro
24
27
0
21 Sep 2023
Energy-based Models are Zero-Shot Planners for Compositional Scene
  Rearrangement
Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement
N. Gkanatsios
Ayush Jain
Zhou Xian
Yunchu Zhang
C. Atkeson
Katerina Fragkiadaki
LM&Ro
98
31
0
27 Apr 2023
Open-World Object Manipulation using Pre-trained Vision-Language Models
Open-World Object Manipulation using Pre-trained Vision-Language Models
Austin Stone
Ted Xiao
Yao Lu
K. Gopalakrishnan
Kuang-Huei Lee
...
Sean Kirmani
Brianna Zitkovich
F. Xia
Chelsea Finn
Karol Hausman
LM&Ro
153
145
0
02 Mar 2023
RealFusion: 360° Reconstruction of Any Object from a Single Image
RealFusion: 360° Reconstruction of Any Object from a Single Image
Luke Melas-Kyriazi
Christian Rupprecht
Iro Laina
Andrea Vedaldi
98
291
0
21 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
272
4,244
0
30 Jan 2023
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
Nur Muhammad (Mahi) Shafiullah
Chris Paxton
Lerrel Pinto
Soumith Chintala
Arthur Szlam
VLM
LM&Ro
CLIP
95
156
0
11 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
110
145
0
05 Oct 2022
SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp
  and motion optimization through diffusion
SE(3)-DiffusionFields: Learning smooth cost functions for joint grasp and motion optimization through diffusion
Julen Urain
Niklas Funk
Jan Peters
Georgia Chalvatzaki
DiffM
55
118
0
08 Sep 2022
Housekeep: Tidying Virtual Households using Commonsense Reasoning
Housekeep: Tidying Virtual Households using Commonsense Reasoning
Yash Kant
Arun Ramachandran
Sriram Yenamandra
Igor Gilitschenski
Dhruv Batra
Andrew Szot
Harsh Agrawal
LM&Ro
LRM
160
73
0
22 May 2022
Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
Danny Driess
Zhiao Huang
Yunzhu Li
Russ Tedrake
Marc Toussaint
OCL
AI4CE
116
85
0
24 Feb 2022
Efficient and Interpretable Robot Manipulation with Graph Neural
  Networks
Efficient and Interpretable Robot Manipulation with Graph Neural Networks
Yixin Lin
Austin S. Wang
Eric Undersander
Akshara Rai
LM&Ro
105
44
0
25 Feb 2021
1