Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.03464
Cited By
v1
v2 (latest)
Generative Artificial Intelligence in Robotic Manipulation: A Survey
5 March 2025
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
Hangjie Yuan
Chao Zhao
Tao Feng
M. Y. Wang
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generative Artificial Intelligence in Robotic Manipulation: A Survey"
50 / 214 papers shown
Title
Mirage: Cross-Embodiment Zero-Shot Policy Transfer with Cross-Painting
Lawrence Yunliang Chen
Kush Hari
K. Dharmarajan
Chenfeng Xu
Quan Vuong
Ken Goldberg
110
22
0
29 Feb 2024
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Xiaoyu Zhang
Matthew Chang
Pranav Kumar
Saurabh Gupta
DiffM
OffRL
93
15
0
27 Feb 2024
PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large Multimodal Models
Dingkun Guo
Yuqi Xiang
Shuqi Zhao
Xinghao Zhu
Masayoshi Tomizuka
Mingyu Ding
Wei Zhan
75
11
0
26 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
89
24
0
25 Feb 2024
CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation
Jun Wang
Yuzhe Qin
Kaiming Kuang
Yigit Korkmaz
Akhilan Gurumoorthy
Hao Su
Xiaolong Wang
84
20
0
22 Feb 2024
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation
Junting Chen
Yao Mu
Qiaojun Yu
Tianming Wei
Silang Wu
...
Wenqi Shao
Yu Qiao
Huazhe Xu
Mingyu Ding
Ping Luo
LM&Ro
64
11
0
22 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
77
125
0
16 Feb 2024
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen
Chenfei Wu
Xiao Liu
Sheng-Siang Yin
Yixuan Pei
Jinglong Yang
Qifeng Chen
Nan Duan
Jianguo Zhang
107
3
0
16 Feb 2024
Policy Improvement using Language Feedback Models
Victor Zhong
Dipendra Kumar Misra
Xingdi Yuan
Marc-Alexandre Côté
43
11
0
12 Feb 2024
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models
Siddharth Karamcheti
Suraj Nair
Ashwin Balakrishna
Percy Liang
Thomas Kollar
Dorsa Sadigh
MLLM
VLM
106
131
0
12 Feb 2024
DexDiffuser: Generating Dexterous Grasps with Diffusion Models
Zehang Weng
Haofei Lu
Danica Kragic
Jens Lundell
99
33
0
05 Feb 2024
Large Language Models for Robotics: Opportunities, Challenges, and Perspectives
Jiaqi Wang
Zihao Wu
Yiwei Li
Hanqi Jiang
Peng Shu
...
Lin Zhao
Bao Ge
Xiang Li
Tianming Liu
Shu Zhang
LM&Ro
84
75
0
09 Jan 2024
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Zipeng Fu
Tony Zhao
Chelsea Finn
195
322
0
04 Jan 2024
Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation
Hongtao Wu
Ya Jing
Chi-Hou Cheang
Guangzeng Chen
Jiafeng Xu
Xinghang Li
Minghuan Liu
Hang Li
Tao Kong
111
108
0
20 Dec 2023
Holodeck: Language Guided Generation of 3D Embodied AI Environments
Yue Yang
Fan-Yun Sun
Luca Weihs
Eli VanderBilt
Alvaro Herrasti
...
Lingjie Liu
Chris Callison-Burch
Mark Yatskar
Aniruddha Kembhavi
Christopher Clark
LM&Ro
86
90
0
14 Dec 2023
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
Yafei Hu
Quanting Xie
Vidhi Jain
Jonathan M Francis
Jay Patrikar
...
Xiaolong Wang
Sebastian A. Scherer
Z. Kira
Fei Xia
Yonatan Bisk
LM&Ro
AI4CE
80
72
0
14 Dec 2023
RobotGPT: Robot Manipulation Learning from ChatGPT
Yixiang Jin
Dingzhe Li
Yong A
Jun Shi
Peng Hao
Gang Hua
Jianwei Zhang
Bin Fang
87
48
0
03 Dec 2023
See and Think: Embodied Agent in Virtual Environment
Zhonghan Zhao
Wenhao Chai
Xuan Wang
Li Boyi
Shengyu Hao
Shidong Cao
Tianbo Ye
Gaoang Wang
LM&Ro
LLMAG
96
38
0
26 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
263
1,170
0
25 Nov 2023
The voraus-AD Dataset for Anomaly Detection in Robot Applications
Jan Thiess Brockmann
Marco Rudolph
Bodo Rosenhahn
Bastian Wandt
60
11
0
08 Nov 2023
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation
Yufei Wang
Zhou Xian
Feng Chen
Tsun-Hsuan Wang
Yian Wang
Katerina Fragkiadaki
Zackory M. Erickson
David Held
Chuang Gan
LM&Ro
94
109
0
02 Nov 2023
Gen2Sim: Scaling up Robot Learning in Simulation with Generative Models
Pushkal Katara
Zhou Xian
Katerina Fragkiadaki
LM&Ro
109
44
0
27 Oct 2023
Large Language Models as Generalizable Policies for Embodied Tasks
Andrew Szot
Max Schwarzer
Harsh Agrawal
Bogdan Mazoure
Walter A. Talbott
Katherine Metcalf
Natalie Mackraz
Devon Hjelm
Alexander Toshev
LM&Ro
76
66
0
26 Oct 2023
MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations
Ajay Mandlekar
Soroush Nasiriany
Bowen Wen
Iretiayo Akinola
Yashraj S. Narang
Linxi Fan
Yuke Zhu
Dieter Fox
LM&Ro
134
117
0
26 Oct 2023
Eureka: Human-Level Reward Design via Coding Large Language Models
Yecheng Jason Ma
William Liang
Guanzhi Wang
De-An Huang
Osbert Bastani
Dinesh Jayaraman
Yuke Zhu
Linxi Fan
A. Anandkumar
70
321
0
19 Oct 2023
Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models
Kevin Black
Mitsuhiko Nakamoto
P. Atreya
Homer Walke
Chelsea Finn
Aviral Kumar
Sergey Levine
DiffM
LM&Ro
105
141
0
16 Oct 2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Open X-Embodiment Collaboration
Abby OÑeill
Abdul Rehman
Abhinav Gupta
Abhiram Maddukuri
...
Zhuo Xu
Zichen Jeff Cui
Zichen Zhang
Zipeng Fu
Zipeng Lin
LM&Ro
166
521
0
13 Oct 2023
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
Jingkang Yang
Yuhao Dong
Shuai Liu
Yue Liu
Ziyue Wang
...
Haoran Tan
Jiamu Kang
Yuanhan Zhang
Kaiyang Zhou
Ziwei Liu
LM&Ro
65
48
0
12 Oct 2023
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
71
212
0
09 Oct 2023
Domain Randomization for Sim2real Transfer of Automatically Generated Grasping Datasets
J. Huber
François Hélénon
Hippolyte Watrelot
F. B. Amar
Stéphane Doncieux
62
12
0
06 Oct 2023
GenSim: Generating Robotic Simulation Tasks via Large Language Models
Lirui Wang
Yiyang Ling
Zhecheng Yuan
Mohit Shridhar
Chen Bao
Yuzhe Qin
Bailin Wang
Huazhe Xu
Xiaolong Wang
LM&Ro
97
84
0
02 Oct 2023
Masked Autoencoders are Scalable Learners of Cellular Morphology
Oren Z. Kraus
Kian Kenyon-Dean
Saber Saberian
Maryam Fallah
Peter McLean
...
Chi Vicky Cheng
Kristen Morse
Maureen Makes
Ben Mabey
Berton Earnshaw
69
15
0
27 Sep 2023
PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
Shizhe Chen
Ricardo Garcia Pinel
Cordelia Schmid
Ivan Laptev
LM&Ro
3DPC
78
38
0
27 Sep 2023
When Prolog meets generative models: a new approach for managing knowledge and planning in robotic applications
Enrico Saccon
Ahmet Tikna
Davide De Martini
Edoardo Lamon
Marco Roveri
Luigi Palopoli
LM&Ro
81
2
0
26 Sep 2023
Compositional Foundation Models for Hierarchical Planning
Anurag Ajay
Seung-Jun Han
Yilun Du
Shaung Li
Abhi Gupta
Tommi Jaakkola
Josh Tenenbaum
L. Kaelbling
Akash Srivastava
Pulkit Agrawal
LRM
76
71
0
15 Sep 2023
Gesture-Informed Robot Assistance via Foundation Models
Li-Heng Lin
Yuchen Cui
Yilun Hao
Fei Xia
Dorsa Sadigh
LM&Ro
SLR
45
21
0
06 Sep 2023
Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation
Hyunwoo Ryu
Jiwoo Kim
Hyun Seok Ahn
Junwoo Chang
Joohwan Seo
Taehan Kim
Yubin Kim
Chaewon Hwang
Jongeun Choi
R. Horowitz
DiffM
74
38
0
06 Sep 2023
RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
Homanga Bharadhwaj
Jay Vakil
Mohit Sharma
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
LM&Ro
97
132
0
05 Sep 2023
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
Yanjie Ze
Ge Yan
Yueh-hua Wu
Annabella Macaluso
Yuying Ge
Jianglong Ye
Nicklas Hansen
Li Erran Li
Xinyu Wang
DiffM
AI4CE
74
83
0
31 Aug 2023
ModelScope Text-to-Video Technical Report
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
VGen
DiffM
102
427
0
12 Aug 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
160
1,276
0
28 Jul 2023
Robust Visual Sim-to-Real Transfer for Robotic Manipulation
Ricardo Garcia Pinel
Robin Strudel
Shizhe Chen
Etienne Arlaud
Ivan Laptev
Cordelia Schmid
OffRL
46
5
0
28 Jul 2023
Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
Huy Ha
Peter R. Florence
Shuran Song
LM&Ro
83
155
0
26 Jul 2023
SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning
Krishan Rana
Jesse Haviland
Sourav Garg
Jad Abou-Chakra
Ian Reid
Niko Sünderhauf
LM&Ro
75
236
0
12 Jul 2023
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Wenlong Huang
Chen Wang
Ruohan Zhang
Yunzhu Li
Jiajun Wu
Li Fei-Fei
LM&Ro
109
510
0
12 Jul 2023
Building Cooperative Embodied Agents Modularly with Large Language Models
Hongxin Zhang
Weihua Du
Jiaming Shan
Qinhong Zhou
Yilun Du
J. Tenenbaum
Tianmin Shu
Chuang Gan
LLMAG
LM&Ro
121
169
0
05 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
73
33
0
04 Jul 2023
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation
Théophile Gervet
Zhou Xian
N. Gkanatsios
Katerina Fragkiadaki
94
74
0
30 Jun 2023
REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction
Zeyi Liu
Arpit Bahety
Shuran Song
LRM
101
125
0
27 Jun 2023
RVT: Robotic View Transformer for 3D Object Manipulation
Ankit Goyal
Jie Xu
Yijie Guo
Valts Blukis
Yu-Wei Chao
Dieter Fox
LM&Ro
104
139
0
26 Jun 2023
Previous
1
2
3
4
5
Next