CLIPort: What and Where Pathways for Robotic Manipulation


24 September 2021
Mohit Shridhar
Lucas Manuelli
D. Fox
    LM&Ro

Papers citing "CLIPort: What and Where Pathways for Robotic Manipulation"

50 / 477 papers shown
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter
Yaoyao Qian
Xu Zhu
Ondrej Biza
Shuo Jiang
Linfeng Zhao
Hao-zhe Huang
Yu Qi
Robert W. Platt
41
15
0
16 Jul 2024
Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion
Philipp Allgeuer
Kyra Ahrens
Stefan Wermter
CLIP
VLM
27
3
0
15 Jul 2024
Sensorimotor Attention and Language-based Regressions in Shared Latent Variables for Integrating Robot Motion Learning and LLM
Kanata Suzuki
Tetsuya Ogata
37
2
0
12 Jul 2024
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
43
9
0
10 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Yang Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
51
50
0
09 Jul 2024
Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals
Moritz Reuss
Ömer Erdinç Yagmurlu
Fabian Wenzel
Rudolf Lioutikov
OffRL
37
41
0
08 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
31
0
0
05 Jul 2024
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation
I-Chun Arthur Liu
Sicheng He
Daniel Seita
Gaurav Sukhatme
LM&Ro
43
11
0
04 Jul 2024
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Amit Parekh
Nikolas Vitsakis
Alessandro Suglia
Ioannis Konstas
AAML
38
5
0
04 Jul 2024
PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Kuo-Hao Zeng
Zichen Zhang
Kiana Ehsani
Rose Hendrix
Jordi Salvador
Alvaro Herrasti
Ross Girshick
Aniruddha Kembhavi
Luca Weihs
LM&Ro
OffRL
38
17
0
28 Jun 2024
Lifelong Robot Library Learning: Bootstrapping Composable and Generalizable Skills for Embodied Control with Language Models
Georgios Tziafas
H. Kasaei
KELM
LM&Ro
47
8
0
26 Jun 2024
DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning
Xiaohan Zhang
Zainab Altaweel
Yohei Hayamizu
Yan Ding
S. Amiri
Hao Yang
Andy Kaminski
Chad Esselink
Shiqi Zhang
VLM
LM&Ro
41
6
0
25 Jun 2024
CogExplore: Contextual Exploration with Language-Encoded Environment Representations
Harel Biggie
Patrick Cooper
Doncey Albin
Kristen Such
Christoffer Heckman
LM&Ro
35
0
0
24 Jun 2024
Open-vocabulary Pick and Place via Patch-level Semantic Maps
Mingxi Jia
Haojie Huang
Zhewen Zhang
Chenghao Wang
Linfeng Zhao
Dian Wang
J. Liu
Robin Walters
Robert Platt
Stefanie Tellex
LM&Ro
44
5
0
21 Jun 2024
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Jiaming Zhou
Teli Ma
Kun-Yu Lin
Ronghe Qiu
Zifan Wang
Junwei Liang
52
3
0
20 Jun 2024
SpatialBot: Precise Spatial Understanding with Vision Language Models
Wenxiao Cai
Yaroslav Ponomarenko
Jianhao Yuan
Xiaoqi Li
Wankou Yang
Hao Dong
Bo-Lu Zhao
VLM
56
28
0
19 Jun 2024
Contrast Sets for Evaluating Language-Guided Robot Policies
Abrar Anwar
Rohan Gupta
Jesse Thomason
32
3
0
19 Jun 2024
ARDuP: Active Region Video Diffusion for Universal Policies
Shuaiyi Huang
Mara Levy
Zhenyu Jiang
Anima Anandkumar
Yuke Zhu
Linxi Fan
De-An Huang
Abhinav Shrivastava
VGen
50
2
0
19 Jun 2024
Enabling robots to follow abstract instructions and complete complex dynamic tasks
Ruaridh Mon-Williams
Gen Li
Ran Long
Wenqian Du
Chris Lucas
LM&Ro
45
3
0
17 Jun 2024
Duoduo CLIP: Efficient 3D Understanding with Multi-View Images
Han-Hung Lee
Yiming Zhang
Angel X. Chang
3DPC
45
3
0
17 Jun 2024
Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting
Ce Hao
Kelvin Lin
Siyuan Luo
Harold Soh
36
4
0
14 Jun 2024
Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Teli Ma
Jiaming Zhou
Zifan Wang
Ronghe Qiu
Junwei Liang
48
9
0
14 Jun 2024
Language-driven Grasp Detection
An Dinh Vuong
Minh Nhat Vu
Baoru Huang
Nghia Nguyen
Hieu Le
T. Vo
Anh Nguyen
VLM
41
19
0
13 Jun 2024
OpenVLA: An Open-Source Vision-Language-Action Model
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
...
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
51
367
0
13 Jun 2024
Language-Driven Closed-Loop Grasping with Model-Predictive Trajectory Replanning
Huy Hoang Nguyen
Minh Nhat Vu
F. Beck
Gerald Ebmer
Anh Nguyen
Andreas Kugi
18
0
0
13 Jun 2024
Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Xinyu Zhang
Yuhan Liu
Haonan Chang
Abdeslam Boularias
61
1
0
12 Jun 2024
Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control
Dongyoon Hwang
ByungKun Lee
Hojoon Lee
Hyunseung Kim
Jaegul Choo
53
0
0
10 Jun 2024
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
Guanxing Lu
Zifeng Gao
Tianxing Chen
Wen-Dao Dai
Ziwei Wang
Yansong Tang
DiffM
73
14
0
03 Jun 2024
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
53
20
0
01 Jun 2024
Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
Minttu Alakuijala
Reginald McLean
Isaac Woungang
Nariman Farsad
Samuel Kaski
Pekka Marttinen
Kai Yuan
LM&Ro
42
1
0
30 May 2024
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Junjie Zhang
Chenjia Bai
Haoran He
Wenke Xia
Zhigang Wang
Bin Zhao
Xiu Li
Xuelong Li
40
12
0
30 May 2024
Grasp as You Say: Language-guided Dexterous Grasp Generation
Yi-Lin Wei
Jian-Jian Jiang
Chengyi Xing
Xiantuo Tan
Xiao-Ming Wu
Hao Li
M. Cutkosky
Wei-Shi Zheng
57
14
0
29 May 2024
Learning to Recover from Plan Execution Errors during Robot Manipulation: A Neuro-symbolic Approach
Namasivayam Kalithasan
Arnav Tuli
Vishal Bindal
H. Singh
Parag Singla
Rohan Paul
39
0
0
29 May 2024
Render and Diffuse: Aligning Image and Action Spaces for Diffusion-based Behaviour Cloning
Vitalis Vosylius
Younggyo Seo
Jafar Uruç
Stephen James
30
12
0
28 May 2024
Interpretable Robotic Manipulation from Language
Boyuan Zheng
Jianlong Zhou
Fang Chen
LM&Ro
41
0
0
27 May 2024
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung
Pang-Chi Lo
Jia-Fong Yeh
Han-Yuan Hsu
Yi-Ting Chen
Winston H. Hsu
33
0
0
26 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
42
0
23 May 2024
Learning Manipulation Skills through Robot Chain-of-Thought with Sparse Failure Guidance
Kaifeng Zhang
Zhao-Heng Yin
Weirui Ye
Yang Gao
70
3
0
22 May 2024
One-Shot Imitation Learning with Invariance Matching for Robotic Manipulation
Xinyu Zhang
Abdeslam Boularias
42
9
0
21 May 2024
Meta-Control: Automatic Model-based Control Synthesis for Heterogeneous Robot Skills
Tianhao Wei
Liqian Ma
Rui Chen
Weiye Zhao
Changliu Liu
45
3
0
18 May 2024
TRANSIC: Sim-to-Real Policy Transfer by Learning from Online Correction
Yunfan Jiang
Chen Wang
Ruohan Zhang
Jiajun Wu
Fei-Fei Li
OnRL
37
26
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
33
12
0
16 May 2024
Natural Language Can Help Bridge the Sim2Real Gap
Albert Yu
Adeline Foote
Raymond J. Mooney
Roberto Martín-Martín
LM&Ro
51
11
0
16 May 2024
OpenBot-Fleet: A System for Collective Learning with Real Robots
Matthias Müller
Samarth Brahmbhatt
Ankur Deka
Quentin Leboutet
David Hafner
V. Koltun
34
0
0
13 May 2024
Bi-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Dexterous Manipulations
Koffivi Fidele Gbagbe
Miguel Altamirano Cabrera
Ali Alabbas
Oussama Alyunes
Artem Lykov
Dzmitry Tsetserukou
LM&Ro
40
18
0
09 May 2024
Composable Part-Based Manipulation
Weiyu Liu
Jiayuan Mao
Joy Hsu
Tucker Hermans
Animesh Garg
Jiajun Wu
43
12
0
09 May 2024
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
Yide Shentu
Philipp Wu
Aravind Rajeswaran
Pieter Abbeel
37
9
0
08 May 2024
What Foundation Models can Bring for Robot Learning in Manipulation : A Survey
Dingzhe Li
Yixiang Jin
A. Yong
Hongze Yu
Jun Shi
Xiaoshuai Hao
Peng Hao
Huaping Liu
Gang Hua
Bin Fang
AI4CE
LM&Ro
72
13
0
28 Apr 2024
PhyRecon: Physically Plausible Neural Scene Reconstruction
Junfeng Ni
Yixin Chen
Bohan Jing
Nan Jiang
Bin Wang
Bo Dai
Puhao Li
Yixin Zhu
Song-Chun Zhu
Siyuan Huang
49
13
0
25 Apr 2024
FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication
Eric Slyman
Stefan Lee
Scott D. Cohen
Kushal Kafle
VLM
41
5
0
24 Apr 2024