ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.06886
  4. Cited By
Aligning Cyber Space with Physical World: A Comprehensive Survey on
  Embodied AI

Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

9 July 2024
Yang Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
    LM&Ro
    SyDa
    AI4CE
ArXivPDFHTML

Papers citing "Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI"

44 / 94 papers shown
Title
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments
  for Embodied AI
Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
Santhosh Kumar Ramakrishnan
Aaron Gokaslan
Erik Wijmans
Oleksandr Maksymets
Alexander Clegg
...
Andrew Westbury
Angel X. Chang
Manolis Savva
Yili Zhao
Dhruv Batra
43
375
0
16 Sep 2021
ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and
  Tactile Representations
ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations
Ruohan Gao
Yen-Yu Chang
Shivani Mall
Li Fei-Fei
Jiajun Wu
65
80
0
16 Sep 2021
Taxim: An Example-based Simulation Model for GelSight Tactile Sensors
Taxim: An Example-based Simulation Model for GelSight Tactile Sensors
Zilin Si
Wenzhen Yuan
33
96
0
09 Sep 2021
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday
  Household Tasks
iGibson 2.0: Object-Centric Simulation for Robot Learning of Everyday Household Tasks
Chengshu Li
Fei Xia
Roberto Martín-Martín
Michael Lingelbach
S. Srivastava
...
Karen Liu
H. Gweon
Jiajun Wu
Li Fei-Fei
Silvio Savarese
LM&Ro
180
227
0
06 Aug 2021
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D
  Visual Grounding
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He
Yusheng Zhao
Junyu Luo
Tianrui Hui
Shaofei Huang
Aixi Zhang
Si Liu
ViT
38
94
0
05 Aug 2021
Neighbor-view Enhanced Model for Vision and Language Navigation
Neighbor-view Enhanced Model for Vision and Language Navigation
Dongyan An
Yuankai Qi
Yan Huang
Qi Wu
Liang Wang
Tieniu Tan
LM&Ro
46
68
0
15 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
39
95
0
07 Jul 2021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Zhengyuan Yang
Songyang Zhang
Liwei Wang
Jiebo Luo
3DPC
62
122
0
24 May 2021
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding
Aishwarya Kamath
Mannat Singh
Yann LeCun
Gabriel Synnaeve
Ishan Misra
Nicolas Carion
ObjD
VLM
152
876
0
26 Apr 2021
The MIT Humanoid Robot: Design, Motion Planning, and Control For
  Acrobatic Behaviors
The MIT Humanoid Robot: Design, Motion Planning, and Control For Acrobatic Behaviors
Matthew Chignoli
Donghyun Kim
Elijah Stanger-Jones
Sangbae Kim
41
136
0
19 Apr 2021
A Survey of Embodied AI: From Simulators to Research Tasks
A Survey of Embodied AI: From Simulators to Research Tasks
Jiafei Duan
Samson Yu
Tangyao Li
Huaiyu Zhu
Cheston Tan
LM&Ro
48
280
0
08 Mar 2021
Tactile Object Pose Estimation from the First Touch with Geometric
  Contact Rendering
Tactile Object Pose Estimation from the First Touch with Geometric Contact Rendering
Maria Bauzá
Eric Valls
Bryan Lim
Theo Sechopoulos
Alberto Rodriguez
95
76
0
09 Dec 2020
Language-guided Navigation via Cross-Modal Grounding and Alternate
  Adversarial Learning
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning
Weixia Zhang
Chao Ma
Qi Wu
Xiaokang Yang
63
44
0
22 Nov 2020
Point Transformer
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
147
1,972
0
02 Nov 2020
ALFWorld: Aligning Text and Embodied Environments for Interactive
  Learning
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Mohit Shridhar
Xingdi Yuan
Marc-Alexandre Côté
Yonatan Bisk
Adam Trischler
Matthew J. Hausknecht
LM&Ro
LLMAG
60
423
0
08 Oct 2020
GP-SLAM+: real-time 3D lidar SLAM based on improved regionalized
  Gaussian process map reconstruction
GP-SLAM+: real-time 3D lidar SLAM based on improved regionalized Gaussian process map reconstruction
Jianyuan Ruan
Bo Li
Yingqiang Wang
Zhou Fang
GP
22
12
0
03 Aug 2020
3D Shape Reconstruction from Vision and Touch
3D Shape Reconstruction from Vision and Touch
Edward James Smith
Roberto Calandra
Adriana Romero
Georgia Gkioxari
David Meger
Jitendra Malik
M. Drozdzal
38
71
0
07 Jul 2020
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
Li Jiang
Hengshuang Zhao
Shaoshuai Shi
Shu Liu
Chi-Wing Fu
Jiaya Jia
3DPC
71
433
0
03 Apr 2020
SAPIEN: A SimulAted Part-based Interactive ENvironment
SAPIEN: A SimulAted Part-based Interactive ENvironment
Fanbo Xiang
Yuzhe Qin
Kaichun Mo
Yikuan Xia
Hao Zhu
...
He Wang
Li Yi
Angel X. Chang
Leonidas Guibas
Hao Su
232
496
0
19 Mar 2020
Sim2Real Transfer for Reinforcement Learning without Dynamics
  Randomization
Sim2Real Transfer for Reinforcement Learning without Dynamics Randomization
M. Kaspar
J. D. M. Osorio
J. Bock
43
98
0
19 Feb 2020
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday
  Tasks
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Mohit Shridhar
Jesse Thomason
Daniel Gordon
Yonatan Bisk
Winson Han
Roozbeh Mottaghi
Luke Zettlemoyer
Dieter Fox
LM&Ro
79
760
0
03 Dec 2019
VideoNavQA: Bridging the Gap between Visual and Embodied Question
  Answering
VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering
Cătălina Cangea
Eugene Belilovsky
Pietro Lio
Aaron Courville
79
17
0
14 Aug 2019
Making Sense of Vision and Touch: Learning Multimodal Representations
  for Contact-Rich Tasks
Making Sense of Vision and Touch: Learning Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
Peter Zachares
Matthew Tan
K. Srinivasan
Silvio Savarese
Fei-Fei Li
Animesh Garg
Jeannette Bohg
SSL
41
210
0
28 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
408
24,160
0
26 Jul 2019
VoteNet: A Deep Learning Label Fusion Method for Multi-Atlas
  Segmentation
VoteNet: A Deep Learning Label Fusion Method for Multi-Atlas Segmentation
Zhipeng Ding
Xu Han
Marc Niethammer
46
93
0
18 Apr 2019
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
Chris Choy
JunYoung Gwak
Silvio Savarese
3DPC
119
1,768
0
18 Apr 2019
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Yuehua Wu
Lu Jiang
Yi Yang
LM&Ro
54
30
0
08 Apr 2019
From pixels to percepts: Highly robust edge perception and contour
  following using deep learning and an optical biomimetic tactile sensor
From pixels to percepts: Highly robust edge perception and contour following using deep learning and an optical biomimetic tactile sensor
Nathan Lepora
Alex Church
Conrad de Kerckhove
R. Hadsell
John Lloyd
51
101
0
07 Dec 2018
Robust Learning of Tactile Force Estimation through Robot Interaction
Robust Learning of Tactile Force Estimation through Robot Interaction
Balakumar Sundaralingam
Alexander Lambert
Ankur Handa
Byron Boots
Tucker Hermans
Stan Birchfield
Nathan D. Ratliff
Dieter Fox
OOD
39
59
0
15 Oct 2018
DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments
DS-SLAM: A Semantic Visual SLAM towards Dynamic Environments
Chao Yu
Zuxin Liu
Xinjun Liu
F. Xie
Yi Yang
Qi Wei
F. Qiao
69
754
0
22 Sep 2018
Learning Dexterous In-Hand Manipulation
Learning Dexterous In-Hand Manipulation
OpenAI OpenAI
Marcin Andrychowicz
Bowen Baker
Maciek Chociej
Rafal Jozefowicz
...
Szymon Sidor
Joshua Tobin
Peter Welinder
Lilian Weng
Wojciech Zaremba
81
1,865
0
01 Aug 2018
DynaSLAM: Tracking, Mapping and Inpainting in Dynamic Scenes
DynaSLAM: Tracking, Mapping and Inpainting in Dynamic Scenes
Berta Bescós
José M. Fácil
Javier Civera
José Neira
82
853
0
14 Jun 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
106
1,062
0
27 Mar 2018
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement
  Learning for Planned-Ahead Vision-and-Language Navigation
Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation
Xin Eric Wang
Wenhan Xiong
Hongmin Wang
William Yang Wang
56
200
0
21 Mar 2018
AI2-THOR: An Interactive 3D Environment for Visual AI
AI2-THOR: An Interactive 3D Environment for Visual AI
Eric Kolve
Roozbeh Mottaghi
Winson Han
Eli VanderBilt
Luca Weihs
...
Daniel Gordon
Yuke Zhu
Aniruddha Kembhavi
Abhinav Gupta
Ali Farhadi
LM&Ro
35
1,091
0
14 Dec 2017
MINOS: Multimodal Indoor Simulator for Navigation in Complex
  Environments
MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments
Manolis Savva
Angel X. Chang
Alexey Dosovitskiy
Thomas Funkhouser
V. Koltun
59
247
0
11 Dec 2017
The Feeling of Success: Does Touch Sensing Help Predict Grasp Outcomes?
The Feeling of Success: Does Touch Sensing Help Predict Grasp Outcomes?
Roberto Calandra
Andrew Owens
M. Upadhyaya
Wenzhen Yuan
Justin Lin
Edward H. Adelson
Sergey Levine
86
195
0
16 Oct 2017
Matterport3D: Learning from RGB-D Data in Indoor Environments
Matterport3D: Learning from RGB-D Data in Indoor Environments
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
114
1,880
0
18 Sep 2017
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous
  Vehicles
AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles
S. Shah
Debadeepta Dey
Chris Lovett
Ashish Kapoor
83
1,976
0
15 May 2017
Domain Randomization for Transferring Deep Neural Networks from
  Simulation to the Real World
Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World
Joshua Tobin
Rachel Fong
Alex Ray
Jonas Schneider
Wojciech Zaremba
Pieter Abbeel
171
2,948
0
20 Mar 2017
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
212
4,001
0
14 Feb 2017
Preparing for the Unknown: Learning a Universal Policy with Online
  System Identification
Preparing for the Unknown: Learning a Universal Policy with Online System Identification
Wenhao Yu
Jie Tan
Chenxi Liu
Greg Turk
OffRL
57
305
0
08 Feb 2017
Semantic Scene Completion from a Single Depth Image
Semantic Scene Completion from a Single Depth Image
Shuran Song
Feng Yu
Andy Zeng
Angel X. Chang
Manolis Savva
Thomas Funkhouser
3DV
62
1,235
0
28 Nov 2016
Multi-View 3D Object Detection Network for Autonomous Driving
Multi-View 3D Object Detection Network for Autonomous Driving
Xiaozhi Chen
Huimin Ma
Ji Wan
Bo Li
Tian Xia
3DPC
149
2,762
0
23 Nov 2016
Previous
12