ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.14638
  4. Cited By
Towards Embodied Scene Description

Towards Embodied Scene Description

30 April 2020
Sinan Tan
Huaping Liu
Di Guo
Xinyu Zhang
F. Sun
    LM&Ro
ArXivPDFHTML

Papers citing "Towards Embodied Scene Description"

35 / 35 papers shown
Title
Deep Reinforcement Learning based Automatic Exploration for Navigation
  in Unknown Environment
Deep Reinforcement Learning based Automatic Exploration for Navigation in Unknown Environment
Haoran Li
Qichao Zhang
Dongbin Zhao
21
195
0
23 Jul 2020
Emergence of Exploratory Look-Around Behaviors through Active
  Observation Completion
Emergence of Exploratory Look-Around Behaviors through Active Observation Completion
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
39
40
0
27 Jun 2019
Multi-Target Embodied Question Answering
Multi-Target Embodied Question Answering
Licheng Yu
Xinlei Chen
Georgia Gkioxari
Joey Tianyi Zhou
Tamara L. Berg
Dhruv Batra
47
103
0
09 Apr 2019
Embodied Visual Recognition
Embodied Visual Recognition
Jianwei Yang
Zhile Ren
Mingze Xu
Xinlei Chen
David J. Crandall
Devi Parikh
Dhruv Batra
48
26
0
09 Apr 2019
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Revisiting EmbodiedQA: A Simple Baseline and Beyond
Yuehua Wu
Lu Jiang
Yi Yang
LM&Ro
54
30
0
08 Apr 2019
Embodied Question Answering in Photorealistic Environments with Point
  Cloud Perception
Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Erik Wijmans
Samyak Datta
Oleksandr Maksymets
Abhishek Das
Georgia Gkioxari
Stefan Lee
Irfan Essa
Devi Parikh
Dhruv Batra
3DPC
LM&Ro
57
167
0
06 Apr 2019
A Behavioral Approach to Visual Navigation with Graph Localization
  Networks
A Behavioral Approach to Visual Navigation with Graph Localization Networks
Kevin Chen
Juan Pablo de Vicente
G. Sepulveda
Fei Xia
Á. Soto
Nathan Tsoi
Silvio Savarese
GNN
43
100
0
01 Mar 2019
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching
Fereshteh Sadeghi
41
28
0
18 Feb 2019
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
83
435
0
26 Dec 2018
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning
  for Vision-Language Navigation
Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation
Xin Eric Wang
Qiuyuan Huang
Asli Celikyilmaz
Jianfeng Gao
Dinghan Shen
Yuan-fang Wang
William Yang Wang
Lei Zhang
LM&Ro
SSL
69
534
0
25 Nov 2018
Neural Modular Control for Embodied Question Answering
Neural Modular Control for Embodied Question Answering
Abhishek Das
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
162
130
0
26 Oct 2018
GAPLE: Generalizable Approaching Policy LEarning for Robotic Object
  Searching in Indoor Environment
GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment
Xin Ye
Zhe Lin
Joon-Young Lee
Jianming Zhang
Shibin Zheng
Yezhou Yang
25
20
0
21 Sep 2018
Sidekick Policy Learning for Active Visual Exploration
Sidekick Policy Learning for Active Visual Exploration
Santhosh Kumar Ramakrishnan
Kristen Grauman
89
29
0
29 Jul 2018
OIL: Observational Imitation Learning
OIL: Observational Imitation Learning
Ge Li
Matthias Muller
Vincent Casser
Neil G. Smith
D. L. Michels
Guohao Li
61
41
0
03 Mar 2018
AI2-THOR: An Interactive 3D Environment for Visual AI
AI2-THOR: An Interactive 3D Environment for Visual AI
Eric Kolve
Roozbeh Mottaghi
Winson Han
Eli VanderBilt
Luca Weihs
...
Daniel Gordon
Yuke Zhu
Aniruddha Kembhavi
Abhinav Gupta
Ali Farhadi
LM&Ro
35
1,091
0
14 Dec 2017
IQA: Visual Question Answering in Interactive Environments
IQA: Visual Question Answering in Interactive Environments
Daniel Gordon
Aniruddha Kembhavi
Mohammad Rastegari
Joseph Redmon
Dieter Fox
Ali Farhadi
LM&Ro
61
388
0
09 Dec 2017
Embodied Question Answering
Embodied Question Answering
Abhishek Das
Samyak Datta
Georgia Gkioxari
Stefan Lee
Devi Parikh
Dhruv Batra
LM&Ro
71
644
0
30 Nov 2017
Vision-and-Language Navigation: Interpreting visually-grounded
  navigation instructions in real environments
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments
Peter Anderson
Qi Wu
Damien Teney
Jake Bruce
Mark Johnson
Niko Sünderhauf
Ian Reid
Stephen Gould
Anton Van Den Hengel
LM&Ro
79
1,299
0
20 Nov 2017
Learning to Look Around: Intelligently Exploring Unseen Environments for
  Unknown Tasks
Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks
Dinesh Jayaraman
Kristen Grauman
SSL
43
103
0
01 Sep 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video
  Captioning
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
50
219
0
08 Aug 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual
  Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
102
4,201
0
25 Jul 2017
Recurrent Topic-Transition GAN for Visual Paragraph Generation
Recurrent Topic-Transition GAN for Visual Paragraph Generation
Xiaodan Liang
Zhiting Hu
Huatian Zhang
Chuang Gan
Eric Xing
GAN
45
200
0
21 Mar 2017
Pyramid Scene Parsing Network
Pyramid Scene Parsing Network
Hengshuang Zhao
Jianping Shi
Xiaojuan Qi
Xiaogang Wang
Jiaya Jia
VOS
SSeg
356
11,941
0
04 Dec 2016
A Hierarchical Approach for Generating Descriptive Image Paragraphs
A Hierarchical Approach for Generating Descriptive Image Paragraphs
J. Krause
Justin Johnson
Ranjay Krishna
Li Fei-Fei
VLM
55
376
0
20 Nov 2016
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning
  Challenge
Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
65
853
0
21 Sep 2016
Target-driven Visual Navigation in Indoor Scenes using Deep
  Reinforcement Learning
Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning
Yuke Zhu
Roozbeh Mottaghi
Eric Kolve
Joseph J. Lim
Abhinav Gupta
Li Fei-Fei
Ali Farhadi
VGen
54
1,516
0
16 Sep 2016
Image Captioning with Semantic Attention
Image Captioning with Semantic Attention
Quanzeng You
Hailin Jin
Zhaowen Wang
Chen Fang
Jiebo Luo
VLM
159
1,657
0
12 Mar 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
109
1,165
0
24 Nov 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal
  Networks
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
410
61,900
0
04 Jun 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
281
10,034
0
10 Feb 2015
CIDEr: Consensus-based Image Description Evaluation
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
224
4,451
0
20 Nov 2014
Show and Tell: A Neural Image Caption Generator
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
186
6,009
0
17 Nov 2014
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
119
6,046
0
17 Nov 2014
Distributed Representations of Words and Phrases and their
  Compositionality
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
296
33,445
0
16 Oct 2013
1