1703.02521
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos
7 March 2017
De-An Huang
Joseph J. Lim
Li Fei-Fei
Juan Carlos Niebles
Papers citing "Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos"
10 / 10 papers shown
Title
Joint Discovery of Object States and Manipulation Actions
Jean-Baptiste Alayrac
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
64
79
0
09 Feb 2017
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal
Arjun Chandrasekaran
Dhruv Batra
Devi Parikh
Mohit Bansal
47
60
0
23 Jun 2016
Simpler Context-Dependent Logical Forms via Model Projections
R. Long
Panupong Pasupat
Percy Liang
248
102
0
16 Jun 2016
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
Justin Johnson
A. Karpathy
Li Fei-Fei
VLM
124
1,167
0
24 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction
Anna Rohrbach
Marcus Rohrbach
Ronghang Hu
Trevor Darrell
Bernt Schiele
73
496
0
12 Nov 2015
Alignment-based compositional semantics for instruction following
Jacob Andreas
Dan Klein
55
102
0
26 Aug 2015
Unsupervised Learning from Narrated Instruction Videos
Jean-Baptiste Alayrac
Piotr Bojanowski
Nishant Agrawal
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
SSL
80
289
0
30 Jun 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
187
2,047
0
19 May 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
105
5,583
0
07 Dec 2014
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
227
6,018
0
17 Nov 2014