Unsupervised Visual-Linguistic Reference Resolution in Instructional
Videos

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos

7 March 2017

De-An Huang

Li Fei-Fei

Juan Carlos Niebles

Papers citing "Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos"

10 / 10 papers shown

Title
Joint Discovery of Object States and Manipulation Actions Jean-Baptiste Alayrac Josef Sivic Ivan Laptev Simon Lacoste-Julien 64 79 0 09 Feb 2017
Sort Story: Sorting Jumbled Images and Captions into Stories Harsh Agrawal Arjun Chandrasekaran Dhruv Batra Devi Parikh Joey Tianyi Zhou 47 60 0 23 Jun 2016
Simpler Context-Dependent Logical Forms via Model Projections R. Long Panupong Pasupat Percy Liang 248 102 0 16 Jun 2016
DenseCap: Fully Convolutional Localization Networks for Dense Captioning Justin Johnson A. Karpathy Li Fei-Fei VLM 124 1,167 0 24 Nov 2015
Grounding of Textual Phrases in Images by Reconstruction Anna Rohrbach Marcus Rohrbach Ronghang Hu Trevor Darrell Bernt Schiele 73 496 0 12 Nov 2015
Alignment-based compositional semantics for instruction following Jacob Andreas Dan Klein 55 102 0 26 Aug 2015
Unsupervised Learning from Narrated Instruction Videos Jean-Baptiste Alayrac Piotr Bojanowski Nishant Agrawal Josef Sivic Ivan Laptev Simon Lacoste-Julien SSL 80 289 0 30 Jun 2015
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models Bryan A. Plummer Liwei Wang Christopher M. Cervantes Juan C. Caicedo Julia Hockenmaier Svetlana Lazebnik 187 2,047 0 19 May 2015
Deep Visual-Semantic Alignments for Generating Image Descriptions A. Karpathy Li Fei-Fei 105 5,583 0 07 Dec 2014
Show and Tell: A Neural Image Caption Generator Oriol Vinyals Alexander Toshev Samy Bengio D. Erhan 3DV 227 6,018 0 17 Nov 2014