A Dataset for Movie Description

12 January 2015

Bernt Schiele

Papers citing "A Dataset for Movie Description"

50 / 257 papers shown

Title
ODSQA: Open-domain Spoken Question Answering Dataset Chia-Hsuan Lee Shang-Ming Wang Huan-Cheng Chang Hung-yi Lee RALM 30 52 0 07 Aug 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics Nayyer Aafaq Ajmal Mian Wen Liu Syed Zulqarnain Gilani Mubarak Shah 14 91 0 01 Jun 2018
Stories for Images-in-Sequence by using Visual and Narrative Components Marko Smilevski Ilija Lalkovski Gjorgji Madjarov 19 19 0 15 May 2018
Reward Learning from Narrated Demonstrations H. Tung Adam W. Harley Liang-Kang Huang Katerina Fragkiadaki LM&Ro SSL 39 28 0 27 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset Dima Damen Hazel Doughty G. Farinella Sanja Fidler Antonino Furnari ... Davide Moltisanti Jonathan Munro Toby Perrett Will Price Michael Wray EgoV 25 1,000 0 08 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data Antoine Miech Ivan Laptev Josef Sivic 22 233 0 07 Apr 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH) Pelin Dogan Boyang Albert Li Leonid Sigal Markus Gross AI4TS 30 19 0 19 Feb 2018
Visual Text Correction Amir Mazaheri M. Shah 52 11 0 06 Jan 2018
MovieGraphs: Towards Understanding Human-Centric Situations from Videos Paul Vicol Makarand Tapaswi Lluis Castrejon Sanja Fidler 42 136 0 19 Dec 2017
Multimodal Visual Concept Learning with Weakly Supervised Techniques Giorgos Bouritsas Petros Koutras Athanasia Zlatintsi Petros Maragos 14 7 0 03 Dec 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption Wangli Hao Zhaoxiang Zhang He Guan Guibo Zhu 42 36 0 22 Nov 2017
Localizing Moments in Video with Natural Language Lisa Anne Hendricks Oliver Wang Eli Shechtman Josef Sivic Trevor Darrell Bryan C. Russell 55 927 0 04 Aug 2017
Zero-Shot Activity Recognition with Verb Attribute Induction Rowan Zellers Yejin Choi 11 51 0 29 Jul 2017
Learning from Video and Text via Large-Scale Discriminative Clustering Antoine Miech Jean-Baptiste Alayrac Piotr Bojanowski Ivan Laptev Josef Sivic 29 44 0 27 Jul 2017
Video Highlight Prediction Using Audience Chat Reactions Cheng-Yang Fu Joon Lee Joey Tianyi Zhou Alexander C. Berg 19 34 0 26 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey Hirokatsu Kataoka Soma Shirakabe Yun He S. Ueta Teppei Suzuki ... Ryousuke Takasawa Masataka Fuchida Yudai Miyashita Kazushige Okayasu Yuta Matsuzaki 30 1 0 20 Jul 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze Data Youngjae Yu Jongwook Choi Yeonhwa Kim Kyung Yoo Sang-Hun Lee Gunhee Kim 25 69 0 19 Jul 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks Kyung-Min Kim Min-Oh Heo Seongho Choi Byoung-Tak Zhang 26 174 0 04 Jul 2017
The "something something" video database for learning and evaluating visual common sense Raghav Goyal Samira Ebrahimi Kahou Vincent Michalski Joanna Materzynska S. Westphal ... Moritz Mueller-Freitag F. Hoppe Christian Thurau Ingo Bax Roland Memisevic VLM 39 1,496 0 13 Jun 2017
Dense-Captioning Events in Videos Ranjay Krishna Kenji Hata F. Ren Li Fei-Fei Juan Carlos Niebles 70 1,218 0 02 May 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions Amir Mazaheri Dong-Ming Zhang M. Shah 17 12 0 15 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People Anna Rohrbach Marcus Rohrbach Siyu Tang Seong Joon Oh Bernt Schiele 330 72 0 05 Apr 2017
Temporal Tessellation: A Unified Approach for Video Analysis Dotan Kaufman Gil Levi Tal Hassner Lior Wolf 19 16 0 21 Dec 2016
MarioQA: Answering Questions by Watching Gameplay Videos Jonghwan Mun Paul Hongsuck Seo Ilchae Jung Bohyung Han 50 108 0 06 Dec 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi C. Grana Rita Cucchiara 28 191 0 28 Nov 2016
Visual Dialog Abhishek Das Satwik Kottur Khushi Gupta Avi Singh Deshraj Yadav José M. F. Moura Devi Parikh Dhruv Batra 69 990 0 26 Nov 2016
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering Tegan Maharaj Nicolas Ballas Anna Rohrbach Aaron Courville C. Pal VGen 15 107 0 23 Nov 2016
Video Captioning with Transferred Semantic Attributes Yingwei Pan Ting Yao Houqiang Li Tao Mei 27 329 0 23 Nov 2016
Deep Learning for Video Classification and Captioning Zuxuan Wu Ting Yao Yanwei Fu Yu-Gang Jiang 3DV VLM 30 123 0 22 Sep 2016
Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data J. Gao Ram Nevatia 16 1 0 08 Sep 2016
Title Generation for User Generated Videos Kuo-Hao Zeng Tseng-Hung Chen Juan Carlos Niebles Min Sun 35 69 0 25 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation Rakshith Shetty Jorma T. Laaksonen 21 94 0 17 Aug 2016
The KIT Motion-Language Dataset Matthias Plappert Christian Mandery Tamim Asfour 193 273 0 13 Jul 2016
VideoMCC: a New Benchmark for Video Comprehension Du Tran Maksim Bolonkin Manohar Paluri Lorenzo Torresani 29 1 0 23 Jun 2016
Bidirectional Long-Short Term Memory for Video Description Yi Bin Yang Yang Zi Huang Fumin Shen Xing Xu Heng Tao Shen 39 60 0 15 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey Hirokatsu Kataoka Yudai Miyashita Tomoaki K. Yamabe Soma Shirakabe Shin-ichi Sato ... Kaori Abe Takaaki Imanari Naomichi Kobayashi Shinichiro Morita Akio Nakamura 24 2 0 26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences Andrew Shin Katsunori Ohnishi Tatsuya Harada 17 31 0 18 May 2016
Movie Description Anna Rohrbach Atousa Torabi Marcus Rohrbach Niket Tandon C. Pal Hugo Larochelle Aaron Courville Bernt Schiele 3DV VGen 32 353 0 12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering Mateusz Malinowski Marcus Rohrbach Mario Fritz 24 101 0 09 May 2016
Attributes as Semantic Units between Natural Language and Visual Recognition Marcus Rohrbach VLM 22 3 0 12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description Yuncheng Li Yale Song Liangliang Cao Joel R. Tetreault Larry Goldberg A. Jaimes Jiebo Luo 25 270 0 10 Apr 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding Gunnar A. Sigurdsson Gül Varol Xinyu Wang Ali Farhadi Ivan Laptev Abhinav Gupta VGen 43 1,224 0 06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text Subhashini Venugopalan Lisa Anne Hendricks Raymond J. Mooney Kate Saenko VLM 28 117 0 06 Apr 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures Raffaella Bernardi Ruken Cakici Desmond Elliott Aykut Erdem Erkut Erdem Nazli Ikizler-Cinbis Frank Keller A. Muscat Barbara Plank EGVM VLM 27 363 0 15 Jan 2016
Video captioning with recurrent networks based on frame- and video-level features and visual content classification Rakshith Shetty Jorma T. Laaksonen 13 31 0 09 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering Makarand Tapaswi Yukun Zhu Rainer Stiefelhagen Antonio Torralba R. Urtasun Sanja Fidler 43 736 0 09 Dec 2015
Uncovering Temporal Context for Video Question and Answering Linchao Zhu Zhongwen Xu Yi Yang Alexander G. Hauptmann BDL 27 44 0 15 Nov 2015
Oracle performance for visual captioning L. Yao Nicolas Ballas Kyunghyun Cho John R. Smith Yoshua Bengio VLM 39 8 0 14 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning Pingbo Pan Zhongwen Xu Yi Yang Fei Wu Yueting Zhuang 24 385 0 11 Nov 2015
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books Yukun Zhu Ryan Kiros R. Zemel Ruslan Salakhutdinov R. Urtasun Antonio Torralba Sanja Fidler 60 2,517 0 22 Jun 2015