1501.02530
Cited By
A Dataset for Movie Description
12 January 2015
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
VGen
Papers citing "A Dataset for Movie Description"
50 / 257 papers shown
Title
ODSQA: Open-domain Spoken Question Answering Dataset
Chia-Hsuan Lee
Shang-Ming Wang
Huan-Cheng Chang
Hung-yi Lee
RALM
30
52
0
07 Aug 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
14
91
0
01 Jun 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
19
19
0
15 May 2018
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&Ro
SSL
39
28
0
27 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
25
1,000
0
08 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
22
233
0
07 Apr 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Visual Text Correction
Amir Mazaheri
M. Shah
52
11
0
06 Jan 2018
MovieGraphs: Towards Understanding Human-Centric Situations from Videos
Paul Vicol
Makarand Tapaswi
Lluis Castrejon
Sanja Fidler
42
136
0
19 Dec 2017
Multimodal Visual Concept Learning with Weakly Supervised Techniques
Giorgos Bouritsas
Petros Koutras
Athanasia Zlatintsi
Petros Maragos
14
7
0
03 Dec 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
42
36
0
22 Nov 2017
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
55
927
0
04 Aug 2017
Zero-Shot Activity Recognition with Verb Attribute Induction
Rowan Zellers
Yejin Choi
11
51
0
29 Jul 2017
Learning from Video and Text via Large-Scale Discriminative Clustering
Antoine Miech
Jean-Baptiste Alayrac
Piotr Bojanowski
Ivan Laptev
Josef Sivic
29
44
0
27 Jul 2017
Video Highlight Prediction Using Audience Chat Reactions
Cheng-Yang Fu
Joon Lee
Joey Tianyi Zhou
Alexander C. Berg
19
34
0
26 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu
Jongwook Choi
Yeonhwa Kim
Kyung Yoo
Sang-Hun Lee
Gunhee Kim
25
69
0
19 Jul 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks
Kyung-Min Kim
Min-Oh Heo
Seongho Choi
Byoung-Tak Zhang
26
174
0
04 Jul 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
39
1,496
0
13 Jun 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
70
1,218
0
02 May 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong-Ming Zhang
M. Shah
17
12
0
15 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People
Anna Rohrbach
Marcus Rohrbach
Siyu Tang
Seong Joon Oh
Bernt Schiele
330
72
0
05 Apr 2017
Temporal Tessellation: A Unified Approach for Video Analysis
Dotan Kaufman
Gil Levi
Tal Hassner
Lior Wolf
19
16
0
21 Dec 2016
MarioQA: Answering Questions by Watching Gameplay Videos
Jonghwan Mun
Paul Hongsuck Seo
Ilchae Jung
Bohyung Han
50
108
0
06 Dec 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
28
191
0
28 Nov 2016
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
69
990
0
26 Nov 2016
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Tegan Maharaj
Nicolas Ballas
Anna Rohrbach
Aaron Courville
C. Pal
VGen
15
107
0
23 Nov 2016
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
27
329
0
23 Nov 2016
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DV
VLM
30
123
0
22 Sep 2016
Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data
J. Gao
Ram Nevatia
16
1
0
08 Sep 2016
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
35
69
0
25 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation
Rakshith Shetty
Jorma T. Laaksonen
21
94
0
17 Aug 2016
The KIT Motion-Language Dataset
Matthias Plappert
Christian Mandery
Tamim Asfour
193
273
0
13 Jul 2016
VideoMCC: a New Benchmark for Video Comprehension
Du Tran
Maksim Bolonkin
Manohar Paluri
Lorenzo Torresani
29
1
0
23 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
15 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
24
2
0
26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Andrew Shin
Katsunori Ohnishi
Tatsuya Harada
17
31
0
18 May 2016
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
353
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
24
101
0
09 May 2016
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
22
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar A. Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
43
1,224
0
06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
28
117
0
06 Apr 2016
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
27
363
0
15 Jan 2016
Video captioning with recurrent networks based on frame- and video-level features and visual content classification
Rakshith Shetty
Jorma T. Laaksonen
13
31
0
09 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
43
736
0
09 Dec 2015
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
27
44
0
15 Nov 2015
Oracle performance for visual captioning
L. Yao
Nicolas Ballas
Kyunghyun Cho
John R. Smith
Yoshua Bengio
VLM
39
8
0
14 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan
Zhongwen Xu
Yi Yang
Fei Wu
Yueting Zhuang
24
385
0
11 Nov 2015
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books
Yukun Zhu
Ryan Kiros
R. Zemel
Ruslan Salakhutdinov
R. Urtasun
Antonio Torralba
Sanja Fidler
60
2,517
0
22 Jun 2015