ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1501.02530
  4. Cited By
A Dataset for Movie Description

A Dataset for Movie Description

12 January 2015
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
    VGen
ArXivPDFHTML

Papers citing "A Dataset for Movie Description"

50 / 257 papers shown
Title
ODSQA: Open-domain Spoken Question Answering Dataset
ODSQA: Open-domain Spoken Question Answering Dataset
Chia-Hsuan Lee
Shang-Ming Wang
Huan-Cheng Chang
Hung-yi Lee
RALM
30
52
0
07 Aug 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
14
91
0
01 Jun 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
19
19
0
15 May 2018
Reward Learning from Narrated Demonstrations
Reward Learning from Narrated Demonstrations
H. Tung
Adam W. Harley
Liang-Kang Huang
Katerina Fragkiadaki
LM&Ro
SSL
39
28
0
27 Apr 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
25
1,000
0
08 Apr 2018
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data
Antoine Miech
Ivan Laptev
Josef Sivic
22
233
0
07 Apr 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Visual Text Correction
Visual Text Correction
Amir Mazaheri
M. Shah
52
11
0
06 Jan 2018
MovieGraphs: Towards Understanding Human-Centric Situations from Videos
MovieGraphs: Towards Understanding Human-Centric Situations from Videos
Paul Vicol
Makarand Tapaswi
Lluis Castrejon
Sanja Fidler
42
136
0
19 Dec 2017
Multimodal Visual Concept Learning with Weakly Supervised Techniques
Multimodal Visual Concept Learning with Weakly Supervised Techniques
Giorgos Bouritsas
Petros Koutras
Athanasia Zlatintsi
Petros Maragos
14
7
0
03 Dec 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
42
36
0
22 Nov 2017
Localizing Moments in Video with Natural Language
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
55
927
0
04 Aug 2017
Zero-Shot Activity Recognition with Verb Attribute Induction
Zero-Shot Activity Recognition with Verb Attribute Induction
Rowan Zellers
Yejin Choi
11
51
0
29 Jul 2017
Learning from Video and Text via Large-Scale Discriminative Clustering
Learning from Video and Text via Large-Scale Discriminative Clustering
Antoine Miech
Jean-Baptiste Alayrac
Piotr Bojanowski
Ivan Laptev
Josef Sivic
29
44
0
27 Jul 2017
Video Highlight Prediction Using Audience Chat Reactions
Video Highlight Prediction Using Audience Chat Reactions
Cheng-Yang Fu
Joon Lee
Joey Tianyi Zhou
Alexander C. Berg
19
34
0
26 Jul 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600
  Papers Survey
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze
  Data
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu
Jongwook Choi
Yeonhwa Kim
Kyung Yoo
Sang-Hun Lee
Gunhee Kim
25
69
0
19 Jul 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks
DeepStory: Video Story QA by Deep Embedded Memory Networks
Kyung-Min Kim
Min-Oh Heo
Seongho Choi
Byoung-Tak Zhang
26
174
0
04 Jul 2017
The "something something" video database for learning and evaluating
  visual common sense
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
39
1,496
0
13 Jun 2017
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
70
1,218
0
02 May 2017
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal
  Attentions
Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions
Amir Mazaheri
Dong-Ming Zhang
M. Shah
17
12
0
15 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People
Generating Descriptions with Grounded and Co-Referenced People
Anna Rohrbach
Marcus Rohrbach
Siyu Tang
Seong Joon Oh
Bernt Schiele
330
72
0
05 Apr 2017
Temporal Tessellation: A Unified Approach for Video Analysis
Temporal Tessellation: A Unified Approach for Video Analysis
Dotan Kaufman
Gil Levi
Tal Hassner
Lior Wolf
19
16
0
21 Dec 2016
MarioQA: Answering Questions by Watching Gameplay Videos
MarioQA: Answering Questions by Watching Gameplay Videos
Jonghwan Mun
Paul Hongsuck Seo
Ilchae Jung
Bohyung Han
50
108
0
06 Dec 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
28
191
0
28 Nov 2016
Visual Dialog
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
69
990
0
26 Nov 2016
A dataset and exploration of models for understanding video data through
  fill-in-the-blank question-answering
A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Tegan Maharaj
Nicolas Ballas
Anna Rohrbach
Aaron Courville
C. Pal
VGen
15
107
0
23 Nov 2016
Video Captioning with Transferred Semantic Attributes
Video Captioning with Transferred Semantic Attributes
Yingwei Pan
Ting Yao
Houqiang Li
Tao Mei
27
329
0
23 Nov 2016
Deep Learning for Video Classification and Captioning
Deep Learning for Video Classification and Captioning
Zuxuan Wu
Ting Yao
Yanwei Fu
Yu-Gang Jiang
3DV
VLM
30
123
0
22 Sep 2016
Learning Action Concept Trees and Semantic Alignment Networks from
  Image-Description Data
Learning Action Concept Trees and Semantic Alignment Networks from Image-Description Data
J. Gao
Ram Nevatia
16
1
0
08 Sep 2016
Title Generation for User Generated Videos
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
35
69
0
25 Aug 2016
Frame- and Segment-Level Features and Candidate Pool Evaluation for
  Video Caption Generation
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation
Rakshith Shetty
Jorma T. Laaksonen
21
94
0
17 Aug 2016
The KIT Motion-Language Dataset
The KIT Motion-Language Dataset
Matthias Plappert
Christian Mandery
Tamim Asfour
193
273
0
13 Jul 2016
VideoMCC: a New Benchmark for Video Comprehension
VideoMCC: a New Benchmark for Video Comprehension
Du Tran
Maksim Bolonkin
Manohar Paluri
Lorenzo Torresani
29
1
0
23 Jun 2016
Bidirectional Long-Short Term Memory for Video Description
Bidirectional Long-Short Term Memory for Video Description
Yi Bin
Yang Yang
Zi Huang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
15 Jun 2016
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
cvpaper.challenge in 2015 - A review of CVPR2015 and DeepSurvey
Hirokatsu Kataoka
Yudai Miyashita
Tomoaki K. Yamabe
Soma Shirakabe
Shin-ichi Sato
...
Kaori Abe
Takaaki Imanari
Naomichi Kobayashi
Shinichiro Morita
Akio Nakamura
24
2
0
26 May 2016
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Beyond Caption To Narrative: Video Captioning With Multiple Sentences
Andrew Shin
Katsunori Ohnishi
Tatsuya Harada
17
31
0
18 May 2016
Movie Description
Movie Description
Anna Rohrbach
Atousa Torabi
Marcus Rohrbach
Niket Tandon
C. Pal
Hugo Larochelle
Aaron Courville
Bernt Schiele
3DV
VGen
32
353
0
12 May 2016
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
24
101
0
09 May 2016
Attributes as Semantic Units between Natural Language and Visual
  Recognition
Attributes as Semantic Units between Natural Language and Visual Recognition
Marcus Rohrbach
VLM
22
3
0
12 Apr 2016
TGIF: A New Dataset and Benchmark on Animated GIF Description
TGIF: A New Dataset and Benchmark on Animated GIF Description
Yuncheng Li
Yale Song
Liangliang Cao
Joel R. Tetreault
Larry Goldberg
A. Jaimes
Jiebo Luo
25
270
0
10 Apr 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity
  Understanding
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar A. Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
43
1,224
0
06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined
  from Text
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
28
117
0
06 Apr 2016
Automatic Description Generation from Images: A Survey of Models,
  Datasets, and Evaluation Measures
Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures
Raffaella Bernardi
Ruken Cakici
Desmond Elliott
Aykut Erdem
Erkut Erdem
Nazli Ikizler-Cinbis
Frank Keller
A. Muscat
Barbara Plank
EGVM
VLM
27
363
0
15 Jan 2016
Video captioning with recurrent networks based on frame- and video-level
  features and visual content classification
Video captioning with recurrent networks based on frame- and video-level features and visual content classification
Rakshith Shetty
Jorma T. Laaksonen
13
31
0
09 Dec 2015
MovieQA: Understanding Stories in Movies through Question-Answering
MovieQA: Understanding Stories in Movies through Question-Answering
Makarand Tapaswi
Yukun Zhu
Rainer Stiefelhagen
Antonio Torralba
R. Urtasun
Sanja Fidler
43
736
0
09 Dec 2015
Uncovering Temporal Context for Video Question and Answering
Uncovering Temporal Context for Video Question and Answering
Linchao Zhu
Zhongwen Xu
Yi Yang
Alexander G. Hauptmann
BDL
27
44
0
15 Nov 2015
Oracle performance for visual captioning
Oracle performance for visual captioning
L. Yao
Nicolas Ballas
Kyunghyun Cho
John R. Smith
Yoshua Bengio
VLM
39
8
0
14 Nov 2015
Hierarchical Recurrent Neural Encoder for Video Representation with
  Application to Captioning
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan
Zhongwen Xu
Yi Yang
Fei Wu
Yueting Zhuang
24
385
0
11 Nov 2015
Aligning Books and Movies: Towards Story-like Visual Explanations by
  Watching Movies and Reading Books
Aligning Books and Movies: Towards Story-like Visual Explanations by Watching Movies and Reading Books
Yukun Zhu
Ryan Kiros
R. Zemel
Ruslan Salakhutdinov
R. Urtasun
Antonio Torralba
Sanja Fidler
60
2,517
0
22 Jun 2015
Previous
123456
Next