Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1505.00487
Cited By
Sequence to Sequence -- Video to Text
3 May 2015
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence to Sequence -- Video to Text"
50 / 459 papers shown
Title
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Lea Frermann
Shay B. Cohen
Mirella Lapata
36
26
0
31 Oct 2017
BENCHIP: Benchmarking Intelligence Processors
Jinhua Tao
Zidong Du
Qi Guo
Huiying Lan
Lei Zhang
...
Allen Rush
Willian Chen
Shaoli Liu
Yunji Chen
Tianshi Chen
36
35
0
23 Oct 2017
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Tz-Ying Wu
Ting-An Chien
C. Chan
Chan-Wei Hu
Min Sun
46
21
0
20 Oct 2017
Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling
Chaitanya Ahuja
Louis-Philippe Morency
35
4
0
06 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks
Anh Nguyen
Dimitrios Kanoulas
L. Muratore
D. Caldwell
Nikos G. Tsagarakis
27
71
0
01 Oct 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
18
215
0
19 Sep 2017
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
John E. Ball
Derek T. Anderson
Chee Seng Chan
27
521
0
01 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
19
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
37
20
0
31 Aug 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
6
303
0
31 Aug 2017
Action Classification and Highlighting in Videos
Atousa Torabi
Leonid Sigal
16
5
0
31 Aug 2017
mAnI: Movie Amalgamation using Neural Imitation
Naveen Panwar
Shreya Khare
Neelamadhav Gantayat
Rahul Aralikatte
Senthil Mani
A. Sankaran
DiffM
VGen
21
0
0
16 Aug 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Licheng Yu
Joey Tianyi Zhou
Tamara L. Berg
36
66
0
09 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
40
219
0
08 Aug 2017
Reinforced Video Captioning with Entailment Rewards
Ramakanth Pasunuru
Joey Tianyi Zhou
28
114
0
07 Aug 2017
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
55
927
0
04 Aug 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu
Jongwook Choi
Yeonhwa Kim
Kyung Yoo
Sang-Hun Lee
Gunhee Kim
25
69
0
19 Jul 2017
Show and Recall: Learning What Makes Videos Memorable
Sumit Shekhar
Dhruv Singal
Harvineet Singh
Manav Kedia
Akhil Shetty
20
40
0
17 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM Translator
Jae Hyeon Yoo
VLM
20
11
0
13 Jul 2017
Aggregating Frame-level Features for Large-Scale Video Classification
Shaoxiang Chen
Xi Wang
Yongyi Tang
Xinpeng Chen
Zuxuan Wu
Yu-Gang Jiang
18
22
0
04 Jul 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
48
166
0
05 Jun 2017
Learning to Pour
Yongqiang Huang
Yu Sun
35
18
0
25 May 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
35
324
0
04 May 2017
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
21
9
0
03 May 2017
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
W. Hsu
Jianlong Fu
Min Sun
31
141
0
02 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
65
1,218
0
02 May 2017
Multi-Task Video Captioning with Video and Entailment Generation
Ramakanth Pasunuru
Joey Tianyi Zhou
33
116
0
24 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
34
547
0
14 Apr 2017
Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen
Abhinav Gupta
ObjD
25
164
0
13 Apr 2017
Predictive-Corrective Networks for Action Detection
Achal Dave
Olga Russakovsky
Deva Ramanan
AI4TS
38
55
0
12 Apr 2017
Egocentric Video Description based on Temporally-Linked Sequences
Marc Bolaños
Álvaro Peris
F. Casacuberta
Sergi Soler
Petia Radeva
EgoV
34
25
0
07 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People
Anna Rohrbach
Marcus Rohrbach
Siyu Tang
Seong Joon Oh
Bernt Schiele
330
72
0
05 Apr 2017
Weakly Supervised Dense Video Captioning
Zhiqiang Shen
Jianguo Li
Zhou Su
Minjun Li
Yurong Chen
Yu-Gang Jiang
Xiangyang Xue
32
134
0
05 Apr 2017
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation
Albert Gatt
E. Krahmer
LM&MA
ELM
27
810
0
29 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
31
423
0
20 Mar 2017
Improving Interpretability of Deep Neural Networks with Semantic Information
Yinpeng Dong
Hang Su
Jun Zhu
Bo Zhang
27
122
0
12 Mar 2017
MAT: A Multimodal Attentive Translator for Image Captioning
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille
20
58
0
18 Feb 2017
Dataset Augmentation in Feature Space
Terrance Devries
Graham W. Taylor
23
423
0
17 Feb 2017
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation
Iacer Calixto
Qun Liu
N. Campbell
40
179
0
04 Feb 2017
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Iacer Calixto
Qun Liu
Nick Campbell
32
154
0
23 Jan 2017
Top-down Visual Saliency Guided by Captions
Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
21
142
0
21 Dec 2016
Temporal Tessellation: A Unified Approach for Video Analysis
Dotan Kaufman
Gil Levi
Tal Hassner
Lior Wolf
19
16
0
21 Dec 2016
Few-Shot Object Recognition from Machine-Labeled Web Images
Zhongwen Xu
Linchao Zhu
Yi Yang
VLM
18
66
0
19 Dec 2016
Video Captioning with Multi-Faceted Attention
Xiang Long
Chuang Gan
Gerard de Melo
24
88
0
01 Dec 2016
Sequential Person Recognition in Photo Albums with a Recurrent Network
Yao Li
Guosheng Lin
Bohan Zhuang
Lingqiao Liu
Chunhua Shen
Anton Van Den Hengel
26
29
0
30 Nov 2016
Deep Quantization: Encoding Convolutional Activations with Deep Generative Model
Zhaofan Qiu
Ting Yao
Tao Mei
DRL
MQ
32
59
0
29 Nov 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
28
191
0
28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu
Zhongwen Xu
Yi Yang
35
76
0
28 Nov 2016
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
69
990
0
26 Nov 2016
Previous
1
2
3
...
10
7
8
9
Next