1505.00487
Cited By
Sequence to Sequence -- Video to Text
3 May 2015
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
Papers citing "Sequence to Sequence -- Video to Text"
50 / 459 papers shown
Title
Towards Audio to Scene Image Synthesis using Generative Adversarial Network
Chia-Hung Wan
Shun-Po Chuang
Hung-yi Lee
GAN
20
61
0
13 Aug 2018
Live Video Comment Generation Based on Surrounding Frames and Live Comments
Damai Dai
VGen
8
0
0
13 Aug 2018
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Victor Escorcia
Ranjay Krishna
S. Buch
Cuong Duc Dao
42
65
0
11 Aug 2018
Road Segmentation Using CNN and Distributed LSTM
Yecheng Lyu
Lin Bai
Xinming Huang
25
6
0
10 Aug 2018
A Joint Sequence Fusion Model for Video Question Answering and Retrieval
Youngjae Yu
Jongseok Kim
Gunhee Kim
34
340
0
07 Aug 2018
Doubly Attentive Transformer Machine Translation
Hasan Sait Arslan
Mark Fishel
G. Anbarjafari
35
13
0
30 Jul 2018
Textual Explanations for Self-Driving Vehicles
Jinkyu Kim
Anna Rohrbach
Trevor Darrell
John F. Canny
Zeynep Akata
10
328
0
30 Jul 2018
Improving Sequential Determinantal Point Processes for Supervised Video Summarization
Aidean Sharghi
Ali Borji
Chengtao Li
Tianbao Yang
Boqing Gong
AI4TS
33
47
0
28 Jul 2018
Move Forward and Tell: A Progressive Generator of Video Descriptions
Yilei Xiong
Bo Dai
Dahua Lin
29
101
0
26 Jul 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang
Lin Ma
Yu-Gang Jiang
Wen Liu
Tong Zhang
ObjD
33
233
0
26 Jul 2018
Video Storytelling: Textual Summaries for Events
Junnan Li
Yongkang Wong
Qi Zhao
Mohan Kankanhalli
DiffM
21
44
0
25 Jul 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
46
37
0
21 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Chenyu You
24
18
0
08 Jul 2018
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos
Yuan Liu
Moyini Yao
25
1
0
25 Jun 2018
Deep Sequence Learning with Auxiliary Information for Traffic Prediction
Binbing Liao
Jingqing Zhang
Chao Wu
Douglas McIlwraith
Tong Chen
Shengwen Yang
Yike Guo
Fei Wu
30
209
0
13 Jun 2018
PipeDream: Fast and Efficient Pipeline Parallel DNN Training
A. Harlap
Deepak Narayanan
Amar Phanishayee
Vivek Seshadri
Nikhil R. Devanur
G. Ganger
Phillip B. Gibbons
AI4CE
21
252
0
08 Jun 2018
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
18
3
0
05 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wen Liu
Syed Zulqarnain Gilani
Mubarak Shah
11
91
0
01 Jun 2018
Deep Reinforcement Learning For Sequence to Sequence Models
Yaser Keneshloo
Tian Shi
Naren Ramakrishnan
Chandan K. Reddy
AIMat
3DV
OffRL
30
208
0
24 May 2018
A Recurrent Convolutional Neural Network Approach for Sensorless Force Estimation in Robotic Surgery
Arturo Marbán
Vignesh Srinivasan
Wojciech Samek
Josep Fernández
A. Casals
19
84
0
22 May 2018
Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation
Qiuyuan Huang
Zhe Gan
Asli Celikyilmaz
D. Wu
Jianfeng Wang
Xiaodong He
BDL
21
91
0
21 May 2018
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization
Yuan Cheng
Guangya Li
Hai-Bao Chen
S. Tan
Hao Yu
14
3
0
21 May 2018
ECO: Efficient Convolutional Network for Online Video Understanding
Mohammadreza Zolfaghari
Kamaljeet Singh
Thomas Brox
142
496
0
24 Apr 2018
Jointly Localizing and Describing Events for Dense Video Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
27
168
0
23 Apr 2018
Sampling-free Uncertainty Estimation in Gated Recurrent Units with Exponential Families
Seong Jae Hwang
Ronak R. Mehta
Hyunwoo J. Kim
Vikas Singh
BDL
UQCV
12
3
0
19 Apr 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
Qing Guo
Yuan-fang Wang
William Yang Wang
16
76
0
15 Apr 2018
Road Segmentation Using CNN with GRU
Yecheng Lyu
Xinming Huang
29
18
0
14 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
25
319
0
13 Apr 2018
Decoupled Novel Object Captioner
Yuehua Wu
Linchao Zhu
Lu Jiang
Yi Yang
18
62
0
11 Apr 2018
Natural Language Statistical Features of LSTM-generated Texts
Marco Lippi
M. Montemurro
M. Degli Esposti
G. Cristadoro
DeLMO
13
71
0
10 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer
Luowei Zhou
Yingbo Zhou
Jason J. Corso
R. Socher
Caiming Xiong
34
525
0
03 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wen Liu
Yong-mei Xu
19
203
0
31 Mar 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
41
317
0
30 Mar 2018
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani
Hessam Bagherinezhad
Joseph Redmon
Roozbeh Mottaghi
Ali Farhadi
VGen
22
59
0
28 Mar 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
3DV
40
184
0
23 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
22
56
0
21 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Feiyu Xiong
Qingming Huang
17
200
0
05 Mar 2018
Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu
Boyang Albert Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko
11
51
0
28 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Self-Supervised Video Hashing with Hierarchical Binary Auto-encoder
Jingkuan Song
Hanwang Zhang
Xiangpeng Li
Lianli Gao
Ming Wang
Richang Hong
24
245
0
07 Feb 2018
Video-based Sign Language Recognition without Temporal Segmentation
Jie Huang
Wen-gang Zhou
Qilin Zhang
Houqiang Li
Weiping Li
SLR
30
408
0
30 Jan 2018
Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing
Ravi Kiran Sarvadevabhatla
Shiv Surya
Trisha Mittal
Venkatesh Babu Radhakrishnan
24
14
0
29 Jan 2018
Let's Dance: Learning From Online Dance Videos
Daniel Castro
Steven Hickson
Patsorn Sangkloy
Bhavishya Mittal
Sean Dai
James Hays
Irfan Essa
35
24
0
23 Jan 2018
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
32
228
0
29 Nov 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
42
36
0
22 Nov 2017
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks
Franyell Silfa
Gem Dot
J. Arnau
Antonio González
33
39
0
20 Nov 2017
Grounded Objects and Interactions for Video Captioning
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
35
6
0
16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
33
145
0
16 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
24
30
0
07 Nov 2017
Learning Word Embeddings from Speech
Yu-An Chung
James R. Glass
SSL
44
19
0
05 Nov 2017