ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1505.00487
  4. Cited By
Sequence to Sequence -- Video to Text

Sequence to Sequence -- Video to Text

3 May 2015
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
ArXivPDFHTML

Papers citing "Sequence to Sequence -- Video to Text"

50 / 459 papers shown
Title
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Whodunnit? Crime Drama as a Case for Natural Language Understanding
Lea Frermann
Shay B. Cohen
Mirella Lapata
36
26
0
31 Oct 2017
BENCHIP: Benchmarking Intelligence Processors
BENCHIP: Benchmarking Intelligence Processors
Jinhua Tao
Zidong Du
Qi Guo
Huiying Lan
Lei Zhang
...
Allen Rush
Willian Chen
Shaoli Liu
Yunji Chen
Tianshi Chen
36
35
0
23 Oct 2017
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Tz-Ying Wu
Ting-An Chien
C. Chan
Chan-Wei Hu
Min Sun
46
21
0
20 Oct 2017
Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency
  for Sequence Modeling
Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling
Chaitanya Ahuja
Louis-Philippe Morency
35
4
0
06 Oct 2017
Translating Videos to Commands for Robotic Manipulation with Deep
  Recurrent Neural Networks
Translating Videos to Commands for Robotic Manipulation with Deep Recurrent Neural Networks
Anh Nguyen
Dimitrios Kanoulas
L. Muratore
D. Caldwell
Nikos G. Tsagarakis
27
71
0
01 Oct 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term
  Memory
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
18
215
0
19 Sep 2017
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories,
  Tools and Challenges for the Community
A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community
John E. Ball
Derek T. Anderson
Chee Seng Chan
27
521
0
01 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
19
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
37
20
0
31 Aug 2017
Video Summarization with Attention-Based Encoder-Decoder Networks
Video Summarization with Attention-Based Encoder-Decoder Networks
Zhong Ji
Kailin Xiong
Yanwei Pang
Xuelong Li
6
303
0
31 Aug 2017
Action Classification and Highlighting in Videos
Action Classification and Highlighting in Videos
Atousa Torabi
Leonid Sigal
16
5
0
31 Aug 2017
mAnI: Movie Amalgamation using Neural Imitation
mAnI: Movie Amalgamation using Neural Imitation
Naveen Panwar
Shreya Khare
Neelamadhav Gantayat
Rahul Aralikatte
Senthil Mani
A. Sankaran
DiffM
VGen
21
0
0
16 Aug 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Hierarchically-Attentive RNN for Album Summarization and Storytelling
Licheng Yu
Joey Tianyi Zhou
Tamara L. Berg
36
66
0
09 Aug 2017
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video
  Captioning
From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning
Jingkuan Song
Yuyu Guo
Lianli Gao
Xuelong Li
Alan Hanjalic
Heng Tao Shen
40
219
0
08 Aug 2017
Reinforced Video Captioning with Entailment Rewards
Reinforced Video Captioning with Entailment Rewards
Ramakanth Pasunuru
Joey Tianyi Zhou
28
114
0
07 Aug 2017
Localizing Moments in Video with Natural Language
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
55
927
0
04 Aug 2017
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600
  Papers Survey
cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey
Hirokatsu Kataoka
Soma Shirakabe
Yun He
S. Ueta
Teppei Suzuki
...
Ryousuke Takasawa
Masataka Fuchida
Yudai Miyashita
Kazushige Okayasu
Yuta Matsuzaki
30
1
0
20 Jul 2017
Supervising Neural Attention Models for Video Captioning by Human Gaze
  Data
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu
Jongwook Choi
Yeonhwa Kim
Kyung Yoo
Sang-Hun Lee
Gunhee Kim
25
69
0
19 Jul 2017
Show and Recall: Learning What Makes Videos Memorable
Show and Recall: Learning What Makes Videos Memorable
Sumit Shekhar
Dhruv Singal
Harvineet Singh
Manav Kedia
Akhil Shetty
20
40
0
17 Jul 2017
Large-scale Video Classification guided by Batch Normalized LSTM
  Translator
Large-scale Video Classification guided by Batch Normalized LSTM Translator
Jae Hyeon Yoo
VLM
20
11
0
13 Jul 2017
Aggregating Frame-level Features for Large-Scale Video Classification
Aggregating Frame-level Features for Large-Scale Video Classification
Shaoxiang Chen
Xi Wang
Yongyi Tang
Xinpeng Chen
Zuxuan Wu
Yu-Gang Jiang
18
22
0
04 Jul 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
Jingkuan Song
Zhao Guo
Lianli Gao
Wu Liu
Dongxiang Zhang
Heng Tao Shen
48
166
0
05 Jun 2017
Learning to Pour
Learning to Pour
Yongqiang Huang
Yu Sun
35
18
0
25 May 2017
Action Tubelet Detector for Spatio-Temporal Action Localization
Action Tubelet Detector for Spatio-Temporal Action Localization
Vicky Kalogeiton
Philippe Weinzaepfel
V. Ferrari
Cordelia Schmid
35
324
0
04 May 2017
The Forgettable-Watcher Model for Video Question Answering
The Forgettable-Watcher Model for Video Question Answering
Hongyang Xue
Zhou Zhao
Deng Cai
21
9
0
03 May 2017
Show, Adapt and Tell: Adversarial Training of Cross-domain Image
  Captioner
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen
Yuan-Hong Liao
Ching-Yao Chuang
W. Hsu
Jianlong Fu
Min Sun
31
141
0
02 May 2017
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
65
1,218
0
02 May 2017
Multi-Task Video Captioning with Video and Entailment Generation
Multi-Task Video Captioning with Video and Entailment Generation
Ramakanth Pasunuru
Joey Tianyi Zhou
33
116
0
24 Apr 2017
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Y. Jang
Yale Song
Youngjae Yu
Youngjin Kim
Gunhee Kim
34
547
0
14 Apr 2017
Spatial Memory for Context Reasoning in Object Detection
Spatial Memory for Context Reasoning in Object Detection
Xinlei Chen
Abhinav Gupta
ObjD
25
164
0
13 Apr 2017
Predictive-Corrective Networks for Action Detection
Predictive-Corrective Networks for Action Detection
Achal Dave
Olga Russakovsky
Deva Ramanan
AI4TS
38
55
0
12 Apr 2017
Egocentric Video Description based on Temporally-Linked Sequences
Egocentric Video Description based on Temporally-Linked Sequences
Marc Bolaños
Álvaro Peris
F. Casacuberta
Sergi Soler
Petia Radeva
EgoV
34
25
0
07 Apr 2017
Generating Descriptions with Grounded and Co-Referenced People
Generating Descriptions with Grounded and Co-Referenced People
Anna Rohrbach
Marcus Rohrbach
Siyu Tang
Seong Joon Oh
Bernt Schiele
330
72
0
05 Apr 2017
Weakly Supervised Dense Video Captioning
Weakly Supervised Dense Video Captioning
Zhiqiang Shen
Jianguo Li
Zhou Su
Minjun Li
Yurong Chen
Yu-Gang Jiang
Xiangyang Xue
32
134
0
05 Apr 2017
Survey of the State of the Art in Natural Language Generation: Core
  tasks, applications and evaluation
Survey of the State of the Art in Natural Language Generation: Core tasks, applications and evaluation
Albert Gatt
E. Krahmer
LM&MA
ELM
27
810
0
29 Mar 2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement
  Learning
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning
Abhishek Das
Satwik Kottur
J. M. F. Moura
Stefan Lee
Dhruv Batra
OffRL
31
423
0
20 Mar 2017
Improving Interpretability of Deep Neural Networks with Semantic
  Information
Improving Interpretability of Deep Neural Networks with Semantic Information
Yinpeng Dong
Hang Su
Jun Zhu
Bo Zhang
27
122
0
12 Mar 2017
MAT: A Multimodal Attentive Translator for Image Captioning
MAT: A Multimodal Attentive Translator for Image Captioning
Chang Liu
F. Sun
Changhu Wang
Feng Wang
Alan Yuille
20
58
0
18 Feb 2017
Dataset Augmentation in Feature Space
Dataset Augmentation in Feature Space
Terrance Devries
Graham W. Taylor
23
423
0
17 Feb 2017
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation
Iacer Calixto
Qun Liu
N. Campbell
40
179
0
04 Feb 2017
Incorporating Global Visual Features into Attention-Based Neural Machine
  Translation
Incorporating Global Visual Features into Attention-Based Neural Machine Translation
Iacer Calixto
Qun Liu
Nick Campbell
32
154
0
23 Jan 2017
Top-down Visual Saliency Guided by Captions
Top-down Visual Saliency Guided by Captions
Vasili Ramanishka
Abir Das
Jianming Zhang
Kate Saenko
21
142
0
21 Dec 2016
Temporal Tessellation: A Unified Approach for Video Analysis
Temporal Tessellation: A Unified Approach for Video Analysis
Dotan Kaufman
Gil Levi
Tal Hassner
Lior Wolf
19
16
0
21 Dec 2016
Few-Shot Object Recognition from Machine-Labeled Web Images
Few-Shot Object Recognition from Machine-Labeled Web Images
Zhongwen Xu
Linchao Zhu
Yi Yang
VLM
18
66
0
19 Dec 2016
Video Captioning with Multi-Faceted Attention
Video Captioning with Multi-Faceted Attention
Xiang Long
Chuang Gan
Gerard de Melo
24
88
0
01 Dec 2016
Sequential Person Recognition in Photo Albums with a Recurrent Network
Sequential Person Recognition in Photo Albums with a Recurrent Network
Yao Li
Guosheng Lin
Bohan Zhuang
Lingqiao Liu
Chunhua Shen
Anton Van Den Hengel
26
29
0
30 Nov 2016
Deep Quantization: Encoding Convolutional Activations with Deep
  Generative Model
Deep Quantization: Encoding Convolutional Activations with Deep Generative Model
Zhaofan Qiu
Ting Yao
Tao Mei
DRL
MQ
32
59
0
29 Nov 2016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi
C. Grana
Rita Cucchiara
28
191
0
28 Nov 2016
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu
Zhongwen Xu
Yi Yang
35
76
0
28 Nov 2016
Visual Dialog
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
69
990
0
26 Nov 2016
Previous
123...10789
Next