ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.08029
  4. Cited By
Describing Videos by Exploiting Temporal Structure

Describing Videos by Exploiting Temporal Structure

27 February 2015
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
ArXivPDFHTML

Papers citing "Describing Videos by Exploiting Temporal Structure"

50 / 372 papers shown
Title
An Attention-Based Approach for Single Image Super Resolution
An Attention-Based Approach for Single Image Super Resolution
Yuan Liu
Yuancheng Wang
Nan Li
Xu Cheng
Yifeng Zhang
Yongming Huang
Guojun Lu
SupR
24
42
0
18 Jul 2018
DeepDiff: Deep-learning for predicting Differential gene expression from
  histone modifications
DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications
Arshdeep Sekhon
Ritambhara Singh
Yanjun Qi
14
52
0
10 Jul 2018
Video Captioning with Boundary-aware Hierarchical Language Decoding and
  Joint Video Prediction
Video Captioning with Boundary-aware Hierarchical Language Decoding and Joint Video Prediction
Xiangxi Shi
Jianfei Cai
Jiuxiang Gu
Chenyu You
19
18
0
08 Jul 2018
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task:
  Dense-Captioning Events in Videos
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos
Yuan Liu
Moyini Yao
17
1
0
25 Jun 2018
RUC+CMU: System Report for Dense Captioning Events in Videos
RUC+CMU: System Report for Dense Captioning Events in Videos
Shizhe Chen
Yuqing Song
Yida Zhao
Jiarong Qiu
Qin Jin
Alexander G. Hauptmann
19
7
0
22 Jun 2018
Ad-Net: Audio-Visual Convolutional Neural Network for Advertisement
  Detection In Videos
Ad-Net: Audio-Visual Convolutional Neural Network for Advertisement Detection In Videos
Shervin Minaee
Imed Bouazizi
Prakash Kolan
Hossein Najafzadeh
20
11
0
22 Jun 2018
Mining for meaning: from vision to language through multiple networks
  consensus
Mining for meaning: from vision to language through multiple networks consensus
Iulia Duta
Andrei Liviu Nicolicioiu
Simion-Vlad Bogolin
Marius Leordeanu
18
3
0
05 Jun 2018
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Video Description: A Survey of Methods, Datasets and Evaluation Metrics
Nayyer Aafaq
Ajmal Mian
Wei Liu
Syed Zulqarnain Gilani
Mubarak Shah
6
91
0
01 Jun 2018
Context-aware Cascade Attention-based RNN for Video Emotion Recognition
Context-aware Cascade Attention-based RNN for Video Emotion Recognition
Man-Chin Sun
Shih-Huan Hsu
Min Yang
Jen-Hsien Chien
30
18
0
30 May 2018
Amortized Context Vector Inference for Sequence-to-Sequence Networks
Amortized Context Vector Inference for Sequence-to-Sequence Networks
S. Chatzis
Aristotelis Charalampous
Kyriacos Tolias
14
0
0
23 May 2018
Hierarchically Structured Reinforcement Learning for Topically Coherent
  Visual Story Generation
Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation
Qiuyuan Huang
Zhe Gan
Asli Celikyilmaz
D. Wu
Jianfeng Wang
Xiaodong He
BDL
21
91
0
21 May 2018
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices
  Compressed with Quantization and Tensorization
DEEPEYE: A Compact and Accurate Video Comprehension at Terminal Devices Compressed with Quantization and Tensorization
Yuan Cheng
Guangya Li
Hai-Bao Chen
S. Tan
Hao Yu
12
3
0
21 May 2018
DeepPhys: Video-Based Physiological Measurement Using Convolutional
  Attention Networks
DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks
W. Chen
Daniel J. McDuff
3DH
HAI
31
470
0
21 May 2018
Stories for Images-in-Sequence by using Visual and Narrative Components
Stories for Images-in-Sequence by using Visual and Narrative Components
Marko Smilevski
Ilija Lalkovski
Gjorgji Madjarov
11
19
0
15 May 2018
Jointly Localizing and Describing Events for Dense Video Captioning
Jointly Localizing and Describing Events for Dense Video Captioning
Yehao Li
Ting Yao
Yingwei Pan
Hongyang Chao
Tao Mei
24
168
0
23 Apr 2018
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal
  Attentions for Video Captioning
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
Qing Guo
Yuan-fang Wang
William Yang Wang
13
76
0
15 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
25
319
0
13 Apr 2018
End-to-End Dense Video Captioning with Masked Transformer
End-to-End Dense Video Captioning with Masked Transformer
Luowei Zhou
Yingbo Zhou
Jason J. Corso
R. Socher
Caiming Xiong
20
524
0
03 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video
  Captioning
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wei Liu
Yong-mei Xu
14
203
0
31 Mar 2018
Reconstruction Network for Video Captioning
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wei Liu
38
317
0
30 Mar 2018
Memory Warps for Learning Long-Term Online Video Representations
Memory Warps for Learning Long-Term Online Video Representations
Tuan-Hung Vu
Wongun Choi
S. Schulter
Manmohan Chandraker
8
11
0
28 Mar 2018
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani
Hessam Bagherinezhad
Joseph Redmon
Roozbeh Mottaghi
Ali Farhadi
VGen
14
59
0
28 Mar 2018
End-to-End Video Captioning with Multitask Reinforcement Learning
End-to-End Video Captioning with Multitask Reinforcement Learning
Lijun Li
Boqing Gong
22
56
0
21 Mar 2018
Attention-based Temporal Weighted Convolutional Neural Network for
  Action Recognition
Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition
J. Zang
Le Wang
Zi-yi Liu
Qilin Zhang
Zhenxing Niu
G. Hua
N. Zheng
19
71
0
19 Mar 2018
Attention-GAN for Object Transfiguration in Wild Images
Attention-GAN for Object Transfiguration in Wild Images
Xinyuan Chen
Chang Xu
Xiaokang Yang
Dacheng Tao
32
176
0
19 Mar 2018
Less Is More: Picking Informative Frames for Video Captioning
Less Is More: Picking Informative Frames for Video Captioning
Yangyu Chen
Shuhui Wang
Feiyu Xiong
Qingming Huang
12
200
0
05 Mar 2018
Joint Event Detection and Description in Continuous Video Streams
Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu
Boyang Albert Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko
8
51
0
28 Feb 2018
Multimodal Named Entity Recognition for Short Social Media Posts
Multimodal Named Entity Recognition for Short Social Media Posts
Seungwhan Moon
Leonardo Neves
Vitor R. Carvalho
25
152
0
22 Feb 2018
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
A Neural Multi-sequence Alignment TeCHnique (NeuMATCH)
Pelin Dogan
Boyang Albert Li
Leonid Sigal
Markus Gross
AI4TS
30
19
0
19 Feb 2018
Online Learning for Effort Reduction in Interactive Neural Machine
  Translation
Online Learning for Effort Reduction in Interactive Neural Machine Translation
Álvaro Peris
F. Casacuberta
8
49
0
10 Feb 2018
Video-based Sign Language Recognition without Temporal Segmentation
Video-based Sign Language Recognition without Temporal Segmentation
Jie Huang
Wen-gang Zhou
Qilin Zhang
Houqiang Li
Weiping Li
SLR
30
408
0
30 Jan 2018
Describing Semantic Representations of Brain Activity Evoked by Visual
  Stimuli
Describing Semantic Representations of Brain Activity Evoked by Visual Stimuli
Eri Matsuo
Ichiro Kobayashi
Shinji Nishimoto
S. Nishida
H. Asoh
18
14
0
19 Jan 2018
RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face
  Alignment
RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment
Xi Peng
Rogerio Feris
Xiaoyu Wang
Dimitris N. Metaxas
CVBM
3DH
29
21
0
17 Jan 2018
Recent Advances in Recurrent Neural Networks
Recent Advances in Recurrent Neural Networks
Hojjat Salehinejad
Sharan Sankar
Joseph Barfett
E. Colak
S. Valaee
AI4TS
30
573
0
29 Dec 2017
Video Captioning via Hierarchical Reinforcement Learning
Video Captioning via Hierarchical Reinforcement Learning
Xin Eric Wang
Wenhu Chen
Jiawei Wu
Yuan-fang Wang
William Yang Wang
24
228
0
29 Nov 2017
Attended End-to-end Architecture for Age Estimation from Facial
  Expression Videos
Attended End-to-end Architecture for Age Estimation from Facial Expression Videos
Wenjie Pei
H. Dibeklioğlu
T. Baltrušaitis
David Tax
CVBM
21
43
0
23 Nov 2017
Integrating both Visual and Audio Cues for Enhanced Video Caption
Wangli Hao
Zhaoxiang Zhang
He Guan
Guibo Zhu
37
36
0
22 Nov 2017
Excitation Backprop for RNNs
Excitation Backprop for RNNs
Sarah Adel Bargal
Andrea Zunino
Donghyun Kim
Jianming Zhang
Vittorio Murino
Stan Sclaroff
24
48
0
18 Nov 2017
Grounded Objects and Interactions for Video Captioning
Grounded Objects and Interactions for Video Captioning
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
35
6
0
16 Nov 2017
Attend and Interact: Higher-Order Object Interactions for Video
  Understanding
Attend and Interact: Higher-Order Object Interactions for Video Understanding
Chih-Yao Ma
Asim Kadav
I. Melvin
Z. Kira
G. Al-Regib
H. Graf
33
145
0
16 Nov 2017
End-to-end Video-level Representation Learning for Action Recognition
End-to-end Video-level Representation Learning for Action Recognition
Jiagang Zhu
Wei Zou
Zheng Zhu
25
89
0
11 Nov 2017
Learning to diagnose from scratch by exploiting dependencies among
  labels
Learning to diagnose from scratch by exploiting dependencies among labels
L. Yao
Eric Poblenz
Dmitry Dagunts
Ben Covington
D. Bernard
Kevin Lyman
30
332
0
28 Oct 2017
ActivityNet Challenge 2017 Summary
ActivityNet Challenge 2017 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Ranjay Krishna
Victor Escorcia
Kenji Hata
S. Buch
45
48
0
22 Oct 2017
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Tz-Ying Wu
Ting-An Chien
C. Chan
Chan-Wei Hu
Min Sun
38
21
0
20 Oct 2017
Monitoring tool usage in surgery videos using boosted convolutional and
  recurrent neural networks
Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks
Hassan Al Hajj
M. Lamard
Pierre-Henri Conze
B. Cochener
G. Quellec
36
3
0
04 Oct 2017
Multimodal Content Analysis for Effective Advertisements on YouTube
Multimodal Content Analysis for Effective Advertisements on YouTube
Nikhita Vedula
Wei Sun
Hyunhwan Lee
Harsh Gupta
Mitsunori Ogihara
Joseph Johnson
Gang Ren
Srinivasan Parthasarathy
29
36
0
12 Sep 2017
Learning the PE Header, Malware Detection with Minimal Domain Knowledge
Learning the PE Header, Malware Detection with Minimal Domain Knowledge
Edward Raff
Jared Sylvester
Charles K. Nicholas
20
118
0
05 Sep 2017
Video Captioning with Guidance of Multimodal Latent Topics
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen
Jia Chen
Qin Jin
Alexander G. Hauptmann
16
67
0
31 Aug 2017
Generating Video Descriptions with Topic Guidance
Generating Video Descriptions with Topic Guidance
Shizhe Chen
Jia Chen
Qin Jin
34
20
0
31 Aug 2017
Action Classification and Highlighting in Videos
Action Classification and Highlighting in Videos
Atousa Torabi
Leonid Sigal
11
5
0
31 Aug 2017
Previous
12345678
Next