ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.08029
  4. Cited By
Describing Videos by Exploiting Temporal Structure

Describing Videos by Exploiting Temporal Structure

27 February 2015
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
ArXivPDFHTML

Papers citing "Describing Videos by Exploiting Temporal Structure"

50 / 372 papers shown
Title
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for
  Mixture Signals
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
Xuankai Chang
Pengcheng Guo
Shinji Watanabe
Yusuke Fujita
Jiaming Xu
Bo Xu
Lei Xie
34
21
0
25 Jun 2020
Comprehensive Information Integration Modeling Framework for Video
  Titling
Comprehensive Information Integration Modeling Framework for Video Titling
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Tan Jiang
Jingren Zhou
Hongxia Yang
Fei Wu
31
40
0
24 Jun 2020
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal
  Transformer
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
Vladimir E. Iashin
Esa Rahtu
22
126
0
17 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise
  Multi-View Decoding
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
40
2
0
16 May 2020
Learning from Noisy Labels with Noise Modeling Network
Learning from Noisy Labels with Noise Modeling Network
Zhuolin Jiang
J. Silovský
M. Siu
William Hartmann
H. Gish
Sancar Adali
NoLa
12
2
0
01 May 2020
TEA: Temporal Excitation and Aggregation for Action Recognition
TEA: Temporal Excitation and Aggregation for Action Recognition
Yan-Ran Li
Bin Ji
Xintian Shi
Jianguo Zhang
Bin Kang
Limin Wang
ViT
25
439
0
03 Apr 2020
Multi-modal Dense Video Captioning
Multi-modal Dense Video Captioning
Vladimir E. Iashin
Esa Rahtu
22
164
0
17 Mar 2020
Video Caption Dataset for Describing Human Actions in Japanese
Video Caption Dataset for Describing Human Actions in Japanese
Yutaro Shigeto
Yuya Yoshikawa
Jiaqing Lin
A. Takeuchi
12
3
0
10 Mar 2020
Better Captioning with Sequence-Level Exploration
Better Captioning with Sequence-Level Exploration
Jia Chen
Qin Jin
37
12
0
08 Mar 2020
OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail
  Enhancement
OVC-Net: Object-Oriented Video Captioning with Temporal Graph and Detail Enhancement
Fangyi Zhu
Lei Li
Zhanyu Ma
Guang Chen
Jun Guo
19
1
0
08 Mar 2020
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Infrared and 3D skeleton feature fusion for RGB-D action recognition
Alban Main De Boissiere
R. Noumeir
25
38
0
28 Feb 2020
Hierarchical Memory Decoding for Video Captioning
Hierarchical Memory Decoding for Video Captioning
Aming Wu
Yahong Han
19
2
0
27 Feb 2020
CLARA: Clinical Report Auto-completion
CLARA: Clinical Report Auto-completion
Siddharth Biswal
Cao Xiao
Lucas Glass
M. P. M. Brandon Westover
Jimeng Sun
24
27
0
26 Feb 2020
Object Relational Graph with Teacher-Recommended Learning for Video
  Captioning
Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Ziqi Zhang
Yaya Shi
Chunfen Yuan
Bing Li
Peijin Wang
Weiming Hu
Zhengjun Zha
VLM
37
271
0
26 Feb 2020
MAST: A Memory-Augmented Self-supervised Tracker
MAST: A Memory-Augmented Self-supervised Tracker
Zihang Lai
Erika Lu
Weidi Xie
VOS
24
184
0
18 Feb 2020
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Nested-Wasserstein Self-Imitation Learning for Sequence Generation
Ruiyi Zhang
Changyou Chen
Zhe Gan
Zheng Wen
Wenlin Wang
Lawrence Carin
31
5
0
20 Jan 2020
Spatio-Temporal Ranked-Attention Networks for Video Captioning
Spatio-Temporal Ranked-Attention Networks for Video Captioning
A. Cherian
Jue Wang
Chiori Hori
Tim K. Marks
AI4TS
22
19
0
17 Jan 2020
Delving Deeper into the Decoder for Video Captioning
Delving Deeper into the Decoder for Video Captioning
Haoran Chen
Jianmin Li
Xiaolin Hu
43
34
0
16 Jan 2020
Actions as Moving Points
Actions as Moving Points
Yixuan Li
Zixu Wang
Limin Wang
Gangshan Wu
22
104
0
14 Jan 2020
Vision and Language: from Visual Perception to Content Creation
Vision and Language: from Visual Perception to Content Creation
Tao Mei
Wei Zhang
Ting Yao
VLM
17
8
0
26 Dec 2019
Action Modifiers: Learning from Adverbs in Instructional Videos
Action Modifiers: Learning from Adverbs in Instructional Videos
Hazel Doughty
Ivan Laptev
W. Mayol-Cuevas
Dima Damen
27
30
0
13 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
311
0
04 Dec 2019
Transform-Invariant Convolutional Neural Networks for Image
  Classification and Search
Transform-Invariant Convolutional Neural Networks for Image Classification and Search
Xu Shen
Xinmei Tian
Anfeng He
Shaoyan Sun
Dacheng Tao
OOD
20
42
0
28 Nov 2019
Patch Reordering: a Novel Way to Achieve Rotation and Translation
  Invariance in Convolutional Neural Networks
Patch Reordering: a Novel Way to Achieve Rotation and Translation Invariance in Convolutional Neural Networks
Xu Shen
Xinmei Tian
Shaoyan Sun
Dacheng Tao
14
7
0
28 Nov 2019
Non-Autoregressive Coarse-to-Fine Video Captioning
Non-Autoregressive Coarse-to-Fine Video Captioning
Bang-ju Yang
Yuexian Zou
Fenglin Liu
Can Zhang
27
11
0
27 Nov 2019
SRG: Snippet Relatedness-based Temporal Action Proposal Generator
SRG: Snippet Relatedness-based Temporal Action Proposal Generator
Hyunjun Eun
Sumin Lee
Jinyoung Moon
Jongyoul Park
Chanho Jung
Changick Kim
11
24
0
26 Nov 2019
Characterizing the impact of using features extracted from pre-trained
  models on the quality of video captioning sequence-to-sequence models
Characterizing the impact of using features extracted from pre-trained models on the quality of video captioning sequence-to-sequence models
Menatallh Hammad
May Hammad
Mohamed Elshenawy
22
2
0
22 Nov 2019
Empirical Autopsy of Deep Video Captioning Frameworks
Empirical Autopsy of Deep Video Captioning Frameworks
Nayyer Aafaq
Naveed Akhtar
Wei Liu
Ajmal Mian
19
6
0
21 Nov 2019
Video Captioning with Text-based Dynamic Attention and Step-by-Step
  Learning
Video Captioning with Text-based Dynamic Attention and Step-by-Step Learning
Huanhou Xiao
Jinglun Shi
11
24
0
05 Nov 2019
Diverse Video Captioning Through Latent Variable Expansion
Diverse Video Captioning Through Latent Variable Expansion
Huanhou Xiao
Jinglun Shi
DiffM
35
15
0
26 Oct 2019
Weakly-Supervised Completion Moment Detection using Temporal Attention
Weakly-Supervised Completion Moment Detection using Temporal Attention
Farnoosh Heidarivincheh
Majid Mirmehdi
Dima Damen
27
9
0
22 Oct 2019
Integrating Temporal and Spatial Attentions for VATEX Video Captioning
  Challenge 2019
Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019
Shizhe Chen
Yida Zhao
Yuqing Song
Qin Jin
Qi Wu
11
0
0
15 Oct 2019
Human Action Sequence Classification
Human Action Sequence Classification
Yan Bin Ng
Basura Fernando
30
4
0
07 Oct 2019
Metric-Based Few-Shot Learning for Video Action Recognition
Metric-Based Few-Shot Learning for Video Action Recognition
Chris Careaga
Brian Hutchinson
Nathan Oken Hodas
Lawrence Phillips
22
22
0
14 Sep 2019
A Better Way to Attend: Attention with Trees for Video Question
  Answering
A Better Way to Attend: Attention with Trees for Video Question Answering
Hongyang Xue
Wenqing Chu
Zhou Zhao
Deng Cai
25
33
0
05 Sep 2019
Cooperative Cross-Stream Network for Discriminative Action
  Representation
Cooperative Cross-Stream Network for Discriminative Action Representation
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
18
5
0
27 Aug 2019
Controllable Video Captioning with POS Sequence Guidance Based on Gated
  Fusion Network
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
74
163
0
27 Aug 2019
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation
H. Emami
Majid Moradi Aliabadi
Ming Dong
R. Chinnam
GAN
23
168
0
19 Aug 2019
Interactive Variance Attention based Online Spoiler Detection for
  Time-Sync Comments
Interactive Variance Attention based Online Spoiler Detection for Time-Sync Comments
Wenmian Yang
Weijia Jia
Wenyuan Gao
Xiaojie Zhou
Yutao Luo
13
9
0
09 Aug 2019
SF-Net: Structured Feature Network for Continuous Sign Language
  Recognition
SF-Net: Structured Feature Network for Continuous Sign Language Recognition
Zhaoyang Yang
Zhenmei Shi
Xiaoyong Shen
Yu-Wing Tai
SLR
27
63
0
04 Aug 2019
Prediction and Description of Near-Future Activities in Video
Prediction and Description of Near-Future Activities in Video
T. Mahmud
Mohammad Billah
Mahmudul Hasan
A. Roy-Chowdhury
28
16
0
02 Aug 2019
Learning Visual Actions Using Multiple Verb-Only Labels
Learning Visual Actions Using Multiple Verb-Only Labels
Michael Wray
Dima Damen
25
7
0
25 Jul 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Chenyu You
Jiuxiang Gu
19
29
0
21 Jul 2019
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events
  in Videos
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos
Shizhe Chen
Yuqing Song
Yida Zhao
Qin Jin
Zhaoyang Zeng
Bei Liu
Jianlong Fu
Alexander G. Hauptmann
4
11
0
11 Jul 2019
Video Question Generation via Cross-Modal Self-Attention Networks
  Learning
Video Question Generation via Cross-Modal Self-Attention Networks Learning
Yu-Siang Wang
Hung-Ting Su
Chen-Hsi Chang
Zhe-Yu Liu
Winston H. Hsu
27
9
0
05 Jul 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video
  Captioning
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
16
170
0
11 Jun 2019
FASTER Recurrent Networks for Efficient Video Classification
FASTER Recurrent Networks for Efficient Video Classification
Linchao Zhu
Laura Sevilla-Lara
Du Tran
Matt Feiszli
Yi Yang
Heng Wang
43
6
0
10 Jun 2019
An Attention-based Recurrent Convolutional Network for Vehicle Taillight
  Recognition
An Attention-based Recurrent Convolutional Network for Vehicle Taillight Recognition
Kuan-Hui Lee
Takaaki Tagawa
Jia-Yu Pan
Adrien Gaidon
B. Douillard
ViT
24
15
0
09 Jun 2019
Attention is all you need for Videos: Self-attention based Video
  Summarization using Universal Transformers
Attention is all you need for Videos: Self-attention based Video Summarization using Universal Transformers
Manjot Bilkhu
Siyang Wang
Tushar Dobhal
ViT
11
15
0
06 Jun 2019
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via
  Question Answering
ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering
Zhou Yu
D. Xu
Jun-chen Yu
Ting Yu
Zhou Zhao
Yueting Zhuang
Dacheng Tao
15
435
0
06 Jun 2019
Previous
12345678
Next