ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.02101
  4. Cited By
TALL: Temporal Activity Localization via Language Query
v1v2 (latest)

TALL: Temporal Activity Localization via Language Query

5 May 2017
J. Gao
Chen Sun
Zhenheng Yang
Ram Nevatia
ArXiv (abs)PDFHTML

Papers citing "TALL: Temporal Activity Localization via Language Query"

33 / 433 papers shown
Title
WSLLN: Weakly Supervised Natural Language Localization Networks
WSLLN: Weakly Supervised Natural Language Localization Networks
M. Gao
L. Davis
R. Socher
Caiming Xiong
65
80
0
31 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query
  in Video using Guided Attention
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
94
147
0
20 Aug 2019
Sentence Specified Dynamic Video Thumbnail Generation
Sentence Specified Dynamic Video Thumbnail Generation
Yiitan Yuan
Lin Ma
Wenwu Zhu
75
30
0
12 Aug 2019
Exploiting Temporal Relationships in Video Moment Localization with
  Natural Language
Exploiting Temporal Relationships in Video Moment Localization with Natural Language
Songyang Zhang
Jinsong Su
Jiebo Luo
60
74
0
11 Aug 2019
Finding Moments in Video Collections Using Natural Language
Finding Moments in Video Collections Using Natural Language
Victor Escorcia
Mattia Soldan
Josef Sivic
Guohao Li
Bryan C. Russell
55
7
0
30 Jul 2019
Localizing Unseen Activities in Video via Image Query
Localizing Unseen Activities in Video via Image Query
Zhu Zhang
Zhou Zhao
Zhijie Lin
Jingkuan Song
Deng Cai
ViT
49
13
0
28 Jun 2019
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in
  Videos
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Zhenxin Xiao
65
213
0
06 Jun 2019
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video
  Question Answering
Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering
Junyeong Kim
Minuk Ma
Kyungsu Kim
Sungjin Kim
Chang D. Yoo
62
27
0
28 May 2019
Spatio-temporal Video Re-localization by Warp LSTM
Spatio-temporal Video Re-localization by Warp LSTM
Yang Feng
Lin Ma
Wei Liu
Jiebo Luo
61
38
0
10 May 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
75
229
0
25 Apr 2019
Tripping through time: Efficient Localization of Activities in Videos
Tripping through time: Efficient Localization of Activities in Videos
Meera Hahn
Asim Kadav
James M. Rehg
H. Graf
106
86
0
22 Apr 2019
Referring to Objects in Videos using Spatio-Temporal Identifying
  Descriptions
Referring to Objects in Videos using Spatio-Temporal Identifying Descriptions
Peratham Wiriyathammabhum
Abhinav Shrivastava
Vlad I. Morariu
L. Davis
60
5
0
08 Apr 2019
Weakly Supervised Video Moment Retrieval From Text Queries
Weakly Supervised Video Moment Retrieval From Text Queries
Niluthpol Chowdhury Mithun
S. Paul
Amit K. Roy-Chowdhury
138
194
0
05 Apr 2019
ExCL: Extractive Clip Localization Using Natural Language Descriptions
ExCL: Extractive Clip Localization Using Natural Language Descriptions
Soham Ghosh
Anuva Agarwal
Zarana Parekh
Alexander G. Hauptmann
CLIP
61
153
0
04 Apr 2019
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding
  Natural Language Descriptions in Videos
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
Dongliang He
Xiang Zhao
Jizhou Huang
Fu Li
Xiao-Chang Liu
Shilei Wen
81
154
0
21 Jan 2019
Weakly Supervised Dense Event Captioning in Videos
Weakly Supervised Dense Event Captioning in Videos
Xuguang Duan
Wen-bing Huang
Chuang Gan
Jingdong Wang
Wenwu Zhu
Junzhou Huang
86
151
0
10 Dec 2018
Multi-modal Capsule Routing for Actor and Action Video Segmentation
  Conditioned on Natural Language Queries
Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries
Bruce McIntosh
Kevin Duarte
Yogesh S Rawat
M. Shah
MedIm
64
17
0
02 Dec 2018
MAN: Moment Alignment Network for Natural Language Moment Retrieval via
  Iterative Graph Adjustment
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
Da Zhang
Xiyang Dai
Xin Eric Wang
Yuan-fang Wang
L. Davis
86
305
0
30 Nov 2018
MAC: Mining Activity Concepts for Language-based Temporal Localization
MAC: Mining Activity Concepts for Language-based Temporal Localization
Runzhou Ge
J. Gao
Kan Chen
Ram Nevatia
84
179
0
21 Nov 2018
TVQA: Localized, Compositional Video Question Answering
TVQA: Localized, Compositional Video Question Answering
Muhammad Abdul Wahab
Licheng Yu
Mounir Nasr Allah
Tamara L. Berg
116
643
0
05 Sep 2018
Localizing Moments in Video with Temporal Language
Localizing Moments in Video with Temporal Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
100
159
0
05 Sep 2018
Video Re-localization
Video Re-localization
Yang Feng
Lin Ma
Wen Liu
Tong Zhang
Jiebo Luo
108
71
0
05 Aug 2018
CTAP: Complementary Temporal Action Proposal Generation
CTAP: Complementary Temporal Action Proposal Generation
J. Gao
Kan Chen
Ram Nevatia
ViT
68
178
0
12 Jul 2018
To Find Where You Talk: Temporal Sentence Localization in Video with
  Attention Based Location Regression
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression
Yitian Yuan
Tao Mei
Wenwu Zhu
86
333
0
19 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
80
323
0
13 Apr 2018
Bidirectional Attentive Fusion with Context Gating for Dense Video
  Captioning
Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning
Jingwen Wang
Wenhao Jiang
Lin Ma
Wen Liu
Yong-mei Xu
94
208
0
31 Mar 2018
Motion-Appearance Co-Memory Networks for Video Question Answering
Motion-Appearance Co-Memory Networks for Video Question Answering
J. Gao
Runzhou Ge
Kan Chen
Ram Nevatia
143
241
0
29 Mar 2018
Actor and Action Video Segmentation from a Sentence
Actor and Action Video Segmentation from a Sentence
Kirill Gavrilyuk
Amir Ghodrati
Zhenyang Li
Cees G. M. Snoek
VLM
85
151
0
20 Mar 2018
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding
Kan Chen
J. Gao
Ram Nevatia
71
90
0
11 Mar 2018
Joint Event Detection and Description in Continuous Video Streams
Joint Event Detection and Description in Continuous Video Streams
Huijuan Xu
Boyang Albert Li
Vasili Ramanishka
Leonid Sigal
Kate Saenko
55
53
0
28 Feb 2018
Online Detection of Action Start in Untrimmed, Streaming Videos
Online Detection of Action Start in Untrimmed, Streaming Videos
Zheng Shou
Junting Pan
Jonathan Chan
K. Miyazawa
Hassan Mansour
A. Vetro
Xavier Giró-i-Nieto
Shih-Fu Chang
144
61
0
19 Feb 2018
Localizing Moments in Video with Natural Language
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
133
954
0
04 Aug 2017
RED: Reinforced Encoder-Decoder Networks for Action Anticipation
RED: Reinforced Encoder-Decoder Networks for Action Anticipation
J. Gao
Zhenheng Yang
Ram Nevatia
110
197
0
16 Jul 2017
Previous
123456789