ResearchTrend.AI
Span-based Localizing Network for Natural Language Video Localization

29 April 2020
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang

Papers citing "Span-based Localizing Network for Natural Language Video Localization"

50 / 179 papers shown
You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos
Xiang Fang
Daizong Liu
Pan Zhou
Guoshun Nan
23
37
0
14 Mar 2023
Generation-Guided Multi-Level Unified Network for Video Grounding
Xingyi Cheng
Xiangyu Wu
Dong Shen
Hezheng Lin
Fan Yang
21
0
0
14 Mar 2023
Towards Diverse Temporal Grounding under Single Positive Labels
Hao Zhou
Chongyang Zhang
Yanjun Chen
Chuanping Hu
26
1
0
12 Mar 2023
Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
Teng Wang
Jinrui Zhang
Feng Zheng
Wenhao Jiang
Ran Cheng
Ping Luo
VLM
33
11
0
11 Mar 2023
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
Yimeng Zhang
Xin Chen
Jinghan Jia
Sijia Liu
Ke Ding
18
25
0
09 Mar 2023
Jointly Visual- and Semantic-Aware Graph Memory Networks for Temporal Sentence Localization in Videos
Daizong Liu
Pan Zhou
VOS
19
4
0
02 Mar 2023
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo
Jiabo Huang
S. Gong
Hailin Jin
Yang Liu
VGen
21
28
0
28 Feb 2023
Localizing Moments in Long Video Via Multimodal Guidance
Wayner Barrios
Mattia Soldan
Alberto M. Ceballos-Arroyo
Fabian Caba Heilbron
Guohao Li
30
20
0
26 Feb 2023
Tracking Objects and Activities with Attention for Temporal Sentence Grounding
Zeyu Xiong
Daizong Liu
Pan Zhou
Jiahao Zhu
26
5
0
21 Feb 2023
Constraint and Union for Partially-Supervised Temporal Sentence Grounding
Chen Ju
Haicheng Wang
Jinxian Liu
Chaofan Ma
Ya-Qin Zhang
Peisen Zhao
Jianlong Chang
Qi Tian
30
15
0
20 Feb 2023
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Raghav Goyal
E. Mavroudi
Xitong Yang
Sainbayar Sukhbaatar
Leonid Sigal
Matt Feiszli
Lorenzo Torresani
Du Tran
23
7
0
16 Feb 2023
Multi-video Moment Ranking with Multimodal Clue
Danyang Hou
Liang Pang
Yanyan Lan
Huawei Shen
Xueqi Cheng
13
0
0
29 Jan 2023
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding
Juncheng Li
Siliang Tang
Linchao Zhu
Wenqiao Zhang
Yi Yang
Tat-Seng Chua
Fei Wu
Y. Zhuang
BDL
24
14
0
22 Jan 2023
Hypotheses Tree Building for One-Shot Temporal Sentence Localization
Daizong Liu
Xiang Fang
Pan Zhou
Xing Di
Weining Lu
Yu Cheng
32
19
0
05 Jan 2023
NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory
Santhosh Kumar Ramakrishnan
Ziad Al-Halah
Kristen Grauman
117
39
0
02 Jan 2023
Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding
Jiahao Zhu
Daizong Liu
Pan Zhou
Xing Di
Yu Cheng
...
Wenzheng Xu
Zichuan Xu
Yao Wan
Lichao Sun
Zeyu Xiong
27
18
0
02 Jan 2023
MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding
Wei Ji
Long Chen
Yin-wei Wei
Yiming Wu
Tat-Seng Chua
AI4TS
27
18
0
26 Dec 2022
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
Guo Chen
Sen Xing
Zhe Chen
Yi Wang
Kunchang Li
...
Hongjie Zhang
Tong Lu
Yali Wang
Liming Wang
Yu Qiao
35
46
0
17 Nov 2022
An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022
Zhijian Hou
Wanjun Zhong
Lei Ji
Difei Gao
Kun Yan
W. Chan
Chong-Wah Ngo
Zheng Shou
Nan Duan
6
6
0
16 Nov 2022
Visual Answer Localization with Cross-modal Mutual Knowledge Transfer
Yixuan Weng
Bin Li
24
6
0
26 Oct 2022
Weakly-Supervised Temporal Article Grounding
Long Chen
Yulei Niu
Brian Chen
Xudong Lin
G. Han
Christopher Thomas
Hammad A. Ayyubi
Heng Ji
Shih-Fu Chang
AI4TS
29
13
0
22 Oct 2022
Selective Query-guided Debiasing for Video Corpus Moment Retrieval
Sunjae Yoon
Jiajing Hong
Eunseop Yoon
Dahyun Kim
Junyeong Kim
Hee Suk Yoon
Changdong Yoo
38
21
0
17 Oct 2022
Learning to Locate Visual Answer in Video Corpus Using Question
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
8
5
0
11 Oct 2022
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding
Erica K. Shimomoto
Edison Marrese-Taylor
Hiroya Takamura
Ichiro Kobayashi
Hideki Nakayama
Yusuke Miyao
27
7
0
26 Sep 2022
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
Xiang Fang
Daizong Liu
Pan Zhou
Yuchong Hu
77
39
0
23 Sep 2022
CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Zhijian Hou
Wanjun Zhong
Lei Ji
Difei Gao
Kun Yan
W. Chan
Chong-Wah Ngo
Zheng Shou
Nan Duan
AI4TS
34
24
0
22 Sep 2022
Video-Guided Curriculum Learning for Spoken Video Grounding
Yan Xia
Zhou Zhao
Shangwei Ye
Yang Zhao
Haoyuan Li
Yi Ren
26
11
0
01 Sep 2022
Hierarchical Local-Global Transformer for Temporal Sentence Grounding
Xiang Fang
Daizong Liu
Pan Zhou
Zichuan Xu
Rui Li
19
28
0
31 Aug 2022
Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding
Jiachang Hao
Haifeng Sun
Pengfei Ren
Jingyu Wang
Q. Qi
J. Liao
23
26
0
29 Jul 2022
Reducing the Vision and Language Bias for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Wei Hu
19
49
0
27 Jul 2022
Skimming, Locating, then Perusing: A Human-Like Framework for Natural Language Video Localization
Daizong Liu
Wei Hu
22
39
0
27 Jul 2022
EgoEnv: Human-centric environment representations from egocentric video
Tushar Nagarajan
Santhosh Kumar Ramakrishnan
Ruta Desai
James M. Hillis
Kristen Grauman
EgoV
33
19
0
22 Jul 2022
Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
EgoV
29
7
0
04 Jul 2022
ReLER@ZJU-Alibaba Submission to the Ego4D Natural Language Queries Challenge 2022
Na Liu
Xiaohan Wang
Xiaobo Li
Yi Yang
Yueting Zhuang
24
18
0
01 Jul 2022
Video Activity Localisation with Uncertainties in Temporal Boundary
Jiabo Huang
Hailin Jin
S. Gong
Yang Liu
24
23
0
26 Jun 2022
Egocentric Video-Language Pretraining
Kevin Qinghong Lin
Alex Jinpeng Wang
Mattia Soldan
Michael Wray
Rui Yan
...
Hongfa Wang
Dima Damen
Guohao Li
Wei Liu
Mike Zheng Shou
VLM
EgoV
46
188
0
03 Jun 2022
You Need to Read Again: Multi-granularity Perception Network for Moment Retrieval in Videos
Xin Sun
Xinyu Wang
Jialin Gao
Qiong Liu
Xiaoping Zhou
30
33
0
25 May 2022
Contrastive Language-Action Pre-training for Temporal Localization
Mengmeng Xu
Erhan Gundogdu
Maksim
Guohao Li
M. Donoser
Loris Bazzani
38
27
0
26 Apr 2022
Video Moment Retrieval from Text Queries via Single Frame Annotation
Ran Cui
Tianwen Qian
Pai Peng
E. Daskalaki
Jingjing Chen
Xiao-Wei Guo
Huyang Sun
Yu-Gang Jiang
17
35
0
20 Apr 2022
Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Xun Long Ng
Kian Eng Ong
Qichen Zheng
Yun Ni
S. Yeo
Xiaozhong Liu
VGen
16
81
0
18 Apr 2022
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding
Ziyue Wu
Junyu Gao
Shucheng Huang
Changsheng Xu
28
4
0
04 Apr 2022
TubeDETR: Spatio-Temporal Video Grounding with Transformers
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
ViT
28
94
0
30 Mar 2022
Searching for fingerspelled content in American Sign Language
Bowen Shi
D. Brentari
G. Shakhnarovich
Karen Livescu
22
5
0
24 Mar 2022
Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
Juncheng Li
Junlin Xie
Long Qian
Linchao Zhu
Siliang Tang
Fei Wu
Yi Yang
Yueting Zhuang
Qing Guo
36
73
0
24 Mar 2022
Towards Visual-Prompt Temporal Answering Grounding in Medical Instructional Video
Bin Li
Yixuan Weng
Bin Sun
Shutao Li
32
24
0
13 Mar 2022
Multi-Scale Self-Contrastive Learning with Hard Negative Mining for Weakly-Supervised Query-based Video Grounding
Shentong Mo
Daizong Liu
Wei Hu
SSL
21
6
0
08 Mar 2022
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
Daizong Liu
Xiang Fang
Wei Hu
Pan Zhou
17
37
0
06 Mar 2022
A Dataset for Medical Instructional Video Classification and Question Answering
D. Gupta
Kush Attal
Dina Demner-Fushman
42
31
0
30 Jan 2022
Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos
Sangmin Woo
Jinyoung Park
Inyong Koo
Sumin Lee
Minki Jeong
Changick Kim
44
3
0
25 Jan 2022
Temporal Sentence Grounding in Videos: A Survey and Future Directions
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
3DGS
36
38
0
20 Jan 2022