Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2209.11572
Cited By
v1
v2 (latest)
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval
23 September 2022
Xiang Fang
Daizong Liu
Pan Zhou
Yuchong Hu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval"
50 / 57 papers shown
Title
Hierarchical Local-Global Transformer for Temporal Sentence Grounding
Xiang Fang
Daizong Liu
Pan Zhou
Zichuan Xu
Rui Li
95
30
0
31 Aug 2022
Exploring Optical-Flow-Guided Motion and Detection-Based Appearance for Temporal Sentence Grounding
Daizong Liu
Xiang Fang
Wei Hu
Pan Zhou
77
37
0
06 Mar 2022
Unsupervised Temporal Video Grounding with Deep Semantic Clustering
Daizong Liu
Xiaoye Qu
Yinzhen Wang
Xing Di
Kai Zou
Yu Cheng
Zichuan Xu
Pan Zhou
76
51
0
14 Jan 2022
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations
Mohammadreza Zolfaghari
Yi Zhu
Peter V. Gehler
Thomas Brox
178
129
0
30 Sep 2021
Adaptive Proposal Generation Network for Temporal Sentence Localization in Videos
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
71
55
0
14 Sep 2021
Learning Cross-modal Contrastive Features for Video Domain Adaptation
Donghyun Kim
Yi-Hsuan Tsai
Bingbing Zhuang
Xiang Yu
Stan Sclaroff
Kate Saenko
Manmohan Chandraker
75
72
0
26 Aug 2021
Uncertainty-aware Clustering for Unsupervised Domain Adaptive Object Re-identification
Pengfei Wang
Changxing Ding
Wentao Tan
Biwei Huang
Kui Jia
Dacheng Tao
62
41
0
22 Aug 2021
Weakly Supervised Temporal Adjacent Network for Language Grounding
Yuechen Wang
Jiajun Deng
Wen-gang Zhou
Houqiang Li
89
67
0
30 Jun 2021
Context-aware Biaffine Localizing Network for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Yu Cheng
Wei Wei
Zichuan Xu
Yulai Xie
63
145
0
22 Mar 2021
Boundary Proposal Network for Two-Stage Natural Language Video Localization
Shaoning Xiao
Long Chen
Songyang Zhang
Wei Ji
Jian Shao
Lu Ye
Jun Xiao
52
160
0
15 Mar 2021
Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection
Dayan Guan
Jiaxing Huang
Aoran Xiao
Shijian Lu
Yanpeng Cao
241
117
0
27 Feb 2021
Natural Language Video Localization: A Revisit in Span-based Question Answering Framework
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
176
87
0
26 Feb 2021
Embracing Domain Differences in Fake News: Cross-domain Fake News Detection using Multi-modal Data
Amila Silva
Ling Luo
S. Karunasekera
C. Leckie
82
129
0
11 Feb 2021
ANIMC: A Soft Framework for Auto-weighted Noisy and Incomplete Multi-view Clustering
Xiang Fang
Yuchong Hu
Pan Zhou
Dapeng Wu
68
36
0
20 Nov 2020
Unbalanced Incomplete Multi-view Clustering via the Scheme of View Evolution: Weak Views are Meat; Strong Views do Eat
Xiang Fang
Yuchong Hu
Pan Zhou
Dapeng Wu
43
53
0
20 Nov 2020
VLG-Net: Video-Language Graph Matching Network for Video Grounding
Mattia Soldan
Mengmeng Xu
Sisi Qu
Jesper N. Tegnér
Guohao Li
81
70
0
19 Nov 2020
Frame-wise Cross-modal Matching for Video Moment Retrieval
Haoyu Tang
Jihua Zhu
Meng Liu
Zan Gao
Zhiyong Cheng
64
62
0
22 Sep 2020
VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval
Minuk Ma
Sunjae Yoon
Junyeong Kim
Youngjoon Lee
Sunghun Kang
Chang D. Yoo
82
78
0
24 Aug 2020
Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Jieming Zhu
Xiuqiang He
68
69
0
19 Aug 2020
Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
Daizong Liu
Xiaoye Qu
Xiao-Yang Liu
Jianfeng Dong
Pan Zhou
Zichuan Xu
75
129
0
04 Aug 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLM
VLM
OffRL
AI4TS
129
504
0
01 May 2020
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
90
315
0
29 Apr 2020
Weakly-Supervised Multi-Level Attentional Reconstruction Network for Grounding Textual Queries in Videos
Yijun Song
Jingwen Wang
Lin Ma
Zhou Yu
Jun Yu
58
61
0
16 Mar 2020
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
Jonathan Munro
Dima Damen
EgoV
65
194
0
27 Jan 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
203
286
0
24 Jan 2020
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
Songyang Zhang
Houwen Peng
Jianlong Fu
Jiebo Luo
75
470
0
08 Dec 2019
xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
M. Jaritz
Tuan-Hung Vu
Raoul de Charette
É. Wirbel
P. Pérez
3DPC
86
198
0
28 Nov 2019
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network
Zhijie Lin
Zhou Zhao
Zhu Zhang
Qi. Wang
Huasheng Liu
64
150
0
19 Nov 2019
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
70
242
0
18 Nov 2019
Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
Yitian Yuan
Lin Ma
Jingwen Wang
Wei Liu
Wenwu Zhu
84
244
0
31 Oct 2019
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval
Reuben Tan
Huijuan Xu
Kate Saenko
Bryan A. Plummer
64
67
0
27 Sep 2019
Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction
Jingwen Wang
Lin Ma
Wenhao Jiang
76
182
0
11 Sep 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
69
147
0
20 Aug 2019
Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video
Zhenfang Chen
Lin Ma
Wenhan Luo
Kwan-Yee K. Wong
95
103
0
06 Jun 2019
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
Zhu Zhang
Zhijie Lin
Zhou Zhao
Zhenxin Xiao
51
213
0
06 Jun 2019
Weakly Supervised Video Moment Retrieval From Text Queries
Niluthpol Chowdhury Mithun
S. Paul
Amit K. Roy-Chowdhury
123
194
0
05 Apr 2019
Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-identification
Zhun Zhong
Liang Zheng
Zhiming Luo
Shaozi Li
Yi Yang
OOD
116
564
0
03 Apr 2019
Weakly Supervised Dense Event Captioning in Videos
Xuguang Duan
Wen-bing Huang
Chuang Gan
Jingdong Wang
Wenwu Zhu
Junzhou Huang
69
150
0
10 Dec 2018
MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment
Da Zhang
Xiyang Dai
Xin Eric Wang
Yuan-fang Wang
L. Davis
78
305
0
30 Nov 2018
MAC: Mining Activity Concepts for Language-based Temporal Localization
Runzhou Ge
J. Gao
Kan Chen
Ram Nevatia
76
179
0
21 Nov 2018
Localizing Moments in Video with Temporal Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
95
159
0
05 Sep 2018
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression
Yitian Yuan
Tao Mei
Wenwu Zhu
82
333
0
19 Apr 2018
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
Huijuan Xu
Kun He
Bryan A. Plummer
Leonid Sigal
Stan Sclaroff
Kate Saenko
CLIP
67
323
0
13 Apr 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
Licheng Yu
Zhe Lin
Xiaohui Shen
Jimei Yang
Xin Lu
Joey Tianyi Zhou
Tamara L. Berg
ObjD
111
831
0
24 Jan 2018
MHTN: Modal-adversarial Hybrid Transfer Network for Cross-modal Retrieval
Xin Huang
Yuxin Peng
Mingkuan Yuan
GAN
59
110
0
08 Aug 2017
Query-guided Regression Network with Context Policy for Phrase Grounding
Kan Chen
Rama Kovvuri
Ram Nevatia
68
142
0
04 Aug 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
789
132,454
0
12 Jun 2017
TALL: Temporal Activity Localization via Language Query
J. Gao
Chen Sun
Zhenheng Yang
Ram Nevatia
127
824
0
05 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
144
1,250
0
02 May 2017
Adversarial Discriminative Domain Adaptation
Eric Tzeng
Judy Hoffman
Kate Saenko
Trevor Darrell
GAN
OOD
270
4,673
0
17 Feb 2017
1
2
Next