2301.05997
Cited By
Exploiting Auxiliary Caption for Video Grounding
15 January 2023
Hongxiang Li
Meng Cao
Xuxin Cheng
Zhihong Zhu
Yaowei Li
Yuexian Zou
Papers citing "Exploiting Auxiliary Caption for Video Grounding"
9 / 9 papers shown
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
Haoran Tang
Meng Cao
Jinfa Huang
Ruyang Liu
Peng Jin
Ge Li
Xiaodan Liang
Mamba
99
4
0
24 Feb 2025
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension
Ting Liu
Zunnan Xu
Yue Hu
Liangtao Shi
Zhiqiang Wang
Quanjun Yin
65
2
0
03 Jan 2025
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
Xiaoyu Jin
Zunnan Xu
Mingwen Ou
Wenming Yang
DiffM
46
7
0
29 Aug 2024
Textual Inversion and Self-supervised Refinement for Radiology Report Generation
Yuanjiang Luo
Hongxiang Li
Xuan Wu
Meng Cao
Xiaoshuang Huang
Zhihong Zhu
Peixi Liao
Hu Chen
Yi Zhang
MedIm
35
2
0
31 May 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
Meng Cao
Haoran Tang
Jinfa Huang
Peng Jin
Can Zhang
Ruyang Liu
Long Chen
Xiaodan Liang
Li-ming Yuan
Ge Li
101
11
0
29 May 2024
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Zunnan Xu
Yukang Lin
Haonan Han
Sicheng Yang
Ronghui Li
Yachao Zhang
Xiu Li
Mamba
46
25
0
14 Mar 2024
Transform-Equivariant Consistency Learning for Temporal Sentence Grounding
Daizong Liu
Xiaoye Qu
Jianfeng Dong
Pan Zhou
Zichuan Xu
Yining Qi
Xing Di
Weining Lu
Yu Cheng
51
8
0
06 May 2023
Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition
Haotao Wang
Aston Zhang
Yi Zhu
Shuai Zheng
Mu Li
Alexander J. Smola
Zhangyang Wang
OODD
140
48
0
04 Jul 2022
Natural Language Video Localization: A Revisit in Span-based Question Answering Framework
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
113
84
0
26 Feb 2021