ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.02982
  4. Cited By
CLIP-guided Prototype Modulating for Few-shot Action Recognition

CLIP-guided Prototype Modulating for Few-shot Action Recognition

6 March 2023
Xiang Wang
Shiwei Zhang
Jun Cen
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
    VLM
ArXivPDFHTML

Papers citing "CLIP-guided Prototype Modulating for Few-shot Action Recognition"

29 / 29 papers shown
Title
Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition
Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition
Congqi Cao
Peiheng Han
Y. Zhang
Yating Yu
Qinyi Lv
Lingtong Min
Yanning Zhang
VLM
80
0
0
09 May 2025
DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification
DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification
Minghui Lin
Shu Wang
Xiang Wang
Jianhua Tang
Longbin Fu
Zhengrong Zuo
Nong Sang
VLM
60
0
0
15 Apr 2025
VTD-CLIP: Video-to-Text Discretization via Prompting CLIP
VTD-CLIP: Video-to-Text Discretization via Prompting CLIP
Wencheng Zhu
Yuexin Wang
Hongxuan Li
Pengfei Zhu
Q. Hu
CLIP
60
0
0
24 Mar 2025
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers
Mert Bulent Sariyildiz
Philippe Weinzaepfel
Thomas Lucas
Pau de Jorge
Diane Larlus
Yannis Kalantidis
71
0
0
18 Mar 2025
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Shuang Cui
Yi Li
Jiangmeng Li
Xiongxin Tang
Fuchun Sun
Fanjiang Xu
Hui Xiong
76
0
0
15 Jan 2025
TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition
TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition
Yilong Wang
Zilin Gao
Qilong Wang
Zhaofeng Chen
P. Li
Q. Hu
102
1
0
28 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
131
1
0
18 Nov 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via
  Regional Bootstrapping
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
Taolin Zhang
Jinqiao Wang
Hang Guo
Tao Dai
Bin Chen
Shu-Tao Xia
VLM
TTA
42
0
0
20 Oct 2024
Task-Adapter: Task-specific Adaptation of Image Models for Few-shot
  Action Recognition
Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition
Congqi Cao
Guibiao Liao
Yating Yu
Kanglin Liu
Lingtong Min
Yanning Zhang
67
4
0
01 Aug 2024
SOAP: Enhancing Spatio-Temporal Relation and Motion Information
  Capturing for Few-Shot Action Recognition
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
Wenbo Huang
Jinghui Zhang
Xuwei Qian
Zhen Wu
Meng Wang
Lei Zhang
44
1
0
23 Jul 2024
A Comprehensive Review of Few-shot Action Recognition
A Comprehensive Review of Few-shot Action Recognition
Yuyang Wanyan
Xiaoshan Yang
Weiming Dong
Changsheng Xu
VLM
92
3
0
20 Jul 2024
DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain
  Few-shot Action Recognition
DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Fei-Yu Guo
YiKang Wang
Han Qi
Li Zhu
Jing Sun
62
2
0
08 Jul 2024
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via
  Multi-modal LLM
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM
Huaxin Zhang
Xiaohao Xu
Xiang Wang
Jialong Zuo
Chuchu Han
Xiaonan Huang
Changxin Gao
Yuehuan Wang
Nong Sang
71
18
0
18 Jun 2024
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition
Hongyu Qu
Rui Yan
Xiangbo Shu
Haoliang Gao
Peng Huang
Guo-Sen Xie
72
4
0
03 May 2024
Using Left and Right Brains Together: Towards Vision and Language
  Planning
Using Left and Right Brains Together: Towards Vision and Language Planning
Jun Cen
Chenfei Wu
Xiao Liu
Sheng-Siang Yin
Yixuan Pei
Jinglong Yang
Qifeng Chen
Nan Duan
Jianguo Zhang
68
3
0
16 Feb 2024
Multi-view Distillation based on Multi-modal Fusion for Few-shot Action
  Recognition(CLIP-$\mathrm{M^2}$DF)
Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition(CLIP-M2\mathrm{M^2}M2DF)
Fei-Yu Guo
YiKang Wang
Han Qi
WenPing Jin
Li Zhu
37
2
0
16 Jan 2024
D$^2$ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for
  Few-shot Action Recognition
D2^22ST-Adapter: Disentangled-and-Deformable Spatio-Temporal Adapter for Few-shot Action Recognition
Wenjie Pei
Qizhong Tan
Guangming Lu
Jiandong Tian
48
3
0
03 Dec 2023
Consistency Prototype Module and Motion Compensation for Few-Shot Action
  Recognition (CLIP-CP$\mathbf{M^2}$C)
Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CPM2\mathbf{M^2}M2C)
Fei-Yu Guo
Li Zhu
YiKang Wang
Han Qi
57
2
0
02 Dec 2023
Few-shot Action Recognition with Captioning Foundation Models
Few-shot Action Recognition with Captioning Foundation Models
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
VLM
78
7
0
16 Oct 2023
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
Xin Li
Dongze Lian
Zhihe Lu
Jiawang Bai
Zhibo Chen
Xinchao Wang
VLM
69
63
0
24 Sep 2023
Big-model Driven Few-shot Continual Learning
Big-model Driven Few-shot Continual Learning
Ziqi Gu
Chunyan Xu
Zihan Lu
Xin Liu
Anbo Dai
Zhen Cui
CLL
40
1
0
02 Sep 2023
Multimodal Adaptation of CLIP for Few-Shot Action Recognition
Multimodal Adaptation of CLIP for Few-Shot Action Recognition
Jiazheng Xing
Mengmeng Wang
Xiaojun Hou
Guangwen Dai
Jingdong Wang
Yong-Jin Liu
VLM
27
1
0
03 Aug 2023
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot
  Action Recognition
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
Xiang Wang
Shiwei Zhang
Zhiwu Qing
Changxin Gao
Yingya Zhang
Deli Zhao
Nong Sang
29
42
0
03 Apr 2023
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language
  Modeling
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling
Renrui Zhang
Rongyao Fang
Wei Zhang
Peng Gao
Kunchang Li
Jifeng Dai
Yu Qiao
Hongsheng Li
VLM
205
387
0
06 Nov 2021
ActionCLIP: A New Paradigm for Video Action Recognition
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
154
366
0
17 Sep 2021
Learning to Prompt for Vision-Language Models
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
374
2,307
0
02 Sep 2021
Open-vocabulary Object Detection via Vision and Language Knowledge
  Distillation
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLM
ObjD
230
900
0
28 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
353
3,749
0
11 Feb 2021
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
638
11,762
0
09 Mar 2017
1