Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.10091
Cited By
Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition
20 April 2023
Jun Zhu
Jia Jin
Zihan Yang
Xiaohao Wu
X. Wang
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition"
8 / 8 papers shown
Title
Adversarial Semantic and Label Perturbation Attack for Pedestrian Attribute Recognition
Weizhe Kong
Xiao Wang
Ruichong Gao
Chenglong Li
Yu Zhang
Xing Yang
Yaowei Wang
Jin Tang
AAML
58
0
0
29 May 2025
CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention
Yaxiong Chen
Minghong Wei
Zixuan Zheng
Jingliang Hu
Yilei Shi
Shengwu Xiong
Xiao Xiang Zhu
Lichao Mou
MedIm
80
1
0
20 Mar 2025
SPACE: SPAtial-aware Consistency rEgularization for anomaly detection in Industrial applications
Daehwan Kim
Hyungmin Kim
Daun Jeong
Sungho Suh
Hansang Cho
109
0
0
05 Nov 2024
Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Tianlin Li
Qian Zhu
Jiandong Jin
Jun Zhu
Futian Wang
Bowei Jiang
Yaowei Wang
Yonghong Tian
ViT
84
4
0
27 Apr 2024
KVQ: Kwai Video Quality Assessment for Short-form Videos
Yiting Lu
Xin Li
Yajing Pei
Kun Yuan
Qizhi Xie
Yunpeng Qu
Ming Sun
Chao Zhou
Zhibo Chen
110
20
0
11 Feb 2024
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-based Person Re-Identification
Yajing Zhai
Yawen Zeng
Zhiyong Huang
Zheng Qin
Xin Jin
Dandan Cao
64
18
0
28 Dec 2023
SequencePAR: Understanding Pedestrian Attributes via A Sequence Generation Paradigm
Jiandong Jin
Tianlin Li
Chenglong Li
Lili Huang
Jin Tang
AI4TS
62
7
0
04 Dec 2023
Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models
Dong Li
Jiandong Jin
Yuhao Zhang
Yanlin Zhong
Yaoyang Wu
Lan Chen
Tianlin Li
Bin Luo
111
6
0
30 Nov 2023
1