
Title |
|---|
![]() EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2025 |
Video Action DifferencingInternational Conference on Learning Representations (ICLR), 2025 |
![]() Progress-Aware Video Frame CaptioningComputer Vision and Pattern Recognition (CVPR), 2024 |
![]() CAST: Cross-modal Alignment Similarity Test for Vision Language ModelsInternational Conference on Computational Linguistics (COLING), 2024 |
![]() Explaining Datasets in Words: Statistical Models with Natural Language ParametersNeural Information Processing Systems (NeurIPS), 2024 |
![]() Predicting Text Preference Via Structured Comparative ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 |