
Title |
|---|
CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding. ACM Symposium on User Interface Software and Technology (UIST), 2025 |
ImpliHateVid: A Benchmark Dataset and Two-stage Contrastive Learning Framework for Implicit Hate Speech Detection in Videos. Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
A Survey on Video Temporal Grounding with Multimodal Large Language Model. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025 |