
Title |
|---|
![]() Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior RecognitionIEEE transactions on multimedia (IEEE TMM), 2024 |
![]() Contextual AD Narration with Interleaved Multimodal SequenceComputer Vision and Pattern Recognition (CVPR), 2024 |
![]() Knowledge Conflicts for LLMs: A SurveyConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial
Creation on Physical TasksInternational Conference on Human Factors in Computing Systems (CHI), 2024 |
![]() CAT: Enhancing Multimodal Large Language Model to Answer Questions in
Dynamic Audio-Visual ScenariosEuropean Conference on Computer Vision (ECCV), 2024 |