MovieCLIP: Visual Scene Recognition in Movies

MovieCLIP: Visual Scene Recognition in Movies

20 October 2022

Krishna Somandepalli

K. Cole-McLaughlin

Shrikanth Narayanan

Papers citing "MovieCLIP: Visual Scene Recognition in Movies"

16 / 16 papers shown

Title
SafeVid: Toward Safety Aligned Video Large Multimodal Models Yixu Wang Jiaxin Song Yifeng Gao Xin Wang Yang Yao Yan Teng Xingjun Ma Yingchun Wang Yu-Gang Jiang 14 0 0 17 May 2025
Pose-Aware Weakly-Supervised Action Segmentation Seth Z. Zhao Reza Ghoddoosian Isht Dwivedi Nakul Agarwal Behzad Dariush 44 0 0 08 Apr 2025
Comparative Analysis of Image, Video, and Audio Classifiers for Automated News Video Segmentation Jonathan Attard Dylan Seychell 50 0 0 27 Mar 2025
Personalized Video Summarization by Multimodal Video Understanding Brian Chen Xiangyuan Zhao Yingnan Zhu 46 1 0 05 Nov 2024
Movie Trailer Genre Classification Using Multimodal Pretrained Features Serkan Sulun Paula Viana M. Davies CLIP 23 2 0 11 Oct 2024
Diffusion Feedback Helps CLIP See Better Wenxuan Wang Quan-Sen Sun Fan Zhang Yepeng Tang Jing Liu Xinlong Wang VLM 53 14 0 29 Jul 2024
Visual Objectification in Films: Towards a New AI Task for Video Interpretation Julie Tores L. Sassatelli Hui-Yin Wu Clement Bergman Lea Andolfi ... F. Precioso Thierry Devars Magali Guaresi Virginie Julliard Sarah Lecossais 38 2 0 24 Jan 2024
MM-AU:Towards Multimodal Understanding of Advertisement Videos Digbalay Bose Rajat Hebbar Tiantian Feng Krishna Somandepalli Anfeng Xu Shrikanth Narayanan 34 5 0 27 Aug 2023
Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models Peiyan Zhang Hao Liu Chaozhuo Li Xing Xie Sunghun Kim Haohan Wang VLM OOD 39 8 0 21 Aug 2023
Long-range Multimodal Pretraining for Movie Understanding Dawit Mureja Argaw Joon-Young Lee Markus Woodson In So Kweon Fabian Caba Heilbron VLM 32 7 0 18 Aug 2023
Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos Md Zahid Hasan Jiajing Chen Jiyang Wang Mohammed Shaiqur Rahman Ameya Joshi Senem Velipasalar Chinmay Hegde Anuj Sharma Soumik Sarkar VLM 55 18 0 16 Jun 2023
A survey of Generative AI Applications Roberto Gozalo-Brizuela Eduardo C. Garrido-Merchán 3DV MedIm 38 80 0 05 Jun 2023
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset Young-Jun Lee ByungSoo Ko Han-Gyu Kim Jonghwan Hyeon Ho-Jin Choi 29 7 0 08 Dec 2022
How Much Can CLIP Benefit Vision-and-Language Tasks? Sheng Shen Liunian Harold Li Hao Tan Joey Tianyi Zhou Anna Rohrbach Kai-Wei Chang Z. Yao Kurt Keutzer CLIP VLM MLLM 204 406 0 13 Jul 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation Xiuye Gu Nayeon Lee Weicheng Kuo Huayu Chen VLM ObjD 227 899 0 28 Apr 2021
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius Heng Wang Lorenzo Torresani ViT 283 1,992 0 09 Feb 2021