Text-Enhanced Zero-Shot Action Recognition: A training-free approach

29 August 2024

Massimo Bosetti

Shibingfeng Zhang

Benedetta Liberatori

Papers citing "Text-Enhanced Zero-Shot Action Recognition: A training-free approach"

Title
Language-based Action Concept Spaces Improve Video Self-Supervised Learning Kanchana Ranasinghe Michael S. Ryoo SSL VLM 77 12 0 20 Jul 2023
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts Karsten Roth Jae Myung Kim A. Sophia Koepke Oriol Vinyals Cordelia Schmid Zeynep Akata VLM 58 74 0 12 Jun 2023
What does a platypus look like? Generating customized prompts for zero-shot image classification Sarah M Pratt Ian Covert Rosanne Liu Ali Farhadi VLM 160 223 0 07 Sep 2022
Expanding Language-Image Pretrained Models for General Video Recognition Bolin Ni Houwen Peng Minghao Chen Songyang Zhang Gaofeng Meng Jianlong Fu Shiming Xiang Haibin Ling VLM CLIP ViT 96 325 0 04 Aug 2022
Zero-Shot Temporal Action Detection via Vision-Language Prompting Sauradip Nag Xiatian Zhu Yi-Zhe Song Tao Xiang VLM 64 67 0 17 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection H. Rasheed Muhammad Maaz Muhammad Uzair Khattak Salman Khan Fahad Shahbaz Khan ObjD VLM 93 154 0 07 Jul 2022
Universal Prototype Transport for Zero-Shot Action Recognition and Localization Pascal Mettes 63 5 0 08 Mar 2022
Prompting Visual-Language Models for Efficient Video Understanding Chen Ju Tengda Han Kunhao Zheng Ya Zhang Weidi Xie VPVLM VLM 77 376 0 08 Dec 2021
Elaborative Rehearsal for Zero-shot Action Recognition Shizhe Chen Dong Huang VLM 65 96 0 05 Aug 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation Xiuye Gu Tsung-Yi Lin Weicheng Kuo Yin Cui VLM ObjD 267 915 0 28 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 866 29,341 0 26 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu H. Pham Quoc V. Le Yun-hsuan Sung Zhen Li Tom Duerig VLM CLIP 438 3,839 0 11 Feb 2021
Language Models are Few-Shot Learners Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 739 41,894 0 28 May 2020
Temporal Interlacing Network Hao Shao Shengju Qian Yu Liu 56 93 0 17 Jan 2020
Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition Devraj Mandal Sanath Narayan Sai Kumar Dwivedi Vikram Gupta Shuaib Ahmed Fahad Shahbaz Khan Ling Shao OODD 44 141 0 18 Apr 2019
Zero-Shot Activity Recognition with Verb Attribute Induction Rowan Zellers Yejin Choi 42 52 0 29 Jul 2017
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild K. Soomro Amir Zamir M. Shah CLIP VGen 143 6,147 0 03 Dec 2012