Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.16412
Cited By
Text-Enhanced Zero-Shot Action Recognition: A training-free approach
29 August 2024
Massimo Bosetti
Shibingfeng Zhang
Bendetta Liberatori
Giacomo Zara
Elisa Ricci
Paolo Rota
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Text-Enhanced Zero-Shot Action Recognition: A training-free approach"
17 / 17 papers shown
Title
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Kanchana Ranasinghe
Michael S. Ryoo
SSL
VLM
77
12
0
20 Jul 2023
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
VLM
58
74
0
12 Jun 2023
What does a platypus look like? Generating customized prompts for zero-shot image classification
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
160
223
0
07 Sep 2022
Expanding Language-Image Pretrained Models for General Video Recognition
Bolin Ni
Houwen Peng
Minghao Chen
Songyang Zhang
Gaofeng Meng
Jianlong Fu
Shiming Xiang
Haibin Ling
VLM
CLIP
ViT
96
325
0
04 Aug 2022
Zero-Shot Temporal Action Detection via Vision-Language Prompting
Sauradip Nag
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
VLM
64
67
0
17 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
93
154
0
07 Jul 2022
Universal Prototype Transport for Zero-Shot Action Recognition and Localization
Pascal Mettes
63
5
0
08 Mar 2022
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM
VLM
77
376
0
08 Dec 2021
Elaborative Rehearsal for Zero-shot Action Recognition
Shizhe Chen
Dong Huang
VLM
65
96
0
05 Aug 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Nayeon Lee
Weicheng Kuo
Huayu Chen
VLM
ObjD
267
915
0
28 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
866
29,341
0
26 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
438
3,839
0
11 Feb 2021
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
739
41,894
0
28 May 2020
Temporal Interlacing Network
Hao Shao
Shengju Qian
Yu Liu
56
93
0
17 Jan 2020
Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition
Devraj Mandal
Sanath Narayan
Sai Kumar Dwivedi
Vikram Gupta
Shuaib Ahmed
Fahad Shahbaz Khan
Ling Shao
OODD
44
141
0
18 Apr 2019
Zero-Shot Activity Recognition with Verb Attribute Induction
Rowan Zellers
Yejin Choi
42
52
0
29 Jul 2017
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
143
6,147
0
03 Dec 2012
1