2406.05075
Diving Deep into the Motion Representation of Video-Text Models
7 June 2024
Chinmaya Devaraj
Cornelia Fermuller
Yiannis Aloimonos
DiffM
VGen
Papers citing "Diving Deep into the Motion Representation of Video-Text Models"
4 / 4 papers shown
Text-Only Training for Image Captioning using Noise-Injected CLIP
David Nukrai, Ron Mokady, Amir Globerson
VLM, CLIP
60, 94, 0
01 Nov 2022

Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Wenhao Wu, Zhun Sun, Wanli Ouyang
VLM
103, 93, 0
04 Jul 2022

ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang, Jiazheng Xing, Yong Liu
VLM
152, 362, 0
17 Sep 2021

The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation
Marzena Karpinska, Nader Akoury, Mohit Iyyer
220, 106, 0
14 Sep 2021