Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.08024
Cited By
LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training
16 July 2022
Sumanth Gurram
An Fang
David M. Chan
John F. Canny
VLM
AI4TS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LAVA: Language Audio Vision Alignment for Contrastive Video Pre-Training"
5 / 5 papers shown
Title
Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation
Bolin Lai
Fiona Ryan
Wenqi Jia
Miao Liu
James M. Rehg
EgoV
21
8
0
06 May 2023
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari
Liangzhe Yuan
Rui Qian
Wei-Hong Chuang
Shih-Fu Chang
Yin Cui
Boqing Gong
ViT
248
577
0
22 Apr 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
Self-supervised Co-training for Video Representation Learning
Tengda Han
Weidi Xie
Andrew Zisserman
SSL
215
308
0
19 Oct 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
194
205
0
23 Jan 2020
1