Inference Compute-Optimal Video Vision Language Models

Papers citing "Inference Compute-Optimal Video Vision Language Models"

29 / 29 papers shown
Title
Video Instruction Tuning With Synthetic Data
Yuanhan Zhang, Jinming Wu, Wei Li, Bo Li, Zejun Ma, Ziwei Liu, Chunyuan Li
99
192
0
03 Oct 2024
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Yanwei Li, Ziwei Liu, Chunyuan Li
108
775
0
06 Aug 2024